Gene Lcho_0602 details


Gene Information

Locus tag: Lcho_0602
Symbol:
ID: 6159880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 651711
End bp: 653261
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 72%
IMG OID: 641663352
Product: protease Do
Protein accession: YP_001789642
Protein GI: 171057293
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
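
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 651711
END_BP = 653261


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1551 516
```

Both values agree with the Gene Length and Protein Length fields of the record.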


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000166487
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTGAAC ATTCGTCGCG CACGAGCGTG CACAAGCGAT CCGATTCCAC CCCTCGCAGC 
GTTGCCGCCC TGCGCTGGCT CGCCCCCCTG CTGACGGCGG CGTGCCTGCT GCCGACACCG
GCCCTGGCCC AGTCGGGCGG CGCGCAGGCT GCAGCTGCCC AGGCCGGCGC GGCGGCACCG
CTGGTGCGCG GACTGCCCGA CTTCACCGAA CTGGTCGAGC AGGTCGGCCC GGCGGTGGTC
AACATCCGCA CCACCGCCAA GGCCCGCACG GCGCGCTCGG ACAACCCGGC CGACGAGGAG
ATGCAGGAGT TCTTCCGCCG CTTCTTCGGC GTGCCGATCC CGCGCCAGGG CCCACGTCAG
GGGCCGCCCG GCCAGGGCCA GAGCGAAGAA GAAGCGGTGC CGCGCGGCGT CGGCTCGGGC
TTCATCGTCA GCAGCGACGG CTTCGTGATG ACCAACGCGC ATGTGGTCGA GGGCGCCGAC
GAAGTCACGG TGCGCCTGAC CGACAAGCGC GAGTTCAAGG CCCGCGTGGT GGGCGCCGAC
AAGCGCACCG ACATCGCGGT GCTCAAGCTC GACGCCACCG GCCTGCCGGC GGTGCGCCTG
GGCGACGTCA GCCGTCTCAA GGTCGGCGAA TGGGTGATCG CGATCGGCTC GCCCTTCGAT
CTCGACAACA CGGTGACGGC CGGCATCGTC AGCGCCAAGG CGCGTGACAC CGGCGACCTG
GTGCCGTTCA TCCAGACCGA CGTGGCGATC AACCCCGGCA ACTCCGGCGG GCCGCTGATC
AACCTGCGCG GCGAGGTGGT GGGCGTGAAC TCGCAGATCT ACAGCCGCTC GGGCGGCTAC
ATGGGCATCT CGTTCGCGAT CCCGATCGAC GAGGCCAGCC GCGTGGCCGA CCAGCTGCGC
ACCAGCGGCC GGGTGGTGCG CGGGCGCATC GGCGTGCAGA TCGGCGAGGT CACCAAGGAC
GTGGCCGAGT CGCTCGGCCT GGGCAAGGCG GCCGGCGCGC TGGTGCGCTC GGTCGAGGAC
GGCAGCCCGG CCGGCAAGGC GGGCCTGGAA GCCGGTGACA TCGTGACGCG CTTCGACGGC
AAGCCGGTCG AGAAGTGGAA CGACCTGCCG CGCCTGGTCG GCAAGACCGC ACCGGGCACC
AAGACCACGA TCCAGGTGTT CCGCCGCGGC AGCATGCGCG ATCTCAGCGT CACCGTCGCC
GAGCTCGAAG CCGAAGCCGC CGCCAGGCCG GCCAGCACCG AGCCCGCACC GGCCAAGCCG
GCCGCACCCG CCACGGTCAG CCTGCTGGGC CTGACGGTGA GCGACCTGAG TGCCAAGCAG
CGCGAGGAGC TCAAGGTCAA GGGCGGCGTG CGGGTCGACG CGGTGGACGG CGCGGGCGGG
CGGGCCGGCC TGCGCGAGGG CGACATCATC CTGGCGGTGG CCAACACCGA GATCACCAAC
CTGCGCCAGT TCGAGGCGGT GGTCGGCAAG CTCGACAAGA GCAAGCCGGT CAACCTGCTG
TTCCGCCGCG GCGAGTGGGC CCAGTACGCG GTGATCCGGC CCGGCAAATG A
 
Protein sequence
MSEHSSRTSV HKRSDSTPRS VAALRWLAPL LTAACLLPTP ALAQSGGAQA AAAQAGAAAP 
LVRGLPDFTE LVEQVGPAVV NIRTTAKART ARSDNPADEE MQEFFRRFFG VPIPRQGPRQ
GPPGQGQSEE EAVPRGVGSG FIVSSDGFVM TNAHVVEGAD EVTVRLTDKR EFKARVVGAD
KRTDIAVLKL DATGLPAVRL GDVSRLKVGE WVIAIGSPFD LDNTVTAGIV SAKARDTGDL
VPFIQTDVAI NPGNSGGPLI NLRGEVVGVN SQIYSRSGGY MGISFAIPID EASRVADQLR
TSGRVVRGRI GVQIGEVTKD VAESLGLGKA AGALVRSVED GSPAGKAGLE AGDIVTRFDG
KPVEKWNDLP RLVGKTAPGT KTTIQVFRRG SMRDLSVTVA ELEAEAAARP ASTEPAPAKP
AAPATVSLLG LTVSDLSAKQ REELKVKGGV RVDAVDGAGG RAGLREGDII LAVANTEITN
LRQFEAVVGK LDKSKPVNLL FRRGEWAQYA VIRPGK
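
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal pure-Python version under stated assumptions: the codon table is built from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence. The 1500 bases before the
# last line are a multiple of 3, so that line is also in frame.
first_line = "ATGTCTGAAC ATTCGTCGCG CACGAGCGTG CACAAGCGAT CCGATTCCAC CCCTCGCAGC"
last_line = "TTCCGCCGCG GCGAGTGGGC CCAGTACGCG GTGATCCGGC CCGGCAAATG A"

print(translate(first_line))  # MSEHSSRTSVHKRSDSTPRS
print(translate(last_line))   # FRRGEWAQYAVIRPGK
```

The two results match the first 20 and last 16 residues of the protein sequence listed above.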