Gene Lcho_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1787 
Symbol 
ID6161481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1929502 
End bp1930722 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content71% 
IMG OID641664549 
Producthypothetical protein 
Protein accessionYP_001790819 
Protein GI171058470 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000000418833 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACATT TCGACGTGGT GGTGGTCGGC GCAGGTGCGG CCGGCCTTTT TTGTGCGGGC 
GTCGCCGGCC AGCGGGGCCT GCGCGTGCTG CTGCTGGACC ACGCCCCCAA GCTGGCCGAG
AAGATCCGCA TCTCCGGCGG CGGGCGCTGC AACTTCACCA ACCGTGAGGC TGCGCCCGCC
AACTTCCTGT CCGACAACCC GCACTTCTGC CGCTCGGCGC TGGCGCGCTA CGGCGCCGCC
GACTTCATCG CGCTGGTGCG CCGCCACGGC ATCGCCTTCC ACGAAAAGCA CCGCGGCCAG
CTGTTCTGCG ACCACAGCGC CGAAGACATC ATCACGATGC TGCTGCGCGA ATGCGAAGCC
GGCGCGGTGG TGCGTCGCCA GCCGTGCCGC GTGCAGGCCG TGCGCCATGT TGCCGACGGC
CATGAACTCG ACACCGACGC CGGGCCGGTG CGCACGCACC AGCTCGTCAT CGCCACCGGC
GGCCTGCCGA TCCCCAAGAT CGGCGCCACC GACTGGGGCC TGCGGCTGGC CGAGCGCAGC
GGCCACCGCA TCGTCGCCCC GCGTCCGGCG CTGGTGCCGC TGACCTTCGA CGCGCAGACC
TGGGCGCCCT ACGCGGCGCT CGCCGGCCTG GCCTTGCCGG TGCAGATCAG CACCGGCAGC
GGCAAGCAGC GCACGGTGTT CCAGGAGGAT CTGCTGTTCA CGCACCGCGG CCTGAGCGGC
CCGGCGGTGC TGCAGATCTC GAGCTACTGG CGCCCCGGCC AGGCGCTGAG CATCGACCTG
ACCCACGGCG GCGATCTGGG TGGCGCCCTG CTGGCGGCCA AGCTGACGTC GAAACGGCAG
CTCGGCAACG AACTGGCGCA GCATCTGCCC ACCCGCCTGG CCGATGCCTT CCTGGCCGGA
TCAGGCCTCG ACGCCCACCG CCCGATGCCC GACTGCCGCG ATCGCGACCT GCAACAGCTC
GCACAGCGTC TGCAGGCCTG GCCCATCACG CCCAACGGCG ACGAGGGCTG GAAAAAGGCC
GAGGTGATGG CCGGCGGCGT CGACACCCGC GACCTGAGCT CGCAGACCCT CGCCAGCCGC
CACGTGCCGG GGCTCTATTT CATCGGCGAA GCGGTCGACG TGACCGGCTG GCTGGGCGGC
TACAACTTCC AGTGGGCCTG GTCGAGCGCC TTTGCCTGCG CGCAGTCGCT GCAGCCGCAA
ATGGCGGGCG CGACCGCTTG A
 
Protein sequence
MEHFDVVVVG AGAAGLFCAG VAGQRGLRVL LLDHAPKLAE KIRISGGGRC NFTNREAAPA 
NFLSDNPHFC RSALARYGAA DFIALVRRHG IAFHEKHRGQ LFCDHSAEDI ITMLLRECEA
GAVVRRQPCR VQAVRHVADG HELDTDAGPV RTHQLVIATG GLPIPKIGAT DWGLRLAERS
GHRIVAPRPA LVPLTFDAQT WAPYAALAGL ALPVQISTGS GKQRTVFQED LLFTHRGLSG
PAVLQISSYW RPGQALSIDL THGGDLGGAL LAAKLTSKRQ LGNELAQHLP TRLADAFLAG
SGLDAHRPMP DCRDRDLQQL AQRLQAWPIT PNGDEGWKKA EVMAGGVDTR DLSSQTLASR
HVPGLYFIGE AVDVTGWLGG YNFQWAWSSA FACAQSLQPQ MAGATA