Gene Lcho_1386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1386 
Symbol 
ID6159841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1470768 
End bp1471976 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID641664140 
Producthypothetical protein 
Protein accessionYP_001790419 
Protein GI171058070 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCA GGGCGATCCG GACACTGCAG GAACTGGCGC CGGTGCGCGC GCGCTGGCAG 
CAATGGCAGG ACCACGTCAA CAACGATCTG GCGCAGTTCG AGCTGGTGTG CCGGCACCGC
ACGGAAGTCG AATCGCCGTG CGTGATCGTG ATCGAGCAGG ACAGCGAAAA CAACCGCGAC
AGCGGCCCCG ATGCGCTGCT GCTCGGGCGC ATCGAGTGCA ACCCCTTCGC GCCGTCGATC
GGCTATCTGC AGCCGGTGCG CATGCCCGCG CGGGTGCTGG TCGTGATCCA TCAGGGCTTG
CTCGGCAAGC TCGACGACGC AGCCGCCGGC GAGGTCATCG GCTACCTGCG ATCGCTGTTG
CGCAGCGGTG TGGCCGATGC GGTGGCCTTT CATCATCTGC CGGAACACTC GCCGCTGTGG
CAGGCGCTGC AGATCGAGCG CGACACAAGG CTGAGCGTGA AGGCGCCGAG GTGGGCAACC
CACCACGAGA TGCGGCTGCC CGACGACGGC CGCTCGGTCG ACAGCAAGCT CAGCGCCAAG
CACCGCAGCA ACATGCGCCG CCATCAGAAG GACCTCGAGG CGGCTTTCCC GGGCCGGGTG
GTCTGGCGCT GGATGAACGC CGTCGACGAC ATCGCCGCGC TGTGCGCACA CCTGGAGCCG
TTGGCTGCAC GCACCTATCA GCGCGCGCTG GGTGTCGGCT TCTTCGACGA CGACGACCAC
CGGCGCCGCT ACGAGCTGTT TGCGCGCCGC GGGCAATTGC GGGTGCAGCT GCTGGAGATC
GACTCGCAGG TGCGGGCCTT CTGGATCGGC TCGATCTATG CGGATGTCTT CAACCTGTCC
GAGACCGGCT ACGACCCGGA TCTGCGCGAG TTCAAGGTCG GCACCCTGCT GTTCATCCGG
CTGGCCGACG CGCTGGCGCA AGAAGGCGTG CGACGGCTCG ATTTCGGCCT CGGCGACGCA
CCGTACAAGG CGCGCTTCGG CGACCGGAGC TGGCGCGAGA CACCGGCCTG GCTGTTCGCC
CCGACCGCCA GGGGCATGGC CATGATGCTG CTGCTCAAGC TGTCGCTGGC GCTCGACTCC
GGGGCACGGC GCCTGGTGCA GCACGCCGGC CTGACCGACC GGATCAAGAC CGGCTGGCGA
CGCCGCAAGG CTGCGTCCGG CACTCGGCTG ACGCCGAGCC ACCCTGCCAC TGCGAGGGAT
CGAGCATGA
 
Protein sequence
MRIRAIRTLQ ELAPVRARWQ QWQDHVNNDL AQFELVCRHR TEVESPCVIV IEQDSENNRD 
SGPDALLLGR IECNPFAPSI GYLQPVRMPA RVLVVIHQGL LGKLDDAAAG EVIGYLRSLL
RSGVADAVAF HHLPEHSPLW QALQIERDTR LSVKAPRWAT HHEMRLPDDG RSVDSKLSAK
HRSNMRRHQK DLEAAFPGRV VWRWMNAVDD IAALCAHLEP LAARTYQRAL GVGFFDDDDH
RRRYELFARR GQLRVQLLEI DSQVRAFWIG SIYADVFNLS ETGYDPDLRE FKVGTLLFIR
LADALAQEGV RRLDFGLGDA PYKARFGDRS WRETPAWLFA PTARGMAMML LLKLSLALDS
GARRLVQHAG LTDRIKTGWR RRKAASGTRL TPSHPATARD RA