Gene Lcho_3071 details


Gene Information

Locus tag: Lcho_3071
Symbol:
ID: 6162634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3398745
End bp: 3399926
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 60%
IMG OID: 641665846
Product: protein involved in cellulose biosynthesis (CelD)-like protein
Protein accession: YP_001792096
Protein GI: 171059747
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
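The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function and variable names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record; names are illustrative.
start_bp = 3398745
end_bp = 3399926
gene_length_bp = 1182
protein_length_aa = 393


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 3399926 - 3398745 + 1 = 1182 bp, and 1182 / 3 - 1 = 393 aa.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.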


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGCAA CGCAATACGG CATCTCATTT GATGCGCTTC CGCCCCTGAA CGAACTTGAA 
GCTCTCTGGC GCGATCTTGA GTTGCGCGCT CCAGATGCCA GTTTCTTCAA CTCGTGGTCT
TGGATCGGCT GTTGGCTCGA GTTGCTGCCG GACCAGTTCG AGCGTCGTCT GCTCAAGGCC
GTCTCTGGTG GACGTGTCGT TGGGCTCGGT GTCCTGGTGC GCAACACGCG AAAGTTGGGC
GGAATGCCGT TCTGCACGGC TTGGCACCTG CACGCTGCGG GAGATCCCAT CTACAACGGT
GCGATGGTCG AGCACAATGA TTTTCTGCTG GACGGTCAAC ATGGCGACGC CTTGCGGGAG
GCCCTTGTCA AGCGGTGGGC CGACTGCGTG GGCGCCGGTC AAGAGTTGCA CCTGCCCGGT
CTCGAAGGGC ACGGCTACTC TGCCGAAGTG AGCGGAAACC TGGAGCGTCA CGATGAGCAG
CGCATGTCCT ATGCGATTGC GCTCGAACCT GTTCGAGCGC ACAAGCTTGA TTTCACGCCT
TTGGTGAGTG GCCACGCTCG GCGGTTCATC CGTCGCAGCA TCAAGGAGTA TCAGACCCTG
GGTCCGATCG AGGTGACTGT CGCCGTTGAT GTTGAACAGG CACTGAGTTT CTTCGACAAG
ATGGTGGCCC TGCATCAGGA TCGCTGGGCG GCTCTCGGCG AAGATGGCTC ATTCAAGAGC
GAATTCCGGT TCCAACTCCA TCGGCTGGTC ATTGCGCGTC AGTTGGCGCG GGGCGAAATC
CAGATGCTGC GGGTCCGAGC CGGTGAGCGG GATGTCGGGT ATCTCTACAG TTTCATACGA
GGGAAGCGAC TTTACGTCTA TCAGTCCGGT TTCGATTACA CCGTGCTGGA GAAGCACGGC
CGTCCCGGCT TGGTGACTCA TACCTTGGCG GTGCAGCACA ACGCGGCTCT CGGCTTCGAT
GTCTATGACT TGATGGCCGG TGAATCGCAG TACAAGTCCA CCATCTCGAC GGTGCACGAG
ACGTTGACAT GGTCGGTCTG GCGCAAGCCC GCGATCCGGT TCGCGGTCGA GCGACAACTC
CGCAGTGCTG TTGGAAGCTA TCGACGCTGG CGTGCTGCGC GAGTCGATAA GGCCTCGGGT
CCCGCTCAGG AAGAAGCCAG ACAGGCTGCC GAGGAGGCAT GA
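
The GC content field reported for this gene could be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown in this record; `gc_content` is a hypothetical helper name, and only the first row of the sequence is used here for brevity.

```python
def gc_content(seq):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste the full sequence
# to compare against the GC content listed in the record.
first_row = "GTGGCAGCAA CGCAATACGG CATCTCATTT GATGCGCTTC CGCCCCTGAA CGAACTTGAA"
print(round(gc_content(first_row), 1))
```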
 
Protein sequence
MAATQYGISF DALPPLNELE ALWRDLELRA PDASFFNSWS WIGCWLELLP DQFERRLLKA 
VSGGRVVGLG VLVRNTRKLG GMPFCTAWHL HAAGDPIYNG AMVEHNDFLL DGQHGDALRE
ALVKRWADCV GAGQELHLPG LEGHGYSAEV SGNLERHDEQ RMSYAIALEP VRAHKLDFTP
LVSGHARRFI RRSIKEYQTL GPIEVTVAVD VEQALSFFDK MVALHQDRWA ALGEDGSFKS
EFRFQLHRLV IARQLARGEI QMLRVRAGER DVGYLYSFIR GKRLYVYQSG FDYTVLEKHG
RPGLVTHTLA VQHNAALGFD VYDLMAGESQ YKSTISTVHE TLTWSVWRKP AIRFAVERQL
RSAVGSYRRW RAARVDKASG PAQEEARQAA EEA
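
The protein sequence corresponds to a translation of the gene sequence under translation table 11. Note that the gene begins with GTG, which is read as methionine (M) when it serves as the initiation codon. The sketch below is one way to reproduce this with the standard library only; the codon-to-amino-acid assignments are those of the standard code, which table 11 shares, and the set of initiation codons is an assumption based on the NCBI description of table 11. The name `translate_cds` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("unexpected initiation codon: " + codons[0])
    protein = ["M"]  # the initiation codon is translated as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGGCAGCAA CGCAATACGG CATCTCATTT"))  # MAATQYGISF
```

Passing the full gene sequence in place of the 30-base fragment should give the complete translation for comparison with the protein sequence listed here.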