Gene Lcho_3070 details


Gene Information

Locus tag: Lcho_3070
Symbol:
ID: 6162633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3397431
End bp: 3398591
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 61%
IMG OID: 641665845
Product: cellulose biosynthesis protein CelD
Protein accession: YP_001792095
Protein GI: 171059746
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
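
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 3397431
end_bp = 3398591


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.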


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAACG GCTGGCAGTT TTCGTGGTCG AGCGGTATCG CAGGTTTGAG GGAACTGGCG 
CCAGCATGGC AGGCGCTGGC TGACTCGCTC CCCGATGCCG AGTACTTTCA ACGTCCTCAG
TGGTTTCATG CGCATCAGGC GATAAACGAA AATCCGGAAA AATCGATTTG GGTTTCCGTT
CATCACGAGG GCCAGTTGAA GGCTGTGTTT GCGTTGCAGT CCGTGGTGCG AAAGGTGGGG
CCGCTGCGTG TGCCCGAACT TCGCTTTGTC AATCACGGGC ACATGACACT GTCAGATGTC
TGTGCAGATC GGGCCGATGT GACACTGTGG CCCGCGTTCT GGAATTGGTT GCAGGGGCGT
GATGCGCCCG AGTGGGACCG GTTTGTCTTG CCTCAGATTC CCGCTGATGG CGTCATGGCG
GCTTGGTTGC AGCACTTCGC GCCGCAGCGG ATGTTGCACT CAGTGGCTTC CAGCAGTGCG
CGCGTCGACT GCCGGCGCTC GATGGAAGAA CTGCTGAAGT CATGCAGTGC AAATCACCGC
AGCAGTGTGT CACGGGGGGG CAAGCGCGCA GAAGCCCTGG GTCCTCTTCG GTATGAACTT
GCCCGCAGCC CTCAGGATCT GGCTCGTCTG ATGCCGATTT TCCTCGCCAT CGAGGCGTCA
GGATGGAAGG GTGCGGCGGG CAGTGCGGTG GCCAGCAACC CGGCCTTGAT GCGGTTCTAC
AACGCTCTGC TGGACGGATT CGGCTCGCGC GGCCAGTGTG AAATCGACGT GCTCCATGTC
GGTGAACGTC CGGTTGCGTC GGTGCTCTGG TTTCGAACCG GGCGTCAGAT CCACCTGCAG
AAGATCGGCT ATCTGGAGGA ACTCTCGCAG ATCGGCCCCG GCAAGCTGCT CTTGCGCGAG
ACCTTCAAGC GGGCCTGTGA AGATCCGGAA CTGGATCGTC TGTGTTTCAT CACACATCCG
GCATGGGCCG ATCCCTGGCG GCCGGAGGGC AATCCCGTGC TGGAGTTCAC CTTGTTCCGG
GACAACTGGC GGGGTCTGGT GCTCTACCAG TTGAACAAGG CAAAACGGGC CCGGGCAGCT
CGACTCGGGC AGGCGCAGGC ACGCAAGAAC CCTGGTCCCG AGAATTCGGA GCACTCGCCT
GAGGCGATTG CCGAACGATA G
 
Protein sequence
MSNGWQFSWS SGIAGLRELA PAWQALADSL PDAEYFQRPQ WFHAHQAINE NPEKSIWVSV 
HHEGQLKAVF ALQSVVRKVG PLRVPELRFV NHGHMTLSDV CADRADVTLW PAFWNWLQGR
DAPEWDRFVL PQIPADGVMA AWLQHFAPQR MLHSVASSSA RVDCRRSMEE LLKSCSANHR
SSVSRGGKRA EALGPLRYEL ARSPQDLARL MPIFLAIEAS GWKGAAGSAV ASNPALMRFY
NALLDGFGSR GQCEIDVLHV GERPVASVLW FRTGRQIHLQ KIGYLEELSQ IGPGKLLLRE
TFKRACEDPE LDRLCFITHP AWADPWRPEG NPVLEFTLFR DNWRGLVLYQ LNKAKRARAA
RLGQAQARKN PGPENSEHSP EAIAER
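
As a quick consistency check between the two sequences, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and the codon table is built from the standard NCBI amino-acid string, whose codon-to-amino-acid assignments are shared by translation table 11.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 nucleotides, copied from the gene sequence above.
first_30 = "ATGTCGAACG GCTGGCAGTT TTCGTGGTCG"
last_21 = "GAGGCGATTG CCGAACGATA G"

print(translate(first_30))  # MSNGWQFSWS
print(translate(last_21))   # EAIAER*
```

The first ten residues and the final six residues match the protein sequence listed above, with the terminal TAG read as the stop codon.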