Gene Cthe_3047 details


Gene Information

Locus tag: Cthe_3047
Symbol:
ID: 4811119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3572361
End bp: 3573773
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 47%
IMG OID: 640108468
Product: peptidoglycan glycosyltransferase
Protein accession: YP_001039436
Protein GI: 125975526
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0768] Cell division protein FtsI/penicillin-binding protein 2
TIGRFAM ID:
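
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers only agree under that reading) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record; the variable names are illustrative.
start_bp = 3572361
end_bp = 3573773
gene_length_bp = 1413
protein_length_aa = 470

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons. Assumption: the last codon is the stop
# codon, which does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # coordinate span matches Gene Length
print(span % 3 == 0)                  # length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop matches Protein Length
```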


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000101617
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGAGA ATAAAAAGAT TATTCAGGTA CTGGTTGCAA TATGTTTTCT CTTTTTCATT 
ATAGTCGGCT ATCTTACGTA TATTCAGCTG TTCAGAAGCC GGGATTTAAT GGCCAATGTA
TATAACAGGA GGCAATATAA AATAGAGGAG AATACGGCAA GGGGCAACAT ATATGACAGG
AACGGAGTCT TGCTTGCTTA CAGTGAAGCA AACGGTGAGG TGCAGGAGAG AATTTACCCC
TATGGGGCAC TCTACAGCCA GGTAATCGGT TACAGCTCAA AAGTGTATGG GAAATCGCAG
ATTGAAGCCG CTTACAATAA TGTGCTGCTG GGGATTGACG ATTTGAGCCA GGTGTTTGGA
ATGGTCAGCG GTTCCAACTA TCCCACACGG AAGGGAAACA ATCTTTATCT GACCATTGAC
CATAAGCTGC AGGCTCTTGG GGGAGAGCTG CTTAACGGAC GAAAGGGAGC CGTGGTGGCC
ATGGACCCTA AAACGGGGGA AGTTCTTGCT TTGGCAAGCA GTCCGAATTT TGACCCCCAC
GCGAAAAAAC TTGAGGAAAA CTGGCAGAGT ATGATTGAAT CCCCGGATGC TCCTTTTCTT
TGCCGTGCCA CCCAGGGACT GTATGCCCCG GGTTCTACTT TTAAAATACT GGTTACTGCC
GCGGCCGTTG AGAAAGGACT GGAAAGTAAA GTGTTTGATG ACAACGGTTC TGTAGTTATT
GACGGAAGGG AAATAAGAAA CTCGGAAAGC AGAGCATACG GCAAGATTGA TTTGAAGAGG
GCCCTTGCGG TGTCAAGCAA TGTGGTGTAC GCCCAGCTGG GGACGGAGCT TGGCATGGAA
AGCTTTGTGG ATATAACATA CCGTGCCGGA TTTGAGAAGG AAATTCCTTT TGACATTCCG
ACGAGCAAAA GCCGTTTTCC TTATGAAAAC ATGAATAAAA TCGATTTGGC CGAAGCGGCT
ATAGGCCAGG GCAAGGTATT GGTAAGTCCC CTTCACATGG CAATGATTAC ATCGGCTATT
GCCAACGAAG GTGTCATGAT GGAGCCGGTA CTGGTTAAAA GCATTACAAA TTCCGAGGGC
AAAACGACTA AAGAGCTAAA ACCTGTGAAG CTTGGCAATG TAATGGAGAA AAGTGTCGCT
GAGAAAATTA AAATAATGAT GCAGGAGGTT GTGACTTCCG GTACCGGGCA TAATGCGGCG
ATTAAAGGAA TTAATGTTGC CGGCAAGACC GGAACGGCGG AAAACGAGCT GTCGGTTAAA
AAGAAGGCAA AGGCACATTC GTGGTTTGTA GGTTTTGCGC CGGCTGAAGA CCCCAAAATT
GCTGTGGCTG TCATCGTTGA GTACGGAGGT TCCGGCGGTG ATGTTGCGGC TTCCATTGCA
AGGAGGATAA TGAGCGAATA CCTTTCCAAT TAA
 
Protein sequence
MDENKKIIQV LVAICFLFFI IVGYLTYIQL FRSRDLMANV YNRRQYKIEE NTARGNIYDR 
NGVLLAYSEA NGEVQERIYP YGALYSQVIG YSSKVYGKSQ IEAAYNNVLL GIDDLSQVFG
MVSGSNYPTR KGNNLYLTID HKLQALGGEL LNGRKGAVVA MDPKTGEVLA LASSPNFDPH
AKKLEENWQS MIESPDAPFL CRATQGLYAP GSTFKILVTA AAVEKGLESK VFDDNGSVVI
DGREIRNSES RAYGKIDLKR ALAVSSNVVY AQLGTELGME SFVDITYRAG FEKEIPFDIP
TSKSRFPYEN MNKIDLAEAA IGQGKVLVSP LHMAMITSAI ANEGVMMEPV LVKSITNSEG
KTTKELKPVK LGNVMEKSVA EKIKIMMQEV VTSGTGHNAA IKGINVAGKT GTAENELSVK
KKAKAHSWFV GFAPAEDPKI AVAVIVEYGG SGGDVAASIA RRIMSEYLSN
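
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the blocks might be joined, the GC fraction computed, and the gene translated. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. The codon table is written out by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. That difference does not matter here because the gene begins with ATG.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last printed lines of the gene sequence, copied from the record.
first_line = "ATGGATGAGA ATAAAAAGAT TATTCAGGTA CTGGTTGCAA TATGTTTTCT CTTTTTCATT"
last_line = "AGGAGGATAA TGAGCGAATA CCTTTCCAAT TAA"

print(translate(clean_sequence(first_line)))  # MDENKKIIQVLVAICFLFFI
print(translate(clean_sequence(last_line)))   # RRIMSEYLSN
```

The two outputs match the start and end of the protein sequence listed in the record. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared against the reported GC content.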