Gene Cthe_0977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0977 
Symbol 
ID4811271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1167087 
End bp1168460 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content40% 
IMG OID640106395 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_001037402 
Protein GI125973492 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000230391 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATTT TAAAATGCGA AGAAGTAGTA AAAGCGGTTG GCGGCACTTT AATATCCGGA 
GAAGTCAATA CTGTTTTTTA TAACATTTCC ACCGATTCGA GGAATATAAA ACAGGGAGAT
TTGTTTATTC CTCTTATTGG AGAAAGATTT GACGGACACA ACTATATTGC GTCTGCTTTG
GAGCATGGAG CTCTGGGCAG CCTTACTCAA AAGGAAACGG AACCATTTCC CGGCAAAGTT
TTAATAAAGG TTTCGGATAC ACTTAAGGCT TTAAGGGATC TTGCGGTGTA TTACAGACAA
AAATTCAAGA TCCCTTTTGT AGGAATTACC GGAAGTGTGG GGAAAACAAG CACCAAGGAA
ATGGTTGCGG CAGTGCTGTC AAAAGGCTTC AAGGTGCTGA AGAACCAGGG TAATTTTAAC
AATGAAATCG GTGTGCCTCT TACAATTTTC AACCTTGACA AATCCCACGA GGCTGCCGTT
GTGGAAATGG GCATGAGCGG TTTTGGCGAA ATAAGCCGTC TTACGTCTAT AGTAAAACCT
GATATTGCAA TAATAACCAA TATAGGAGTA TCCCATATAG AAAAGCTGGG TTCAAAAAAT
AACATATTAA AAGCAAAAAT GGAGATTTTT GAGGGCTTAA ATGAGAAAGG ATTGGCAATA
CTAAATGGTG ATGACAAATT ATTGTATGGA TTAAACAACC TTTTGAAGTT CAGGACGGTA
TTTTACGGAA TGGAGGAAGG GCTGGATTTG CGGGCATACA ATGTGGAATC TTTGGGGGAA
AAAGGCTCCA CTTTTGATAT TGAGATTAGA GGAAAAGAAT ACAGGGTGAG AATTCCTGTG
CCGGGAATTC ATAATGTTTA CAATGCCCTT GCAGGTATAG CAGTGGGAAT TGAGCTTGGA
ATACCGCCGG AGAAAATAGT TGAGGGAATT GAAGAGTTTT CCCCGGGAAA GATGAGACTT
GATATTATAA ACTATAACGG TTTAAAGATT ATAAACGATG CTTACAATGC AAGTCCCCAG
TCAATGGAGG CGGCTATTGA CGTTTTAAAG GATATATCCG GTGAAGGCAG AACCTTTGCC
GTATTGGGTG ACATGCTTGA GCTGGGGGAA TTCTCTAAAA GTGCTCATAT GGAAGTGGGA
AAATATGCTG CTTCAAAAGG GATAGATTAT ATTGTGGCCG TGGGAGAGTA CAGAAGTAAT
ATTGTCCGTG GTGCCGTTGA AGCAGGAGCA AAAGAAGAAA AGGTTTTTGA ATTTAAAGAC
AATATGGATG CCGCAAAGTT TTTAAAAGAA TTTGTAAAGA GCGGTGATGT GCTTCTTGTA
AAGGGTTCGA GAGGTATGAA AATGGAGGAA ATAGTTAATA TATTGACTGG CTGA
 
Protein sequence
MEILKCEEVV KAVGGTLISG EVNTVFYNIS TDSRNIKQGD LFIPLIGERF DGHNYIASAL 
EHGALGSLTQ KETEPFPGKV LIKVSDTLKA LRDLAVYYRQ KFKIPFVGIT GSVGKTSTKE
MVAAVLSKGF KVLKNQGNFN NEIGVPLTIF NLDKSHEAAV VEMGMSGFGE ISRLTSIVKP
DIAIITNIGV SHIEKLGSKN NILKAKMEIF EGLNEKGLAI LNGDDKLLYG LNNLLKFRTV
FYGMEEGLDL RAYNVESLGE KGSTFDIEIR GKEYRVRIPV PGIHNVYNAL AGIAVGIELG
IPPEKIVEGI EEFSPGKMRL DIINYNGLKI INDAYNASPQ SMEAAIDVLK DISGEGRTFA
VLGDMLELGE FSKSAHMEVG KYAASKGIDY IVAVGEYRSN IVRGAVEAGA KEEKVFEFKD
NMDAAKFLKE FVKSGDVLLV KGSRGMKMEE IVNILTG