Gene Cthe_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1041 
SymbolmurD 
ID4811338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1244775 
End bp1246175 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content42% 
IMG OID640106462 
ProductUDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase 
Protein accessionYP_001037466 
Protein GI125973556 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000538444 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATACCA AATTAAATGA CTTTAAAAAG AAAATTAAAA ATAAAAAAGT TGCCGTACTG 
GGTATTGGAA TCAGCCATAC TCCGTTAATT TCATATTTGT ACCGGCTTGG GGCGGACATA
ACGGCTTTCG ACAAAGCGGA TGAGGTTAAG CTTAAGTCCA CATTGGAACA GTTTAAAGGA
ATGGATATTA AATATAGCCT GGGCGAAGGT TATCTTGACA ATCTGAAAGG CTTTGACATA
TTGTTCAGGA CTCCTGGTAT GAGGTATGAT ATACCTGAGA TTCTTGCCGC AAAGGAGGAA
GGAACTGAGG TAACTTCAGA AATGGAAGTA TTTTTTGAAC TTTGCCCGGC CGAAATTTTT
GCCGTGACAG GCAGCGACGG TAAGACCACA ACCACCACGC TTATATACAA CATGCTGAAA
GAGCAGGGAT ATAAGTGCTG GCTTGGTGGA AATATAGGCA TACCTTTGCT CAGCAAAATT
GAAGAGATAA AGGATACGGA CAAAGTTGTC CTGGAGCTTA GCAGTTTTCA GCTGCACACC
ATGACTAAAA GCCCGAATGT TGCGGTTGTG ACAAATGTTT CGCCCAACCA CCTGGATGTA
CATAAATCCA TGGAGGAGTA TGTATCTGCC AAAAAGAATA TTTTTAGATA CCAGTCGGCG
GAGGATAGGC TGGTACTTAA TTTTGACAAT GACATTACCC GGGAATTTGC CGGTGAGGCA
AAAGGAGATG TAGTTTATTT CAGCAGAAAA ACCGTACTTG AAAAGGGTGC TATGCTAAAA
GACGATATGC TGGTTTTCAG AGACGGAGAA ACTGAAACTG AAATTGCTAA AGCAAGTGAC
ATCGTAATTC CGGGAGTCCA TAACGTTGAA AACTTTCTTG CGGCCACCGC CGCGGTAATA
GACTGTGTCG ACAGGGATGT CATAAGAAAA GTTGCAACCA CCTTTACCGG GGTTGAACAC
AGGATTGAAC TTGTAAGGGA GATCAACGGT GTAAAATTTT ACAATGATTC CATAGCAAGT
AGCCCCACCA GAACGATTGC AGGATTGAAT TCATTTAAAG ATAAAGTAAT TCTGATTGCC
GGAGGCTATG ATAAAAAAAT ACCTTATGAT GCTTTAGGAC CGGTAATTGC GGAAAAGGTA
AAGTGCCTTG TGTTAATTGG ACAGACGGCT CCGAAAATTG AAAAGGTATT AAGGGATGAG
ACGGAAAGGT CGGGAAAAGG CTCCGATATT CCGATAAAAA AATGTACCTC TCTTGAGGAA
GCCGTTAAGG TGGCTTACCG TTTTGCTTCC GTCGGGGACG TAGTAATTTT GTCTCCTGCA
AGCGCCAGCT TTGATATGTT TAAGAATTTT GAAGAGAGGG GCAATAGATT TAAAGAAATT
GTCAACTCAA TTGAAGCCTG A
 
Protein sequence
MNTKLNDFKK KIKNKKVAVL GIGISHTPLI SYLYRLGADI TAFDKADEVK LKSTLEQFKG 
MDIKYSLGEG YLDNLKGFDI LFRTPGMRYD IPEILAAKEE GTEVTSEMEV FFELCPAEIF
AVTGSDGKTT TTTLIYNMLK EQGYKCWLGG NIGIPLLSKI EEIKDTDKVV LELSSFQLHT
MTKSPNVAVV TNVSPNHLDV HKSMEEYVSA KKNIFRYQSA EDRLVLNFDN DITREFAGEA
KGDVVYFSRK TVLEKGAMLK DDMLVFRDGE TETEIAKASD IVIPGVHNVE NFLAATAAVI
DCVDRDVIRK VATTFTGVEH RIELVREING VKFYNDSIAS SPTRTIAGLN SFKDKVILIA
GGYDKKIPYD ALGPVIAEKV KCLVLIGQTA PKIEKVLRDE TERSGKGSDI PIKKCTSLEE
AVKVAYRFAS VGDVVILSPA SASFDMFKNF EERGNRFKEI VNSIEA