Gene Athe_2039 details

Gene Information

Locus tag: Athe_2039
Symbol: ddl
ID: 7408252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2153766
End bp: 2154860
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 37%
IMG OID: 643716406
Product: D-alanyl-alanine synthetase A
Protein accession: YP_002573889
Protein GI: 222530007
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes
TIGRFAM ID: [TIGR01205] D-alanine--D-alanine ligase
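
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2153766
end_bp = 2154860
gene_length_bp = 1095
protein_length_aa = 364

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the coding sequence ends in one stop codon that is not counted
# in the protein length.
codons = gene_length_bp // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("residues match protein length:", residues == protein_length_aa)
```

Under those assumptions both comparisons hold for the values listed here.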


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000143922
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAGT TAAAAGTTGC TGTTTTATTT GGAGGAGTTT CAACAGAACA CGAAATATCC 
ATAGTTTCGG CAAAATCAAT TATGCAAAAT ATGGACAAAG AAAAATACGA AGTAATTCCA
ATAGGTATAA CAAAAGAAGG AAAATGGCTC TTGTACACAG GCAAAATTGA AGATTTGGAT
AGCAAGTGGA CTCAGTTTTC TATAGAATGT TTTGTCTCTC CTGATAGAAC CAAAAAGGCA
CTTGTAAAAG TAAAAGATAA CGAGGCTACT TTCATCGACA TTGATGTGGT ATTCCCGGTT
TTGCATGGAC TGAATGGTGA GGACGGGACA GTTCAGGGAC TTTTAGAGCT CTCTGGCATA
CCATATGTAG GATGCGGTGT TCTTTCGTCG GCTATATGCA TGGACAAAGC GTTTGCTAAA
AAGCTTGCGC TTTTAGAAGG AATACCACAG GGGCATTTTT TGGTTGTATA CAAAAACGAA
TATTCAGCTA AAAAAGATTA TTTCATAAGA AGAATAGAGA GTGAGTTTTC GTATCCTGTT
TTTGTAAAGC CTGCAAACTC AGGCTCGTCT GTGGGTATAT CAAAAGCGAA AGATAGAGAA
GACCTTGTTT TGGCAATACA TGAGGCTTTT TTGTATGATA CAAAGATTTT GATTGAACAG
GCTATAAACG CTCGGGAGAT AGAATGTGCA GTTTTGGGAA ATGATGAGGT ATTTGTGTCT
GAGCCGGGAG AAATAATTCC GTCAAGAGAG TTTTATTCGT ATGAGGCAAA ATATATTGAT
AATTCATCAG AGCTCATCAT CCCGGCAAGA CTTCCAAAGG AAGTTACTGA AGAAATAAAA
GATTTGGCAG GAAGGATTTA CAAGATTTTT GAGTGCTGTG GAATGGCAAG AGTGGACTTT
TTTGTTGATA AAGATACGAA CAAAGTGTAT TTCAACGAGG TAAACACAAT ACCTGGTTTT
ACAAGTATTT CGATGTATCC AAAACTTATG GAGTTCAGTG GCATTCCATA TTCTCAACTC
ATTGATAAAC TAATTTCTCT TGCCATTGAG AAAAATAGAC AGAAAAAAAG CATAAAATAC
AGCAAAGAGG GCTGA
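
The gene sequence above is laid out in blocks of ten bases, so it needs to be joined before any calculation. The following is a minimal sketch of how the blocks could be normalised and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the way the database derived its own "GC content" figure is not stated in the record, so a small rounding difference would not be surprising.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small demonstration.
first_line = clean_sequence(
    "ATGACCAAGT TAAAAGTTGC TGTTTTATTT GGAGGAGTTT CAACAGAACA CGAAATATCC"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Passing the full 1095-base sequence through the same two functions gives the gene-wide value.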
 
Protein sequence
MTKLKVAVLF GGVSTEHEIS IVSAKSIMQN MDKEKYEVIP IGITKEGKWL LYTGKIEDLD 
SKWTQFSIEC FVSPDRTKKA LVKVKDNEAT FIDIDVVFPV LHGLNGEDGT VQGLLELSGI
PYVGCGVLSS AICMDKAFAK KLALLEGIPQ GHFLVVYKNE YSAKKDYFIR RIESEFSYPV
FVKPANSGSS VGISKAKDRE DLVLAIHEAF LYDTKILIEQ AINAREIECA VLGNDEVFVS
EPGEIIPSRE FYSYEAKYID NSSELIIPAR LPKEVTEEIK DLAGRIYKIF ECCGMARVDF
FVDKDTNKVY FNEVNTIPGF TSISMYPKLM EFSGIPYSQL IDKLISLAIE KNRQKKSIKY
SKEG
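
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI-style amino-acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The function name `translate` is chosen for this example, and dropping the trailing stop codon is an assumption about how the protein sequence was reported.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# matched against the amino-acid string of the standard / table 11 code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace and a trailing stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# The first 30 bases and the final line of the gene sequence above.
print(translate("ATGACCAAGT TAAAAGTTGC TGTTTTATTT"))
print(translate("AGCAAAGAGG GCTGA"))
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.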