Gene Cthe_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3101 
Symbol 
ID4809727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3657482 
End bp3658714 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content43% 
IMG OID640108529 
ProductL,L-diaminopimelate aminotransferase 
Protein accessionYP_001039489 
Protein GI125975579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR03542] LL-diaminopimelate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTA TTAATGAAAA TTATCTTAAG CTTCCGGGAA GCTACCTTTT TTCTGAAATT 
GCGAGGAGAG TGGACAATTT CAGAAAGGAA AATCCCAATG CAAAAATAAT ACGGCTGGGT
ATTGGAGATG TTACAAAGCC GTTGGCGCCG GCAGTTATTG ACGCTTTGCA CAAAGCGGTG
GACGAAATGG CAAAAGAGGA GACTTTTAAA GGATACGGAC CGGAGCAAGG TTATAGCTTC
TTAGTCAGCA AAATAATTGA ATATGACTAT ATGCCCCGGG GAATCAGGCT TGATGAGGAC
GAGGTTTTTG TAAGCGACGG GGCGAAAAGT GATACTGGAA ATTTCCAGGA GATATTTGGC
CTGGACAACA AAGTTGCCGT TACCGACCCT GTATATCCTG TTTATGTTGA CAGCAATGTT
ATGGCAGGAA GGACCGGAAA GTATCTTGCG AATGGTTATT TTGAGAATAT AACCTATCTT
CCGTGTACTG CCGAAAACAA TTTCATTCCT GAACTTCCAA AAGAGAAAGT GGATATTATT
TACCTTTGTT TCCCAAATAA TCCGACGGGA ATGACCTTGT CTAGGGAAGA ACTTAAAAAG
TGGGTCGACT ATGCAAGGGA AAACCGCGCG ATAATACTGT TTGACTCGGC ATACGAGGCG
TATATCCGTG AGAAAGATGT GCCCCACAGC ATTTATGAGG TTGAGGGAGC AGATGAGGTG
GCAATTGAGT TTAGAAGCTT TTCCAAGACG GCAGGTTTTA CCGGAACAAG GTGTGCGTAT
ACCGTAGTTC CCAAAAAGGT TGTGGCTTAT ACCAAAAACG GAGAAGCGCA TCAGCTCAAC
AGCCTTTGGA ACAGAAGACA GACAACAAAA TTCAACGGTG TTCCGTATAT TATACAGCGG
GCAGCGGCGG CGGTTTATAC CCCGGAGGGA CAAAAACAGA CTAAAGAAAC CATAGACTAT
TACATGGAAA ATGCAAAAAT AATCAAACAA GGTTTGGAGG ATATCGGGCT TACCGTATTT
GGAGGAGTAA ATGCTCCGTA TATCTGGCTT AAGACTCCGG ATGGCATAAG TTCATGGGAA
TTTTTTGATA TCATGCTAAA AGAAATAAAT GTTGTCGGAA CACCCGGTTC AGGATTCGGA
CCGAGCGGAG AAGGATATTT CCGGTTAACC GCTTTCGGAA GCAGGGAGAA TACTCTTGAG
GCTGTGGAAA GATTTAAAAA TTTGAAATTT TAG
 
Protein sequence
MAFINENYLK LPGSYLFSEI ARRVDNFRKE NPNAKIIRLG IGDVTKPLAP AVIDALHKAV 
DEMAKEETFK GYGPEQGYSF LVSKIIEYDY MPRGIRLDED EVFVSDGAKS DTGNFQEIFG
LDNKVAVTDP VYPVYVDSNV MAGRTGKYLA NGYFENITYL PCTAENNFIP ELPKEKVDII
YLCFPNNPTG MTLSREELKK WVDYARENRA IILFDSAYEA YIREKDVPHS IYEVEGADEV
AIEFRSFSKT AGFTGTRCAY TVVPKKVVAY TKNGEAHQLN SLWNRRQTTK FNGVPYIIQR
AAAAVYTPEG QKQTKETIDY YMENAKIIKQ GLEDIGLTVF GGVNAPYIWL KTPDGISSWE
FFDIMLKEIN VVGTPGSGFG PSGEGYFRLT AFGSRENTLE AVERFKNLKF