Gene Athe_1414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1414 
Symbol 
ID7409157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1498592 
End bp1499815 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content37% 
IMG OID643715777 
Productaminotransferase class I and II 
Protein accessionYP_002573285 
Protein GI222529403 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.445663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTAATT TAGAAAAATA TATTTCAAAG AGCGTTCAAA GCGTTCCACC CTCAGGTATT 
AGGAAATTTT TTGATATTGT ATCTGAGATG AAGGATGCTC TATCACTTGG AGTTGGTGAA
CCTGATTTTG TAACTCCATG GAACATTCGC GAGATGGGGA TATATTCTAT TGAAGAAGGA
CATACCCACT ATACATCAAA TTTCGGGCTT CTGGAGCTGA GAAAAGAGAT TAGCAGGTAT
TTGAAAGATA GATTCGACCT TAGTTATCCA AATTACAGAG ATCAGATTTT GGTAACTGTT
GGGGCAAGCG AAGCAATCGA TATTGCTTTA AGAAGTATAG TAAATCCTGG TGATGAGGTT
TTGATTCCTG AACCCTGTTT TGTTTCATAC AAACCATGTG TAATTTTTGC AGGTGGTGTA
CCGGTTGAGA TTGAAACAAG ACCGGAAAAC GACTTTAAGC TTAGAGCAGA GGATATTTTA
CCCAAAATAT CTTCCAAAAC TAAGGCTATT ATTTTATCTT ATCCCAACAA TCCAACAGGT
GCTATTATGA CAAGAGAGGA TTTAAAAGAG ATTGTTGATA TATTGAAAGA TAGGGATATC
ATAGTAATAT CTGACGAAAT ATATGCTGAG CTTACATATG AAGGAAGCCA TGTTTCAATT
GCTAATTTCC CTGAAATGAA AGAAAAGACT ATTGTTATAA ATGGTTTTTC AAAAGCCTTT
GCCATGACAG GTTGGAGACT TGGATTTGTT GCTGCCAATG AAGTTTTTAT TAAAGCGATG
GCAAAGGTAC ATCAGTACAT TATAATGAGT GCTCCTACAT TTTCTCAGTA TGCTGCTATT
GAAGCGCTTA AAAATGGGCT TTTAGAAGTT GAAAAGATGA AAGATGAATA CAATAGAAGA
AGACGCTATA TGGTAAGCAG ATTTAATAAG ATGGGGCTTG AATGTTTTGA GCCAAAAGGA
GCGTTTTATG TTTTTCCATC CATAAAATCC ACCGGTCTTC CTTCTGAGGA GTTTGCAGAA
AGGCTTTTGT ACGAACAAAA AGTAGCAGTT GTGCCAGGTA CTGCATTTGG GAGGTCAGGA
GAAGGATTTA TAAGATGTTC GTATGCCTAT TCGATTGAAA CCATAAAACA AGCTTTGGAC
AGGATAGAAA AGTTTGTTTT AAGTCTCAAA ACTCAAGGTG TCTTCCAGCA AACTCGTGAA
AATAATGTGG TAGTGGAGAA GTAA
 
Protein sequence
MSNLEKYISK SVQSVPPSGI RKFFDIVSEM KDALSLGVGE PDFVTPWNIR EMGIYSIEEG 
HTHYTSNFGL LELRKEISRY LKDRFDLSYP NYRDQILVTV GASEAIDIAL RSIVNPGDEV
LIPEPCFVSY KPCVIFAGGV PVEIETRPEN DFKLRAEDIL PKISSKTKAI ILSYPNNPTG
AIMTREDLKE IVDILKDRDI IVISDEIYAE LTYEGSHVSI ANFPEMKEKT IVINGFSKAF
AMTGWRLGFV AANEVFIKAM AKVHQYIIMS APTFSQYAAI EALKNGLLEV EKMKDEYNRR
RRYMVSRFNK MGLECFEPKG AFYVFPSIKS TGLPSEEFAE RLLYEQKVAV VPGTAFGRSG
EGFIRCSYAY SIETIKQALD RIEKFVLSLK TQGVFQQTRE NNVVVEK