Gene Athe_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1201 
Symbol 
ID7409675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1293833 
End bp1295113 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content37% 
IMG OID643715566 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_002573074 
Protein GI222529192 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0421801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTTG ACAAAAGTAA AGAAGTATTT GACAACACCA AAAGATATAT ACCAGGCGGG 
GTTAACAGTC CAGTTCGTGC ATTTAAAAAT TTGAGTATTA CACCGCCTGT CATATCAAAA
GGAAAAGGCT GCCGTATATT TGATATTGAT GGCAATGAAT ATATTGATTT TGTTCTGTCC
TGGGGTGCGA TGATATTAGG ACATTGTGAC CCTGATGTTG TAAATAGGAT GAAAGAAGTG
GTGGAAGATC AAATAGCATT TGGAGCACCA ACAGAAATTG AATATAAGAT GGCAAAGCTT
GTGTGTGAGA CAGCCCAAAT TGATATGGTT CGATTTGTTA ATTCAGGAAC AGAAGCTACA
ATGACTGCTG TAAGGCTTGC AAAAGGTTAT ACTGGGAAGA AAAAAATAGT AAAGTTTGCA
GGCTGTTATC ATGGTCATCA TGACATATTT CTGAAAGAAG CAGGGTCAGC AGTAGCCGAG
CTAAGATTAA AGCGAATTGA TGAAGATATT GTACAAAATA CAATTGTGGT TGAATACAAC
AATTTAGATT CAGTAGAAAA AGCTTTTAAA GAAAACAAAG ATGAGATAGC AGCTGTTATA
ATCGAGCCTG TGGCAGGGAA TATGGGTGTT GTACCTGCCA AAAAAGAGTT TTTGCAAGTC
CTAAGAGAAA TTTGCAACCT CCACGGCAGT CTTCTGATTT TTGATGAAGT AATAACCGGC
TTTAGGCTCT CATTAAAAGG GGCAAGAGCT TTATATAATG TTGAGCCAGA CCTTGTAACT
TTTGGCAAGA TAATTGGTGG AGGGCTTCCT TGTGGCGCAG TTGGTGGCAA GAAAGAGATT
ATGGAATGTT TAGCACCACA GGGAAATGTC TTTCAGGCAG GTACTATGTC GGGCAATCCA
ATTGTGATGA GTGCAGGGTA CGCTACTATC AAAAAGCTTA AAGAAAATCC TCATTTTTAT
AGTAATTTGG AGATGTTAGC AGGAAAACTC GAAAAAGAGT TGACACAAGT CTTTTCTAAT
TCCAATTTAA CTTTTTGCAT AAACAGGGTA GGTTCAATGC TAACAATCTT CTTTGGAGTT
GAAAAGGTAG AAAATTTCGA GATGGCAAAG ATGAGCGATT TAGACTTGTT CAGAAGTTTT
GCAGAATATA TGATAAAAAA CCATATTTAT GTTCCTTCCT CTCAATTTGA AGCGATGTTC
TTATCTGTAG CACATAGCGA AAATGATGTA GAAAAATTCG TTGAAATTGC TGAGGAATTT
TGCTCTTCAA AAAGGAAATG A
 
Protein sequence
MRLDKSKEVF DNTKRYIPGG VNSPVRAFKN LSITPPVISK GKGCRIFDID GNEYIDFVLS 
WGAMILGHCD PDVVNRMKEV VEDQIAFGAP TEIEYKMAKL VCETAQIDMV RFVNSGTEAT
MTAVRLAKGY TGKKKIVKFA GCYHGHHDIF LKEAGSAVAE LRLKRIDEDI VQNTIVVEYN
NLDSVEKAFK ENKDEIAAVI IEPVAGNMGV VPAKKEFLQV LREICNLHGS LLIFDEVITG
FRLSLKGARA LYNVEPDLVT FGKIIGGGLP CGAVGGKKEI MECLAPQGNV FQAGTMSGNP
IVMSAGYATI KKLKENPHFY SNLEMLAGKL EKELTQVFSN SNLTFCINRV GSMLTIFFGV
EKVENFEMAK MSDLDLFRSF AEYMIKNHIY VPSSQFEAMF LSVAHSENDV EKFVEIAEEF
CSSKRK