Gene Athe_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1406 
Symbol 
ID7409149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1489606 
End bp1490619 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content40% 
IMG OID643715769 
Productglyceraldehyde-3-phosphate dehydrogenase, type I 
Protein accessionYP_002573277 
Protein GI222529395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000337546 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTTA AGATTGGTAT TAATGGTTTT GGAAGAATTG GTAGAAATGC TTTCAAAGCA 
ATTTTGGCAA ATTATCCAAA TGAGTTTGAG GTTGTTGCGG TAAACGACCT GACAGACCCA
AAGACATTAG CACATCTTTT AAAGTATGAC TCCTGTTATG GTATCTTCAA TGGCACAGTT
GACTATACAG ACACATCAAT AATTGTCAAT GGCAAAGAGA TAAAGGTATT AGCTGAAAAA
GACCCAGCAA ATCTTCCATG GAAAGATTTG GGAGTTGAGG TTGTAATTGA GTCAACAGGT
AGATTTACAA AGAAACAGGA TGCTGAAAAG CACATTCAAG CAGGTGCAAA GAAGGTAATC
ATCACAGCTC CGGCAACAGA TGAAGACATC ACAATTGTTA TGGGTGTAAA TGAGGAGATG
TACGACCCTG CTAAGCACCA TGTAATTTCA AATGCGTCCT GTACAACAAA CTGTTTAGCA
CCAGTTACAA AGGTTATTGA CAAGCATTTC AAGGTAAAAA GAGGTCTTAT GACAACAGTT
CACTCATATA CAAATGACCA ACAGATTTTG GATCTCCCAC ACAAGGATTT AAGGAGAGCA
AGAGCAGCAG CGCTTTCTAT TATTCCAACA ACAACCGGTG CGGCAAAGGC AGTAGCGCTT
GTTCTTCCAC ATCTCAAAGG AAAACTCAAT GGTTTTGCAC TCAGAGTTCC AACACCAACT
GTTTCTGTTA CAGACGTTGT GTTTGAGGTT GAAAAGCCAA CAACAAAAGA AGAAGTAAAC
AGCGTTTTGA AAGCTGCTGC AGAAGGCGAA TTAAAGGGTA TTTTGGGATA CAGCGAAGAA
CCGCTTGTTT CTGTTGACTA CAAAGGCGAT CCAAGGTCTT CAATAGTTGA TGCTCTCTCA
ACAATGGTTA TCGAAGATAC ACTTGTAAAG GTTGTTGCAT GGTACGACAA CGAGTGGGGT
TATTCCAACA GAGTTGCAGA CCTTTTGAAC TATATTGTTA GCAAGGGACT GTAA
 
Protein sequence
MAVKIGINGF GRIGRNAFKA ILANYPNEFE VVAVNDLTDP KTLAHLLKYD SCYGIFNGTV 
DYTDTSIIVN GKEIKVLAEK DPANLPWKDL GVEVVIESTG RFTKKQDAEK HIQAGAKKVI
ITAPATDEDI TIVMGVNEEM YDPAKHHVIS NASCTTNCLA PVTKVIDKHF KVKRGLMTTV
HSYTNDQQIL DLPHKDLRRA RAAALSIIPT TTGAAKAVAL VLPHLKGKLN GFALRVPTPT
VSVTDVVFEV EKPTTKEEVN SVLKAAAEGE LKGILGYSEE PLVSVDYKGD PRSSIVDALS
TMVIEDTLVK VVAWYDNEWG YSNRVADLLN YIVSKGL