Gene Athe_1405 details


Gene Information

Locus tag: Athe_1405
Symbol:
ID: 7409148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1487543
End bp: 1489495
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 40%
IMG OID: 643715768
Product: bifunctional phosphoglycerate kinase/triosephosphate isomerase
Protein accession: YP_002573276
Protein GI: 222529394
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0126] 3-phosphoglycerate kinase
TIGRFAM ID: [TIGR00419] triosephosphate isomerase
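
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1487543
end_bp = 1489495
gene_length_bp = 1953
protein_length_aa = 650


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```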


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00167236
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAGC TCAACAAAAA GACCATAAGA GATATAGATG TTAGTGGCAA AAGAGTTCTT 
GTGAGGGTTG ATTTTAACGT TCCACAAGAT GAAAATGGTA ATATCACTGA CGATAGAAGA
ATAAGAGAAG CTCTTCCTAC AATAAAGTAT CTCATTGACC ACAACGCAAA GGTAATATTG
GTATCCCATT TGGGAAGACC GAAGGGCAAA TTTGACCCGA AATACTCGAT GGCTCCTGTT
GCAAAAAGAC TTTCTGAGCT TCTCGGCAAG GAAGTTGTTC TTGCAAAAGA CGTTATAGGC
GATGATGCAA AAAAGTGTGT TGAGCAGATG AAAGAAGGAG ATGTAGTTCT TCTTGAAAAT
GTCAGATTCC ACAAAGAGGA AGAAGAAAAT GATAGAGAAT TTGCAAAGGC TTTAGCCTCG
CTTGCAGACA TTTATGTCAA TGACGCATTT GGTACAGCTC ACAGAGCACA TGCATCAACA
GCAGGTGTTG CAGAGTTCTT GCCTGCAGTT GCTGGATTTT TGATGGAAAA AGAGATAGAA
ATGCTTGGCA ATGCTCTTGC AAATCCGCAA AGACCTTTTG TTGCAATCTT GGGTGGCGCA
AAAGTTTCTG ATAAGATTGG GGTTATTACA AATCTTCTTG AAAAGGTTGA TAGTCTCTTA
ATTGGCGGTG CAATGGCTTA TACCTTCTTG AAGGCAAAAG GATATAAAAT CGGGAAGTCA
AAATGCGAAG ATGATAAGCT TGATGTTGCA AGAGAGATAA TGAAAAAGGC AGAGGAAAAA
GGAGTAAACC TTCTGCTGCC TGTTGGAAGC ATAGTAGCAA AAGAGTTTAA AAATGATACA
GAGTTTATGT ACGTACCATC AGATGCAATG CCAGACGATA TGATGGGTAT GGACATAGGG
AATACCACAA TTGAGCTTTT CTCAAAAGAG ATAAAGAAGG CAAAGACCAT TGTTTGGAAC
GGACCAATGG GTGTATTTGA ATTTCCAAAC TTTGCAAAGG GAACAGAAGC TATCGCAAGA
GCTGTTGCTG AGGCTGTTGA AGAAAATGGC GCAATTGCAA TTATCGGTGG TGGCGACTCT
GCGGCTGCTG TTGAAAAACT GGGGTTTGCT GATAAGATGA CACATATTTC AACAGGTGGC
GGTGCTTCAT TAGAGTTCTT GGAAGGCAAA GTTTTACCAG GTATTGCATG TCTTCTTGAT
AAAAATCCAA GAAAAAAGAT AATCGCAGCA AACTGGAAGA TGAACAAGAC TCCTATTGAG
GCGAAAGAGT TTGTTGAAGA GCTGAAAAAA TATATTGATG ATGTTCAGGC AGAAGTAGTT
ATCTGTGCTC CATCAATTCT TGTTCCTTAT GTTAAAGAAG CAATAGAAGG AACAAATATA
AAACTTGGAA CACAAAACAT GTTCTATGAA GAAAAAGGTG CATATACAGG TGAGATCTCA
GGTCCAATGT TAAAGGAAGT TGGAGTTGAG TATGTGGTAA TTGGTCACTC TGAAAGAAGG
CAGTACTTTG GTGAAACTGA TGAGATTGTG AACAAGAAAG TGTTAGCAGC GCTCAAGTTC
GGTATCAAGC CTATTGTATG TGTTGGTGAG ACACTTAAGC AAAGAGAATA TGGTATTACA
GATGAGCTTG TAAGGCTTCA GGTCAAGATT GCACTAAATG GTGTCTCAAA AGAAGATGTT
GAAAAGGTTG TCATTGCATA TGAGCCTATC TGGGCAATAG GTACAGGTAA GAATGCAACA
CCTGAAGAGG CAAATAGAGT AATTGGGGTT ATCAGAAATG TAATTGCAGA GATTTACGAT
GAAGATACTG CGCAAAAGGT TAGAATTCAG TATGGCGGTA GTGTAAACTC TGCAAATTCA
GCAGACATTT TCAATATGCC AGAGATTGAT GGAGGCTTAG TTGGCGGTGC AAGCCTTAAT
GCTCAGGAAT TTGCAAAGAT ATTACACTAC TAA
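
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any computation such as GC content. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity, so the result is not the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGCCTAAGC TCAACAAAAA GACCATAAGA GATATAGATG TTAGTGGCAA AAGAGTTCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the full sequence into `clean_sequence` in place of `first_line` would give the value to compare with the GC content field.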
 
Protein sequence
MPKLNKKTIR DIDVSGKRVL VRVDFNVPQD ENGNITDDRR IREALPTIKY LIDHNAKVIL 
VSHLGRPKGK FDPKYSMAPV AKRLSELLGK EVVLAKDVIG DDAKKCVEQM KEGDVVLLEN
VRFHKEEEEN DREFAKALAS LADIYVNDAF GTAHRAHAST AGVAEFLPAV AGFLMEKEIE
MLGNALANPQ RPFVAILGGA KVSDKIGVIT NLLEKVDSLL IGGAMAYTFL KAKGYKIGKS
KCEDDKLDVA REIMKKAEEK GVNLLLPVGS IVAKEFKNDT EFMYVPSDAM PDDMMGMDIG
NTTIELFSKE IKKAKTIVWN GPMGVFEFPN FAKGTEAIAR AVAEAVEENG AIAIIGGGDS
AAAVEKLGFA DKMTHISTGG GASLEFLEGK VLPGIACLLD KNPRKKIIAA NWKMNKTPIE
AKEFVEELKK YIDDVQAEVV ICAPSILVPY VKEAIEGTNI KLGTQNMFYE EKGAYTGEIS
GPMLKEVGVE YVVIGHSERR QYFGETDEIV NKKVLAALKF GIKPIVCVGE TLKQREYGIT
DELVRLQVKI ALNGVSKEDV EKVVIAYEPI WAIGTGKNAT PEEANRVIGV IRNVIAEIYD
EDTAQKVRIQ YGGSVNSANS ADIFNMPEID GGLVGGASLN AQEFAKILHY
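
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch that builds the codon table from the compact NCBI-style string; translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because the gene begins with ATG. The `translate` function is an illustrative helper, not a tool used by the source database.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTAAGC TCAACAAAAA GACCATAAGA"))  # MPKLNKKTIR
```

The output matches the first ten residues of the protein sequence listed above.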