Gene Information |
Locus tag | Athe_1405 |
Symbol | |
ID | 7409148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1487543 |
End bp | 1489495 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643715768 |
Product | bifunctional phosphoglycerate kinase/triosephosphate isomerase |
Protein accession | YP_002573276 |
Protein GI | 222529394 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | [TIGR00419] triosephosphate isomerase |
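The length fields in the table above are consistent with the coordinates. A minimal sketch of the arithmetic follows. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1487543
END_BP = 1489495


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumes the CDS ends with one stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1953
print(protein_length(length_bp))  # 650
```

Both results match the Gene Length (1953 bp) and Protein Length (650 aa) rows.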
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00167236 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCTAAGC TCAACAAAAA GACCATAAGA GATATAGATG TTAGTGGCAA AAGAGTTCTT GTGAGGGTTG ATTTTAACGT TCCACAAGAT GAAAATGGTA ATATCACTGA CGATAGAAGA ATAAGAGAAG CTCTTCCTAC AATAAAGTAT CTCATTGACC ACAACGCAAA GGTAATATTG GTATCCCATT TGGGAAGACC GAAGGGCAAA TTTGACCCGA AATACTCGAT GGCTCCTGTT GCAAAAAGAC TTTCTGAGCT TCTCGGCAAG GAAGTTGTTC TTGCAAAAGA CGTTATAGGC GATGATGCAA AAAAGTGTGT TGAGCAGATG AAAGAAGGAG ATGTAGTTCT TCTTGAAAAT GTCAGATTCC ACAAAGAGGA AGAAGAAAAT GATAGAGAAT TTGCAAAGGC TTTAGCCTCG CTTGCAGACA TTTATGTCAA TGACGCATTT GGTACAGCTC ACAGAGCACA TGCATCAACA GCAGGTGTTG CAGAGTTCTT GCCTGCAGTT GCTGGATTTT TGATGGAAAA AGAGATAGAA ATGCTTGGCA ATGCTCTTGC AAATCCGCAA AGACCTTTTG TTGCAATCTT GGGTGGCGCA AAAGTTTCTG ATAAGATTGG GGTTATTACA AATCTTCTTG AAAAGGTTGA TAGTCTCTTA ATTGGCGGTG CAATGGCTTA TACCTTCTTG AAGGCAAAAG GATATAAAAT CGGGAAGTCA AAATGCGAAG ATGATAAGCT TGATGTTGCA AGAGAGATAA TGAAAAAGGC AGAGGAAAAA GGAGTAAACC TTCTGCTGCC TGTTGGAAGC ATAGTAGCAA AAGAGTTTAA AAATGATACA GAGTTTATGT ACGTACCATC AGATGCAATG CCAGACGATA TGATGGGTAT GGACATAGGG AATACCACAA TTGAGCTTTT CTCAAAAGAG ATAAAGAAGG CAAAGACCAT TGTTTGGAAC GGACCAATGG GTGTATTTGA ATTTCCAAAC TTTGCAAAGG GAACAGAAGC TATCGCAAGA GCTGTTGCTG AGGCTGTTGA AGAAAATGGC GCAATTGCAA TTATCGGTGG TGGCGACTCT GCGGCTGCTG TTGAAAAACT GGGGTTTGCT GATAAGATGA CACATATTTC AACAGGTGGC GGTGCTTCAT TAGAGTTCTT GGAAGGCAAA GTTTTACCAG GTATTGCATG TCTTCTTGAT AAAAATCCAA GAAAAAAGAT AATCGCAGCA AACTGGAAGA TGAACAAGAC TCCTATTGAG GCGAAAGAGT TTGTTGAAGA GCTGAAAAAA TATATTGATG ATGTTCAGGC AGAAGTAGTT ATCTGTGCTC CATCAATTCT TGTTCCTTAT GTTAAAGAAG CAATAGAAGG AACAAATATA AAACTTGGAA CACAAAACAT GTTCTATGAA GAAAAAGGTG CATATACAGG TGAGATCTCA GGTCCAATGT TAAAGGAAGT TGGAGTTGAG TATGTGGTAA TTGGTCACTC TGAAAGAAGG CAGTACTTTG GTGAAACTGA TGAGATTGTG AACAAGAAAG TGTTAGCAGC GCTCAAGTTC GGTATCAAGC CTATTGTATG TGTTGGTGAG ACACTTAAGC AAAGAGAATA TGGTATTACA GATGAGCTTG TAAGGCTTCA GGTCAAGATT GCACTAAATG GTGTCTCAAA AGAAGATGTT GAAAAGGTTG TCATTGCATA TGAGCCTATC TGGGCAATAG GTACAGGTAA GAATGCAACA CCTGAAGAGG CAAATAGAGT AATTGGGGTT ATCAGAAATG TAATTGCAGA GATTTACGAT 
GAAGATACTG CGCAAAAGGT TAGAATTCAG TATGGCGGTA GTGTAAACTC TGCAAATTCA GCAGACATTT TCAATATGCC AGAGATTGAT GGAGGCTTAG TTGGCGGTGC AAGCCTTAAT GCTCAGGAAT TTGCAAAGAT ATTACACTAC TAA
Protein sequence | MPKLNKKTIR DIDVSGKRVL VRVDFNVPQD ENGNITDDRR IREALPTIKY LIDHNAKVIL VSHLGRPKGK FDPKYSMAPV AKRLSELLGK EVVLAKDVIG DDAKKCVEQM KEGDVVLLEN VRFHKEEEEN DREFAKALAS LADIYVNDAF GTAHRAHAST AGVAEFLPAV AGFLMEKEIE MLGNALANPQ RPFVAILGGA KVSDKIGVIT NLLEKVDSLL IGGAMAYTFL KAKGYKIGKS KCEDDKLDVA REIMKKAEEK GVNLLLPVGS IVAKEFKNDT EFMYVPSDAM PDDMMGMDIG NTTIELFSKE IKKAKTIVWN GPMGVFEFPN FAKGTEAIAR AVAEAVEENG AIAIIGGGDS AAAVEKLGFA DKMTHISTGG GASLEFLEGK VLPGIACLLD KNPRKKIIAA NWKMNKTPIE AKEFVEELKK YIDDVQAEVV ICAPSILVPY VKEAIEGTNI KLGTQNMFYE EKGAYTGEIS GPMLKEVGVE YVVIGHSERR QYFGETDEIV NKKVLAALKF GIKPIVCVGE TLKQREYGIT DELVRLQVKI ALNGVSKEDV EKVVIAYEPI WAIGTGKNAT PEEANRVIGV IRNVIAEIYD EDTAQKVRIQ YGGSVNSANS ADIFNMPEID GGLVGGASLN AQEFAKILHY
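The relation between the two sequences can be checked with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene is on the minus strand). It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as starts. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not special-cased; this record starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGCCTAAGC TCAACAAAAA GACCATAAGA"
print(translate_cds(first_groups))  # MPKLNKKTIR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` should reproduce the complete protein under the same assumptions.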