Gene Information |
Locus tag | GM21_2289 |
Symbol | |
ID | 8137629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2669190 |
End bp | 2670251 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869904 |
Product | Threonine aldolase |
Protein accession | YP_003022096 |
Protein GI | 253700907 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
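The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2669190
end_bp = 2670251

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1062 0 353
```

Under these assumptions the result matches the listed Gene Length (1062 bp) and Protein Length (353 aa).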
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.000000548444 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAGA AAAAGCAAAG CGGACACAAC GAGCTGAAGC ACCATTTCGC CAGTGACAAC TATGCGGGGA TCTGCAACGA AGCCTGGGCA GCGATGGCGG AGGCCAACCG CGGCATGGCC AGCTCCTATG GCGACGATTA CTGGACCGCG GAGGCCTGCG AAAAGATCCG CGAACTGTTC GAGACCGACT GCGAGGTTTT CTTCGTCTTC AACGGCACCG CAGCCAACTC GCTCGCCCTC GCCTCCTTGT GCCAGTCCTA CCACTCCATA GTCTGCCACG AGATGGCGCA CATAGAGACG GACGAGTGCG GCGCCTCCGA GTTCTTCTCA AACGGCACCA AGGTGCTGCT GGTACCGGGA GAAAACGGGA AAATCGACCT CGATGCGGTG GAGCACACCA TCCACAAGCG CAGCGACATC CACTACCCGA AGCCGAAAGC GCTCAGCATC ACCCAGGCCA CCGAACTGGG GACGCTTTAC AGCTTGCAGG AGTTGCAGGC GATTGGGGAA CTGGCCAAGA AACACTCGCT GCGGGTGCAT ATGGACGGCG CGCGCTTCGC GAACGCGGTA GCATCCCTGA ACGTCGCCCC GAAAGAGATC AGCTGGCAGG CCGGGGTCGA CGTCCTCACC TTCGGGGGAA CCAAGAACGG TTTTGCCGTG GGCGAGGCGG TGGTCTTCTT CAACAAGGAA CTCGCCTTCG AGTTCGACTA CCGCTGCAAG CAGGCGGGAC AGCTCGCCTC CAAGATGCGC TTTCTCACCG CCCCCTGGAT CGGCATGCTG GAGAGCGGCG CCTGGCTTAA AAACGCCGCC CACGCCAACA ACTGCGCGAG GCTTTTGGAA AGCGAGATCA AGAAGATACC GCAGGTGCGG ATCATGTTCC CGAGCCAGGC CAACTCCGTG TTTCTGGAAA TGGCGCCCGA TGCGCTGGAG GCGCTGCGAG CTCGCGGCTG GCACTTCTAC ACCTTCATAG GATCCGGCGG AGCCAGGTTC ATGTGCTCCT GGGACACCGA TACAGCCGAA GTAGCCAACC TGGTAGCCGA CATCAAGGCA TCGGTGGCAT AA
|
Protein sequence | MSKKKQSGHN ELKHHFASDN YAGICNEAWA AMAEANRGMA SSYGDDYWTA EACEKIRELF ETDCEVFFVF NGTAANSLAL ASLCQSYHSI VCHEMAHIET DECGASEFFS NGTKVLLVPG ENGKIDLDAV EHTIHKRSDI HYPKPKALSI TQATELGTLY SLQELQAIGE LAKKHSLRVH MDGARFANAV ASLNVAPKEI SWQAGVDVLT FGGTKNGFAV GEAVVFFNKE LAFEFDYRCK QAGQLASKMR FLTAPWIGML ESGAWLKNAA HANNCARLLE SEIKKIPQVR IMFPSQANSV FLEMAPDALE ALRARGWHFY TFIGSGGARF MCSWDTDTAE VANLVADIKA SVA
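The relation between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration only: it builds a codon table from the amino acid assignments of NCBI translation table 11 (the table listed in the record), and translates the first 30 and last 12 bases copied from the gene sequence above. The helper name `translate` and the variable names are hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon."""
    # Remove the spaces that split the record's sequence into 10-base blocks.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_30_bases = "ATGAGCAAGA AAAAGCAAAG CGGACACAAC"  # start of the gene sequence
last_12_bases = "TCGGTGGCAT AA"                      # end of the gene sequence

print(translate(first_30_bases))  # MSKKKQSGHN
print(translate(last_12_bases))   # SVA*
```

The two outputs agree with the first ten residues (MSKKKQSGHN) and the last three residues (SVA) of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.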