Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1724 |
Symbol | |
ID | 8137055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2009804 |
End bp | 2011618 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869336 |
Product | hypothetical protein |
Protein accession | YP_003021536 |
Protein GI | 253700347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGC TTTTAGCGGC ACTCACCTGT ATCGCCATGG TCCTGGCACT CGCTGTGCCG GTTGCCCTGG CAGCCAGACC GGTCGCAGAC AAAACCGCAC CGGTCACAAC CGCTTCGCCG CTTGGTGGCA GCTTCACCGC ACCTGTCACG GTGACCCTGA GCGTCAACGA GGCCGCAACT ACCTACTACA CCACCAACGG CAGCACCCCG ACCACCGCCT CGACCGTCTA CAGTGCGCCG CTTGTCATAA GCGCTACCAC TACCCTGAAG TACTTCTCCA AGGACACGGC CGGGAACCTC GAGGCCGTAA AAAGCCAGAC CTATACCGTG ACCGCCGGCG GTCACGCGAG CCTCACCTGG ACCGGCTACA ACATGTGCAG CTCCTGCCAC GACGCCCAGG CGAAGGCCAT GTACCAGAGC GTCCACTACC AGTGGAAAGG GTCCGCCGCA GAGATGACCA CCGGGCCGGC TGCCCAGGGC AAGATGGACG CGGTGGACGG CTCCAGCGCG CTCAACGCCT ACTGCATCAA CATCCAGGGC AACTGGGGTC CTTGCGGCGC CTGCCATGCC GGTACCGGCG CGAAGCCCGT GGCTACCGCC AACCCTTCCG CCTCTCAGCT CGCGGCCATC GACTGCCTCA TGTGCCACAA CGACACCGTC AACGCTCCCT ACAGCCGCGT GCGTAACGCC ACCACGGGCC TGTTCGAGCC AGCTGCCGGT CTCAACATGA ACCTGGTGGT ACAGAAGGCG AGCATCAAGC CGACCCGCAA AAACTGCCTC GGCTGCCATG CCAAAGCCGG CGGCGGCGAC GCCGTTAAGC GCGGCGATAT CGCCCTTGCT TCCGGCACCA CCGCAGACGT CCTTTACGAC ACCCACATGG CGACCGGCAA CGGCGGCAAC CTCGCCTGCC AGGCCTGCCA TACCTTCTCC AGCCACCGCG TCGCCGGTCG TGGCTCCGAC CTGCGTCCCG AGGACAGCAC CCTCGAAGTC AACTGCTCCA CCAGCACCTG CCACGCGACC AAAACCAACA TGAGCACCGG CCACACCACC TATGATACGA GCCACCACGT AGGTCGCGTC GCCTGCCAGA GCTGCCACAT TCCGAAATAC GCCAGAAACG CCAACGACAC CGCGGCCACC GAGGCGACCG AGACCTATCG CAACTGGCAG GTGGCCGAGT GGAACGCAAC CCTCAACCGT TACGAGCCGA TGCCGACCAA GGCCAACGAC CTGAAGCCTG CGTACGCCTT CTGGAACGGC GTGAGCTGGG GCAACAACTC CTTCGACGCC GCAGTTCTCG ATCCGGCGAC CGGCGCCTAC CAGATCTCCC GCCCAGTCGG CACCCTAAAC GGCCCTGCCG GAACCAAGCT CTACCCGTTC AAGTACAAGA CCGCCAGCCA GGCACTCGCT AACGGCAAGA TCGTTCCGCT CGCCACCTCC ACCTTCTTTG CCACCGGCAA CTACGATCAG GCGGTCAAGG ATGGGATGGT GTACATCGGG CTCCCCAGCA CCACCGCGTA CACCAACGTA ACCACCGACG AGTACCAGGT GCTCAATCAC CAGGTTCCGC CCGCTGCAGG GAACGCTCTT GCCTGCGCCG CGTGCCACCC GAACGCCACC GCGACCCAGA TGAAGCTGGT CACCAACTTC GGCTACGGCT TGAAGGCTGC CACCTCCGTC GTCTGCTCGC AGTGCCACAA CGCGAAGACT CCGGGCTCCT ACGACCGCAT CCACAGCCAC GTCGAGGGCA AGGGCTTCGA CTGCTCCTGG TGCCACAACT TCTCACGTCC TGAGCGCGGC CTGACCATGC CATAG
|
Protein sequence | MRKLLAALTC IAMVLALAVP VALAARPVAD KTAPVTTASP LGGSFTAPVT VTLSVNEAAT TYYTTNGSTP TTASTVYSAP LVISATTTLK YFSKDTAGNL EAVKSQTYTV TAGGHASLTW TGYNMCSSCH DAQAKAMYQS VHYQWKGSAA EMTTGPAAQG KMDAVDGSSA LNAYCINIQG NWGPCGACHA GTGAKPVATA NPSASQLAAI DCLMCHNDTV NAPYSRVRNA TTGLFEPAAG LNMNLVVQKA SIKPTRKNCL GCHAKAGGGD AVKRGDIALA SGTTADVLYD THMATGNGGN LACQACHTFS SHRVAGRGSD LRPEDSTLEV NCSTSTCHAT KTNMSTGHTT YDTSHHVGRV ACQSCHIPKY ARNANDTAAT EATETYRNWQ VAEWNATLNR YEPMPTKAND LKPAYAFWNG VSWGNNSFDA AVLDPATGAY QISRPVGTLN GPAGTKLYPF KYKTASQALA NGKIVPLATS TFFATGNYDQ AVKDGMVYIG LPSTTAYTNV TTDEYQVLNH QVPPAAGNAL ACAACHPNAT ATQMKLVTNF GYGLKAATSV VCSQCHNAKT PGSYDRIHSH VEGKGFDCSW CHNFSRPERG LTMP
|
| |