Gene Information |
Locus tag | GM21_0035 |
Symbol | |
ID | 8135334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 46821 |
End bp | 48770 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644867652 |
Product | hypothetical protein |
Protein accession | YP_003019880 |
Protein GI | 253698691 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
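The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 46821
end_bp = 48770

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1950
print(protein_length)   # 649
```

Under these assumptions the computed values agree with the reported Gene Length (1950 bp) and Protein Length (649 aa).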
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.638764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGC GACTGCATCT CGAAAATTTC GGCAACGTCC ACGCCCTTCC CATCCTGCAC TACCGGATGG AGTTCGCGCA TCTGGTACGG GAAGCGTATG AGGTCCTGAA ACCGGACTGC ATCGCCATCG AGCTCCCCCG GACCCTGGAG CCGCAGTTCC TGCGCGCCGT CGCGCGCCTC CCCGAGCTTT CCGTCCTCGC CTACCACGTC GCCGGCCAAT CCGTCTTTCT CCTCGTCGAA CCCGCGGACC CTCTCATCGA GGGGGCGCGG CTCGCGCTCA AGCACCGCAT CCCGCTGCAC CTGGTCGACA TCGACCTGGA CAGCTACCCG TCCCACGACG AGCAGCTTCC CGACTCCTAC GCGGTGCAGC GCATCGGGCT GGAACCGTTC TACCGGGAGG TGGAAAAGCT CTACCGCGAG CTTGAGCCTT GCGACGAGGA TCTGCGCCGC GAGCGCGGCA TGGCCCACAG ACTGCAGCAG CTCTCCGCGC AGCACCAAAG GGTCCTTTTC ATCTGCGGCA TGTCGCACCT GGAGCGGATC CGGGAGAACT TCGGGAAACC TCTGGCGCAG CCTCTCACCC GCACCCACCG CGAAGGGGTC GCCATCTTCA ACCTGCACCC GGAATGCTGC CACGAGGTCC TGGCCGAGTA CCCGTTTCTT TCCTCCCTTT ACGAGACCAG GAGATCGCCG CTTCCTCCCG AACCCGCACA GGCGGCCTCC TTGCGGAAAA GCTTCAACGC CTTCGAGCTG ATCCTGGGAG GGAAGCAGTC GATCCCCGAA GAGCAGGCGC TCTTGGAGTC GATCCAGCGC AGCGCCCACC GCGTAGGGAG CGAAGGGGAG ATGCCGGACC GCCAGAAGGT CATGCTGCGG CTCTTCCTGG AGGCGGCGCG CCACTATCGC CAGGAGACCG GGGACAAGGT CCACTACTGG CAGAAGCGGG CCTTCTTCCG CTTCGTCAGG AACTACGCGC TCCTGTCGCA GATGCTCCTT CCCGACCTGT ACCAGATGCT CGCCGCCGCG CGCGGCTGCC TCGACGACAA CTTCGCCTAC GCCTTTTTCC GGCTTGCGGC GCACTACCCC TGGCAGAGCG AGCAAAGCGA CATCCCCACC CTGAGGCTTT CCGCAGCCGA GCTCTCCGCG GGGACCCGCA GGATACGCTT TCGCCCCCGG GAGCAGGTAC GCGGCAAGGG GCGCTCCGGC ATCAAGATGA CGAACCGGCG CAAGGAGAAG CGCCCCGGCG ACTGGTTGGA GGGGTTCGAC GACCCGTATA TCTGCTCCTA TCCCCCCGAG GACTTAAGCA TTGAGGAGTA CGGCCGCAAC CTGAAACGGA TCGGTGCGCG GCAGTTGAGC GAGGAGGCGA GCAGGACCGA GCCTTTCTCG GCGTCGCTTC TGGACGGGAT CGACATGCGC GAGACCATCA GGAACCTGCA CGAAGGGAAG ATCTACGTAA AGGAAAACAA GCGGTTAAAG AGCGGGGTAG GGTGTGTGGT GGTGGTCTTC GACGAGGACC GGGAGGACTC GGGGTATCCG TACTGCATGA CCTGGCTCGG CGAGCACGAC CAGGAGTCGG ACATGGCATT TTACGCCACG CCTCCCACCG ACAACATCGT CGGCCCCGGC ATTTCCCGCT GCGAGTACGG CGGATTCCTG TTAAGCTACC CGCCGCGCCG GATGCACGAC GTCTGGCAGG ACCCGGATTA CCGCGGGGCC CTGGGGAAGG GGGAGGTGCT TTTGATGGCG GCGCTGGATT ATTCGCTGGA GAAGGACGTG GTATATGCGG CGGCGAAGCC GCCGAGGAGC 
TATCTGAAGC AGCAGGCGGC TCGGCTGGGA AAGAGGATCA TCTATCTGCC GCTGGGGAGC CTCTCGCCGG TGGCGCTCAA GAGGCTCCGG GCGTTTCACA TTCTCTACGG CAAGGACAAG CGGGACATCG CCAAGGAGTA TATCTGGTAA
|
Protein sequence | MPERLHLENF GNVHALPILH YRMEFAHLVR EAYEVLKPDC IAIELPRTLE PQFLRAVARL PELSVLAYHV AGQSVFLLVE PADPLIEGAR LALKHRIPLH LVDIDLDSYP SHDEQLPDSY AVQRIGLEPF YREVEKLYRE LEPCDEDLRR ERGMAHRLQQ LSAQHQRVLF ICGMSHLERI RENFGKPLAQ PLTRTHREGV AIFNLHPECC HEVLAEYPFL SSLYETRRSP LPPEPAQAAS LRKSFNAFEL ILGGKQSIPE EQALLESIQR SAHRVGSEGE MPDRQKVMLR LFLEAARHYR QETGDKVHYW QKRAFFRFVR NYALLSQMLL PDLYQMLAAA RGCLDDNFAY AFFRLAAHYP WQSEQSDIPT LRLSAAELSA GTRRIRFRPR EQVRGKGRSG IKMTNRRKEK RPGDWLEGFD DPYICSYPPE DLSIEEYGRN LKRIGARQLS EEASRTEPFS ASLLDGIDMR ETIRNLHEGK IYVKENKRLK SGVGCVVVVF DEDREDSGYP YCMTWLGEHD QESDMAFYAT PPTDNIVGPG ISRCEYGGFL LSYPPRRMHD VWQDPDYRGA LGKGEVLLMA ALDYSLEKDV VYAAAKPPRS YLKQQAARLG KRIIYLPLGS LSPVALKRLR AFHILYGKDK RDIAKEYIW
|
| |
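The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The function name `translate` and the compact table layout are illustrative choices, not part of the record.

```python
# Amino acid assignments for NCBI translation table 11 (same as table 1),
# listed with each codon position cycling through the bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three 10-base blocks of the gene sequence as listed above.
first_codons = translate("ATGCCCGAGC GACTGCATCT CGAAAATTTC")
# Last three codons of the gene sequence, ending in the TAA stop codon.
last_codons = translate("ATCTGGTAA")
print(first_codons, last_codons)
```

The first call yields MPERLHLENF and the second yields IW, matching the first and last residues of the listed protein sequence.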