Gene Information |
Locus tag | GM21_2911 |
Symbol | |
ID | 8138254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3381362 |
End bp | 3382873 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644870509 |
Product | hypothetical protein |
Protein accession | YP_003022698 |
Protein GI | 253701509 |
COG category | [S] Function unknown |
COG ID | [COG3786] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
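The coordinate and length fields above are related by simple arithmetic, which a short sketch can check. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the CDS (assumed) but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for GM21_2911.
length = gene_length_bp(3381362, 3382873)
protein = protein_length_aa(length)
print(length, protein)
```

With the values from this record, the result agrees with the listed Gene Length (1512 bp) and Protein Length (503 aa).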
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.000000449033 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGACCAC TGCGCACCCT TTTGGGCGGC TTCGCACTCC TTGCAGTTTA CCTGTCCGCA GGATGCGGCA GCTCCGGCGA GCTGCCGCGC CGGATCCGGC AGGATCTGGA CACGGCCCTC CTCGGCAAGA GCCGGCAACT CCTCTACGTC GATGCCGTCA GCCCCGCCTC GACGGGCGCG ACCCTGTACC CGCTGCAGCG CGGCCTCCTT GGGTGGCAAC TGGCCTCCCT GCCGGTGCCG GTGAATCTGG GTAGAAACGG GGTGGCGCCT CCCTTCGAGA AGCGCGAGGG AGACGGGCGC ACCCCATCCG GTCTTTTCCC GCTGCGCCGG GCTTTCGGTT ACCAGGGGGA ATTCAAGGGT GGCATTCCGT ACCAGCAGGT GGACAGCCAG GACCTCTGGG TAGACGACGT CCATTCCCCC GATTACAACC TTTGGGTATG GCGCGGCCAG ACCGGGGCCT CCTCCTACGA GGAGCTGCTC CGTTCGGATC CCCTATATAA ATACGCGCTG GTGCCGGAGT ACAACGAGGC GCCGGTGGTG CGGGGCCTGG GGAGCGCCAT CTTCGTCCAT GTCGAAAAGG AAAGGGGAGC GGAGACCTCG GGGTGCATCT CCCTGCCGGA AAAGGAGCTG GTGCAGGTGA TGCAGTGGCT GGACCCGGCC CAGGAGCCGC AACTGCTGGT GGCGACCGCG TCCGCGCTGG AGCTGGCGAA CCAGGGGGTG AAGACCCAGC TTCCCGGCGA TCTCCCGCCC GAGATGGCCC AGCGGCTGCA GGAGGCCTCG CGCCTGCTGG CGCTGAAGAG CGGGAACGGG TTCTTTGCCG CCGCGGTGAC GCTCCCCCCG CAGGTGAGCC GGCGGATGCT GGAAAAGGGA TCCTGGCGTC CGGAGTGCCC GGTCCCCCTC GATGAGCTCT CCTACCTGGT CACCTCGTAC TGGGGGTTCG ACGGCCGGCA GCATTACGGG GAACTGGTGG TCCATGCCTC GCTTTGCGCC TTCGTCATGG ACTCGATGCA GCATGCCTTC AACGGCCGCT TCCCCATAGA GCGGATGGAG CTGGCCGAGG CCTTCGACGC CGACGACTTC CTTGCCATGG CCGCCAACAA CACCTCCGCC TTCAACTGCC GCGAGGTCCC CGGGCGTCCC GGCGTCTTCT CCAAGCACAG CTATGGCGCC GCCATCGACA TCAACCCGCT GCAGAACCCT TACCTGCAGG TAAACCCGGA GGCGTACCCA GCCTCTTTCG GGGAGGCCGC CAACGGTAGC TCCGGCGACC CGGCCGTGGC CGCGGCCGAT TTTTGCCGCA ACAACGGTTC GCTCTGCCGC ATACTGCCGG CCGCTTCGGC TTCCTTCCTG GACCGGCGGG ACCTGCGGCC GGGGATGCTG CAGGCTGGCG ATCCGCTGCT CTCCGCGTTC CGGCAGCGCG GCTTTTCCTG GGGCGGGGGC TGGCGCTTCC CCGATTACCA GCACCTAGAG TACGACATCC GCAAACTCCG CCTTGTCTCA GGCCGGCCCT AG
|
Protein sequence | MRPLRTLLGG FALLAVYLSA GCGSSGELPR RIRQDLDTAL LGKSRQLLYV DAVSPASTGA TLYPLQRGLL GWQLASLPVP VNLGRNGVAP PFEKREGDGR TPSGLFPLRR AFGYQGEFKG GIPYQQVDSQ DLWVDDVHSP DYNLWVWRGQ TGASSYEELL RSDPLYKYAL VPEYNEAPVV RGLGSAIFVH VEKERGAETS GCISLPEKEL VQVMQWLDPA QEPQLLVATA SALELANQGV KTQLPGDLPP EMAQRLQEAS RLLALKSGNG FFAAAVTLPP QVSRRMLEKG SWRPECPVPL DELSYLVTSY WGFDGRQHYG ELVVHASLCA FVMDSMQHAF NGRFPIERME LAEAFDADDF LAMAANNTSA FNCREVPGRP GVFSKHSYGA AIDINPLQNP YLQVNPEAYP ASFGEAANGS SGDPAVAAAD FCRNNGSLCR ILPAASASFL DRRDLRPGML QAGDPLLSAF RQRGFSWGGG WRFPDYQHLE YDIRKLRLVS GRP
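The GC content field can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used here and contains only A, C, G, and T; `gc_content` is a hypothetical helper, not part of any database tooling.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, as a small worked example.
first_block_gc = gc_content("ATGAGACCAC")
print(first_block_gc)
```

Passing the full gene sequence to the same function would give a value to compare against the GC content listed in the record, which is presumably rounded.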