Gene Information |
Locus tag | GM21_1641 |
Symbol | |
ID | 8136972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1910213 |
End bp | 1911931 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869254 |
Product | hypothetical protein |
Protein accession | YP_003021454 |
Protein GI | 253700265 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 193 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAGATCGT GGGCCAATAA CACTACCTCG CGCCACACGC TGCTCGTCTG CCTTTTGCTG GTCTCAATGA TCCTAGGGGT CTATTACCCC GCGCTGTTGA GCGGTTTTCA TCCGGTCGAC GACCCCGGGA TCGTCTCCCT GTACACCTCT TCGCCGCCAC TGTCTCAGAT CCTGCTCCCC GGGCACGGGT ACTATTACCG TCCGCTGCTG GAGCTCTCCT TCTGGCTGGA CTCAGTATTG TGGGGGATGG AAGCCCGGGC CATGCACCTG GAGAACGTCC TGCTGCATTG CCTGAATGCT CTGCTGGTTT TCCTGCTGGC CGGCAAGGCG AGCGAACGGG TGGGCGGAGA ACCTCTTCTG GGTGGCACGC TCGCGGCTGC GCTTTTTTCC CTCCACCCTG TGCATGTGGA GGCGGTGGCT TGGGTAGCGG GGAGGTCGGA CCTTCTGCTG ACCCTCTTCG TACTCAGCGC CTTGTGGCTT TGGCTGTGCT GGCTCTCCTC TCCCAGGCCG CGGGACCTTG CCTTGTCGTT GCTGCTGATG GTGGCCGCCT TGCTCACCAA GGAGACGGCG CTTGTCACAG GAGGAGTTTT CCTGCTCATG GCACTAGTGT GGCCCGGGGC CGCTTCAGCC CGGCAGCGGG GTGCCGCGCT TGGCCTGATG GCCGGCCCCG CGCTGCTTCT TGCGCTGTTC GCCTTGCTGT TTCACAGCGG GACCAGCGGT CTTAGCCGCT TCACCTCTGC TACGGCACTT CATCGCTGGA GCGCTTGCAA CGACGCCCTT GTTGCCCTCG GTTTTTACGC TCGAAAACTT CTCTTCCCGC TGCCGCTGAA CTTCGCCATC ACCGAAGTCG ACCCGCGCCA TGCGTTGCTT GGCGCTGCAC TCTTGCCGGC ACTTTCATGG CTGTTCCTGC GCCGCCGGCT TTCAGCTGCC TTCTTCGCCT CGGCGCTTTT GATGGTGCTT CCCGCCGTAC TCATTGCTGT GAACCAGATA GCCTGGACCC CGTATGCCGA GCGCTACCTC TATCTCCCTT GCGCCTGCTG CGCCGTGGCA TTGGTATCGC TGGTCCCGAC AAAGGCCAGC GCGTCGCGCC CGGCCGTTTG GGCGCTCGTG GGAACGCTGG TGGTGGCCTA TGCCGCAGTC GACTTACAGC GCACCCTGCT TTGGAAGGAC AAACTTGCCT TCTTCCGTGA AGCCGTCGTC AGATCACCCC GCTTCGGCAC GGTCTATAAC GAACTGGGGG GGCAACTTCT TGAGCAGGGG AAGCTTGAAA AGGCTGCCGA GGCATTTGCC ACGGCTGACC GGCTCAACAA ACGCGACTCC ATGAGGCTGC TGATAAAGGC CAACTTGTTG GGTGTTGCCT TTGCCCGTCA GGATTATCTG AAGGTAAGGA CCCTTTTCTT CCAAACGTTC AAGGACAAGA GGGCGGCCAA TGCAGATTTC CTCGATCTTC TGCAGAAAGC CGACAGCAGG AGACTCCCCA CGCTTTCCGG GGATGCCGAA GTCGCCCTCA CTCGGGATAT CCTGGAAACG CTGGATTTAT TAGGCGAAAA ACGCTATGAT CCCTTCTGGC TTTACCGCAG CGGGCAGTTC TCGCTTGCCA TCGGAGACAA GGCCAAGGCG GCGGAGTTTT TCTCGCGTGC CTACCGCCTC GCGCCTGCCG GCGCGCATTA CAAAGAGCCG GCGAAGGTGT ACCTGCGTAA ATTGGAACCT GCTCGATGA
Protein sequence | MRSWANNTTS RHTLLVCLLL VSMILGVYYP ALLSGFHPVD DPGIVSLYTS SPPLSQILLP GHGYYYRPLL ELSFWLDSVL WGMEARAMHL ENVLLHCLNA LLVFLLAGKA SERVGGEPLL GGTLAAALFS LHPVHVEAVA WVAGRSDLLL TLFVLSALWL WLCWLSSPRP RDLALSLLLM VAALLTKETA LVTGGVFLLM ALVWPGAASA RQRGAALGLM AGPALLLALF ALLFHSGTSG LSRFTSATAL HRWSACNDAL VALGFYARKL LFPLPLNFAI TEVDPRHALL GAALLPALSW LFLRRRLSAA FFASALLMVL PAVLIAVNQI AWTPYAERYL YLPCACCAVA LVSLVPTKAS ASRPAVWALV GTLVVAYAAV DLQRTLLWKD KLAFFREAVV RSPRFGTVYN ELGGQLLEQG KLEKAAEAFA TADRLNKRDS MRLLIKANLL GVAFARQDYL KVRTLFFQTF KDKRAANADF LDLLQKADSR RLPTLSGDAE VALTRDILET LDLLGEKRYD PFWLYRSGQF SLAIGDKAKA AEFFSRAYRL APAGAHYKEP AKVYLRKLEP AR
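As a consistency check on the record above, the following minimal sketch recomputes the gene and protein lengths from the start and end coordinates. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for GM21_1641.
length = gene_length_bp(1910213, 1911931)
print(length)                     # 1719
print(protein_length_aa(length))  # 572
```

Both results match the Gene Length and Protein Length fields. The gene sequence begins with TTG, which translation table 11 permits as an alternative start codon read as methionine, and ends with the stop codon TGA. This is consistent with the protein sequence starting with M and ending with AR.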