Gene Information |
Locus tag | GM21_1658 |
Symbol | |
ID | 8136989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1928366 |
End bp | 1930027 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644869271 |
Product | hypothetical protein |
Protein accession | YP_003021471 |
Protein GI | 253700282 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 177 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGACG ATCGACACCC GCTTACCCTG AGAGAATACG GCGCCACTGT CGCCATGCTT CTCCTGGGAA TGTACTTTAT ACCTCTGTCG GTCATGCATC TCGACCTGTC CCGCATTCCC GGCGACCTCG TGGACTCACG GCTAAACAAC TATTTTCTTG AGCACGGCTA CAGATGGGTG ACGGGGCAGG TCGCGAGTTT CTGGAACGCT CCGTTCTTCT TCCCTGCGCC CCAGGTGATG ACTCTTTCTG ACAATCACCT GGGCACGCTC CCCCTGTATT CGCTTTTTCG CTTACTTAAT TTCGATCGAG AAACCGCCTA CCAGTTGTGG ATGCTGACTC TTTTCGTTCT CAACTATCTT TGCGCGGTCG TTGTTTTGAA TCTGATGCGC TTCAACCCGG TGGGGGCCGC TGCGGGCGGG TATTTCTTCA CCTTTTCGCT TCCCATGTCA GCCCAGCTCG GCCACATACA ATTGCTGCCC CGCTTCATGA TCCCGCTGGC GTTCTATTTC GTACTCAGAT TTCTCCAGCA AAAGAGGAAC CGCGACCTGG CGGCTGCCTG CGCATGCGTG GTAATGCAGT TTTATTGCGT CATCTATATC GGCTACTTCC TGTCACTGGG GTTGGCTTTC TTCTTCGCAT CATATCTGAT CCTGGCCTGG GACAGGATGG AGATCCGCGC CATCTTGTGG GGGTCGAGAA AGGAGTTCTC CGAGAGAGCG CTGGCATTGG TAGGATCGGG AGCAGCCCTG ATGCCCCTTA TGTATCCCTA CTATCTGCGC TCCATCTATT CCGAGCCAAT CTCCTGGGAC GTCATCTCCT CGATGCTGCC CAGGGCTTAC TCCTACTTTT ACACCTTCAA CGAAAGCTTG ATGTGGAGCT GGCTGTCTGG GCTTGGTAAC GCTCTACCAA TGGCGCATGA GCACCGCCTT TTCGTGGGGC TCCTTCCCCT TCTGGCCTTG ATGCTGCCCC CCCTGCTTTG GATCAGGTCC CCCGATTGGC GGCTGACCTT GATGGGAAAG CTGTTTTTGC TCACCCTGCT GGGAACCACC TTGTTCACCT TTTACGTCGA CGGTGTATCG CTGTATCGGC TGATCTACTG GGCACCCGGT GTTAATGCCA TAAGGGCTCC GGTAAGAATC ATCCTGATGC AGTCATTTCT AGTCGCCGTG ATGGTTGCTC TCGCCACAAC CTTCATGGCG AGCCTGCTGA AGGATCGGCC TGAATGGGCG AAAATAATTC TGGCCGGCGC ACTGCTGATC CTCATGGTCT GTGACCAAGG GTTGGTGCGT ACATACAAAT TGAGTTATGA CAAATCAGCT GCACAAAGCC GTGCGCGGGC CCTCGAAGAA CTGGTGCTAG CGCGGGATCC AAAGGCAAAG CTGTTCCTGT ATCTGCCATC GAACCCCCTT GAGGAGCGGG CAGAGAGGAA CCTCGATGCT ATGTTGGCCG CGCAGCATAT GGGTATATAC ACCGTCAATG GCCACAGCGG ATACGAGCCC CGTGACAACC GCCCCGATCC TCGCAAGCCG AACTACTGCT CACATCTGCG GGAGTGGTTC GTGTCGGCGA AAGCCAATTA CCGTGAGCTG GACAACGTTG ACCTGCTTGA CCATCTCTTG ATTGTTGGCG ATCTTTCCTG TATTGGAGGT GGAGCCGATT AG
|
Protein sequence | MHDDRHPLTL REYGATVAML LLGMYFIPLS VMHLDLSRIP GDLVDSRLNN YFLEHGYRWV TGQVASFWNA PFFFPAPQVM TLSDNHLGTL PLYSLFRLLN FDRETAYQLW MLTLFVLNYL CAVVVLNLMR FNPVGAAAGG YFFTFSLPMS AQLGHIQLLP RFMIPLAFYF VLRFLQQKRN RDLAAACACV VMQFYCVIYI GYFLSLGLAF FFASYLILAW DRMEIRAILW GSRKEFSERA LALVGSGAAL MPLMYPYYLR SIYSEPISWD VISSMLPRAY SYFYTFNESL MWSWLSGLGN ALPMAHEHRL FVGLLPLLAL MLPPLLWIRS PDWRLTLMGK LFLLTLLGTT LFTFYVDGVS LYRLIYWAPG VNAIRAPVRI ILMQSFLVAV MVALATTFMA SLLKDRPEWA KIILAGALLI LMVCDQGLVR TYKLSYDKSA AQSRARALEE LVLARDPKAK LFLYLPSNPL EERAERNLDA MLAAQHMGIY TVNGHSGYEP RDNRPDPRKP NYCSHLREWF VSAKANYREL DNVDLLDHLL IVGDLSCIGG GAD
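
The coordinate, length, and protein fields of this record can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are inclusive coordinates, and that the gene sequence includes the terminal stop codon (the sequence shown ends in TAG). The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 1928366
end_bp = 1930027
gene_length_bp = 1662
protein_length_aa = 553

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which is not translated.
codons, remainder = divmod(span, 3)
expected_aa = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1930027 - 1928366 + 1 = 1662 bp, and 1662 / 3 = 554 codons, one of which is the stop, leaving 553 aa.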