Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1087 |
Symbol | |
ID | 8136409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1273686 |
End bp | 1275479 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868698 |
Product | surface antigen (D15) |
Protein accession | YP_003020906 |
Protein GI | 253699717 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0729] Outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000000000374236 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTGACAC TGATAAAGGT CCGCCCGATG CCCAAACGGT TTCCTCCATA CCTAGCTCTT ATCTGCCTCG CGGCGCTGCT TCTGCGCGCG CCTGCCGCCT TCGCGGCCGA TCCGGTCGAG ATCGCGGTGA CGGGTGTGGA AGGCGATCCC CTGGAGAACG TGAGGCAGGC GCTGGCGCTT CCCTACGGCC TGGTCCGGGA GGGGAAAGTG GACCGGCTCT GGCTGGACCG CTTCGCCAAA AAGGCGCCGG ACAAGGTACG TCAGGCGCTT GAGCCTTACG GTTACTACAA GTCGGAGGTT TCGGCCCGGG TGGCTCGGAC ACGGGACGGC AAACCGAGCC TGCAGGTCGT CGTCATCCCG GGAGAACCGG TGCTGGTCAC CGAGGTGACG GTGGAGTTGA AGGGGGCGGG AGCTAAGGCG AGAAGGCTTT CGCGGGTGCG GGACGCCTTT CCGGTGAGGA GGGGGGAGGT GCTGCTGCAA CCGGATTACG AGCGGGGCAA GGGGGCGCTG CAGTCGCAGG CCCGGAAGCT TGGCTACCTC GACGCCTCCT TCCCCCGCCA CGAGATCCGC ATCAGTGAAG ACCGCACGAA AGCCAGGATC GCCCTGGTGC TGGACACCGG CCCCCGTTTC TATTTCGGCC CGGCGGTCAT TCAGGGAGCG CCCGATTACC CCGAGTCCTA CCTGAAGCGG TTCGTCGCCT TCAAGGAAGG GGAACCTTTT TCCTACGCGA AGATCGGGGA GACGCAGCTC AATTTCGCCA ACTCCGAGCG CTTCAGGCAG GTGGTGGTCA CGCCGGAGCG CGAGCGGGCC GAGAACGCCC GGGTACCGGT CGTGGTGCAG CTTACCGAGG CCCCGCGCCG CACGGTAAGG CCCGGGATCG GCTACGGGAC CGACACCGGC GCCAGGTTCT CCACCCACTA CCGGGACTTG AACCTCTTCC ACAAAGGGCA CGACCTCGAC TTGAGCCTCT ACGCGGCGCA ACGCCTGCAG GGATTCGCCG GGCGCTACAC CATTCCGAGC AGCAGCGATT ACCGAAGTTC CACCGCCTTG CAATTGAACC TGCAAAAGGA AGACGTCACC AATTACTTGA GCAAGATCGT CGCTTTGGAG CTGGACCGAA ACATTGGCCT TGGGCGCGGC GAGCTGGCGA CGGCGTACGT GAGGCTGCTG CAGGAGGTCT TCACCATCGG CGACGAGAAC GCGAACTCCA GGCTGGTGCT TCCTGGATTC CGCTTCTCCA AGGAAACCTT CAACAACATG GTCCGCCCCA GGCGCGGTTA CGCCTACACC TTGGAGCTGC GCGGCGCTCA CCCGTATCTG GGATCCGACA CCGGCCTCAT CCAGGGGATC GCCCATGCCA ACCTGCTGTT CCCGCTACCC TGGCGCCTCT CCCTGCAAAG CCGGGGGGAC GCGGCGTACA GCCTGCTCGA TGACCCCTTC TCGGAACTTC CCCCCTCCAT CCGCTTCTTT GCCGGCGGCG ACCAGAGCGT GCGCGGCTAT TCCTACCAGA GCTTGGGCCC CAGCGACTCC TCCGGGAAGG TGGTGGGGGG GAGGCACCTG CTGGTGGGGA GCCTGGAGCT TTTGCGCGCC CTCTACAAGG ACTGGGGGGT GTCGGTCTTC TACGACATAG GGAACGCCTT CAACAACTAC GCGGACATGC GCCTGAAAGA CGGGACCGGC GTGGGCATCC ATTACTACAC CGCGGTCGGG GGGCTGAACC TCTACCTCGC CAAGCCGCTT GCCACCGGTG CGGGGAGCTA TCGCATCCAT TTCACCGTGG GGTTCCAGCT ATGA
|
Protein sequence | MLTLIKVRPM PKRFPPYLAL ICLAALLLRA PAAFAADPVE IAVTGVEGDP LENVRQALAL PYGLVREGKV DRLWLDRFAK KAPDKVRQAL EPYGYYKSEV SARVARTRDG KPSLQVVVIP GEPVLVTEVT VELKGAGAKA RRLSRVRDAF PVRRGEVLLQ PDYERGKGAL QSQARKLGYL DASFPRHEIR ISEDRTKARI ALVLDTGPRF YFGPAVIQGA PDYPESYLKR FVAFKEGEPF SYAKIGETQL NFANSERFRQ VVVTPERERA ENARVPVVVQ LTEAPRRTVR PGIGYGTDTG ARFSTHYRDL NLFHKGHDLD LSLYAAQRLQ GFAGRYTIPS SSDYRSSTAL QLNLQKEDVT NYLSKIVALE LDRNIGLGRG ELATAYVRLL QEVFTIGDEN ANSRLVLPGF RFSKETFNNM VRPRRGYAYT LELRGAHPYL GSDTGLIQGI AHANLLFPLP WRLSLQSRGD AAYSLLDDPF SELPPSIRFF AGGDQSVRGY SYQSLGPSDS SGKVVGGRHL LVGSLELLRA LYKDWGVSVF YDIGNAFNNY ADMRLKDGTG VGIHYYTAVG GLNLYLAKPL ATGAGSYRIH FTVGFQL
|
| |