Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1000 |
Symbol | |
ID | 8136322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1180315 |
End bp | 1181607 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644868615 |
Product | hypothetical protein |
Protein accession | YP_003020823 |
Protein GI | 253699634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 115 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCGA ACATTATGAT GCCCACACCG AACATGGCTG CCTCTTCTGA GGCCTCGGAT GAACTGGTTA AACGGTTGGC AGAACTGAGC GAGGTGCAGT ATGACCAGGT GCGTAAAGAG CAGGCGAAGG TACTTGGGAT TAGGCCGTCG ACACTTGATG CTGCAATCAA AAAGGCGAGA AAGCCTTCTG ATACTGCGGC AATCCTCTTC GAGGAAGTCG AGCCTTCTCC CATGCCGGTC GATCCTGGCA TCCTGCTGAG CGACCTAAGA GACACTATTC GCCGCTTCGT TGTCTGCACT GAGCAGGCGG CTATAGCGGG TGCACTCTGG ATAGCCATGA CTTGGTTCAT AGATGTTGTC TCGGTCGCAC CTCTGGCTGT AATTACCGCG CCCGAGAAAC GCTGTGGCAA GAGCATTTTG CTTGGATTAT TTGAGAAACT GACCATGAAA CCCTTAGCGG CAAGCAATAT CACGCCGGCA GCCTTCTTCC GTGCAATTGA TGCATGGTCT CCGACCTTGT TGATCGATGA AGCTGACGCC TTTATGAAGG ACAACGAGGA ACTACGGGGG CTGCTCAACA GCGGACATAC GAGGGCCTCT GCCTACGTGA TCCGCACGGT CGGTGACAAC TTCACGCCAA CTAAGTTTAA TACCTGGGGA GCCAAAGCCC TGGCAGGCAT CGGAAACCTA CCTGGTACCG TGATGGACCG CTCCATTGTC TTAGAACTGC GGCGCAAGCT CGCTACTGAG ACAGTTGAGC GCCTTCGGTA TGCTGATGAG CAAGTATTTA AGGATCTCAA GGGGAGGCTG GCGAGGGTTA GGGAGGACTA CATGGAGACG TTACACAAGG TTCGCCCACC CTTGCCGGCG CAACTCAACG ACAGAGCGAT GGACAACTGG GAGCCACTGC TGCAGATAGC AATGCTAGGT GGCGAAGCCT GGTTTGAAAC TGCCACTAAG GCGGCAATTA AGATAGCAGC CAAGGAAACT GGGACTATAA CGACTGGTAC CCAGTTGCTG GTAGACATAA GGGATATCCT CAACTCGAGA AGCAGCGACC GAATCAGCTC CCAGGACCTG ATCGACGCAT TATGCCTGGA CGCCGAGTTG CTGTGGTCCA CCTACAACAG GGGAAGGGCG ATCTCTACTC GTCAGCTTGC GATGTTGCTT AAACTGTTTG GCATCCATAG CAAGACAATC CGTTACAACA ACAGCACGGC TAAAGGCTAT GAAGCGGAGC AATTCCTCGA TGCCTTTTTC CGTTACATTC CCCCTGTCAC TCTAGAGCAG TAA
|
Protein sequence | MQSNIMMPTP NMAASSEASD ELVKRLAELS EVQYDQVRKE QAKVLGIRPS TLDAAIKKAR KPSDTAAILF EEVEPSPMPV DPGILLSDLR DTIRRFVVCT EQAAIAGALW IAMTWFIDVV SVAPLAVITA PEKRCGKSIL LGLFEKLTMK PLAASNITPA AFFRAIDAWS PTLLIDEADA FMKDNEELRG LLNSGHTRAS AYVIRTVGDN FTPTKFNTWG AKALAGIGNL PGTVMDRSIV LELRRKLATE TVERLRYADE QVFKDLKGRL ARVREDYMET LHKVRPPLPA QLNDRAMDNW EPLLQIAMLG GEAWFETATK AAIKIAAKET GTITTGTQLL VDIRDILNSR SSDRISSQDL IDALCLDAEL LWSTYNRGRA ISTRQLAMLL KLFGIHSKTI RYNNSTAKGY EAEQFLDAFF RYIPPVTLEQ
|
| |