Gene Information |
Locus tag | GM21_1007 |
Symbol | |
ID | 8136329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1186285 |
End bp | 1188285 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644868620 |
Product | hypothetical protein |
Protein accession | YP_003020828 |
Protein GI | 253699639 |
COG category | |
COG ID | |
TIGRFAM ID | |
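
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1186285
end_bp = 1188285

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "2001 0 666"
```

Under these assumptions the result agrees with the listed Gene Length (2001 bp) and Protein Length (666 aa).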
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 113 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTTT TAGCCTTTTG TAGCCACTAC CTGCCGCCGA TTGCGTGTGG TGGTACCCTC TGCTGGCTTG CCTGGAAATC TAACAAGGAA TTCATTCAAC CATCCAAGAT TCTTAAACGA GAACTGACTG AAGCCATCAA AAACCTGAAG GAATTGCGAC ATGGTGCCAG TGGCCAGCCG GTCACCGATC TTGCCGTTAT AGCGGATAAG GTCATGACAG GAGAAGTGCT TCGCCATCTC TGGAGCGAGT ACCAAGAGAC GCTGCACCCG CAAAAAGGGA TAGATACCGA CGGTTTCGAG CGCATTGTCC GCTACCGCTC GACCGCCCTC TCTGAGACTT TTTTCACCGA CCAGGCACTA GTAGACACAC CGCTAAAGAC TGCTTTTTTC AAACATCTAC CGGGTATCCT GACTGGAATT GGCATCATAG GGACGTTCTT TGGACTTATC ACTGGGCTAA AAAGCTTTGA GGTTTCCGCC AACGCCGATC TTGTCAGAAA GAGCCTTAGC GGTCTTTTGT CCAGCGTAGG CGATGCCTTC ACAGTTTCAC TGATGGCGAT CGCATCCGCC ATGATTTTCA CTTTTATCGA GAAAACAGCG GTTACTCAAT GCTATGGACA CGTTGAGGAG CTTACCCAGC TCATCGACAG CTTGTTTGAC GCGGGAGCCG GCGAGGAGTA CCTTTCCCGC TTGGTGGCGG CCTCCGAAAC ATCTGCCACG CAAGCGACCC AACTGAAAGA CGCGCTAGTG TCAGAGTTCA AGCAGATCAT GGCTGAAGTC ACCGAGTTGC AGGTCTCCGC AGCCGCTAAA CACAGCTCGG CCATGTCCTC CGCCATCGTC GAGAGCTTCA CCGAAAGTAT ATCCGAGCCA ATGGAGAGGA TCTCGCGGGC GGTGGATAAC GTCGGGACCA ACCAAGGGGA TGCGGTCAAC AGGCTTCTAA CCGACGTGCT TGCTAACTTC AGTTCCCAGA TGGAGGGTAT TTTTGGGGGG CAACTGCGCG GGATAAACGA GCTCCTGATC CAAACTACCG AGACGATGCA GGGTGTGTCG TCGAGATTCG AAACGGTCGC CGCTGGGGTG CAAACTGCGG GGGAGAGCGC CGCAGACGCC ATGGCGGAAA AACTCTCCCA AGCCATAAGC TCCATTGAGG CAAGACAAGA GATCATGAAT GCCCAGATGG GCGAATTTGT GGTGCAGATC AAAAATCTTG TCCATGAGTC GCAAACCGAG ACTTCCCAGA AGATGCAGGG GATTCTGGCC GATTTGGGAG AAAAGGTGTC CGGGATGGTG ACGCAGTTGG AGGAGCAGTC CCGCGAGAGT ACCAAGTTGC ACCAGACCAG CCAGATCCAG TTTGCAGAGA CGACGAACGC AACGGTCGAC GGGATCGGCG GCATGGTACA AGCACTCGCC GAGGAGGTGC AGTCAGCTAG CGATGCCATG CGGCAGAGTG TGTCCAGCCT GTCCCAGTCA AGCCGGGAGT CGATTGACAA GCTCAACAGC GGTGCCGAGG TACTCTATAT GGCCTCCAGC GAATTCGCTA AGGCAGGCAA AGGGGTAACG GACACGGTCC GGGAAAGCGG CCGAGCGGTC GAGACGATTA CCGGGGCGAC AAACCTGCTA GGGGCAATTG TGAAGGACGT CCGGAGCATC CTGAGTGAAA ACGAAAGGGC GAAGGATACT TTCGGCGCGA TGGTGAACGA CCTCAGGAGC CTGGTGGAAA ACGCAAAACG CGAGGCCTCA ATGTCTGAGG AGGTGATCGC CTCCATCAGG CACGCCGCCG AGCAGCTGAG TTCCGCAGGA 
CAGCAGGCCG AAGATTACCT GCGGGGAGTC ACCGAAGTCT TGGGCAATGC CCACGTGGAG TTCGCCCGCA ACATCGAGCG TACCCTGAAC CATGGCAATA CTACGTTCCA AAAGGAGCTC TCCATGGGGG TTGGCTTGCT CAACATCGCA ATCAAAGAAC TGGGAGACAC CATCGACGAG TTCCCCAGGG GTAACGCGTG A
|
Protein sequence | MDVLAFCSHY LPPIACGGTL CWLAWKSNKE FIQPSKILKR ELTEAIKNLK ELRHGASGQP VTDLAVIADK VMTGEVLRHL WSEYQETLHP QKGIDTDGFE RIVRYRSTAL SETFFTDQAL VDTPLKTAFF KHLPGILTGI GIIGTFFGLI TGLKSFEVSA NADLVRKSLS GLLSSVGDAF TVSLMAIASA MIFTFIEKTA VTQCYGHVEE LTQLIDSLFD AGAGEEYLSR LVAASETSAT QATQLKDALV SEFKQIMAEV TELQVSAAAK HSSAMSSAIV ESFTESISEP MERISRAVDN VGTNQGDAVN RLLTDVLANF SSQMEGIFGG QLRGINELLI QTTETMQGVS SRFETVAAGV QTAGESAADA MAEKLSQAIS SIEARQEIMN AQMGEFVVQI KNLVHESQTE TSQKMQGILA DLGEKVSGMV TQLEEQSRES TKLHQTSQIQ FAETTNATVD GIGGMVQALA EEVQSASDAM RQSVSSLSQS SRESIDKLNS GAEVLYMASS EFAKAGKGVT DTVRESGRAV ETITGATNLL GAIVKDVRSI LSENERAKDT FGAMVNDLRS LVENAKREAS MSEEVIASIR HAAEQLSSAG QQAEDYLRGV TEVLGNAHVE FARNIERTLN HGNTTFQKEL SMGVGLLNIA IKELGDTIDE FPRGNA