Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0761 |
Symbol | |
ID | 8136076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 907850 |
End bp | 909790 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644868378 |
Product | HEAT domain containing protein |
Protein accession | YP_003020593 |
Protein GI | 253699404 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.181899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGCCC TGAGGAACCA CCCCTTCGCA GGCACTATGC AGCTCATCTT CAAGGCCATG GGGGACGAAA GCTGGCGCGT CAGGAAAGAG GCTGTCTCGG CGGTTCTCCA GGCGCAACCG CTCGAAGCGC CGGTGCTCGA AGCCTTGATC GCCGCGCTGC GCGCCTCGGA AAACGCGGGG CTCAGGAACT CGGCGGTGGA GGCGCTGGAG CTTATCGGCG CCGCCGCGGT GCAGCAGCTC TGCTCGCACC TGAACGACTC CGACCCCGAC CTGCGCAAAT TCATCATCGA CATCCTCGGA AACATCGGCT GCGAAAAGTG CCTCCCGCTC CTGGTCCGGG CGCTCGACGA CGACGACATG AACGTGCGGG TCGCCGCCGC TGAAAACCTG GGCAAGATAG GGGACGTGGG CGCTCTGCCG CACCTGTTGA CGGTGCTGGA GGGGGGGGAG ATCTGGCTCA AGTTCACGGT GCTGGATGCC CTGGCCCTGA TCGGCGCCCC GGTGCCGCTC GTCTCGCTCG CCCCGCTGCT TCAGGAAAGC CTCTTGAGGC GCGCGACGTA TGATTGCCTC GCGGCCTTAG GGGACGCCCA ATGCCTTCCC ATACTGCTGC AGGGGCTGCA GGAAAAGGCG AAGAACGCCC GCGAGGCGGC CGCCGTGGCG CTGATGCGGG TGCGGGAGCG CCTGCCGGCC GAAAAGCGGA CCCCCCTGGT CGACCTTCGC TTGCAGGAAA TGAGCGGCGG CCCGGTCGCG AAGAAGCTGA TCGATTCGCT GCACAGCGAG GACCCCGTGG TCCTGAACGC GCTGGTCCGC ATCGTTGGGA CCGTCGGGGA CACGCGGGCG GCGCTGCCGC TTTTGCACGT CTGCCGCAGC GAACGGCTCA GAGACGCCTG CATCGACGCG TTCCGGCGCA TCGGCCCGAA GCTTTTGCCG GAGCTCGTGG ACCACTTCCC GACCGCTGTT CCCATCGAGC GCGCCGTCAT AGCGCAGCTC ATCGCCGAGT TCGGCGACAC CGGCCATGAA AAGCTCCTCC TCGACGGGCT CCTCGACGAC AGTGCCGAAG TCAGGCGCTC TTGCGCGCTC GCCTTGGGAA GGCTCAAGCC CCAGGGCGCC GTGACGCGTC TGGCCCAACT GCTAGACGAT GGCGAGCCGC AGGTGCGCGA GGCGGCCCTG GAGGGGGTGC GGGCGTTTTC GGCCACCGAA CCCGCCACCC TGAGCGCGCT GGCCTCGGAG TTGACGCACG CGCAACTCCC CGCCAAGAGG CGCAACGCAG CTCTCATACT CGGGGCACTT GCGGACGGCG AGCGGCTCTC TCCGCTGGCG AAGGACGAGG ATGCGACGGT GCGTCAGGCG GCGGTGTCGT CGCTTGCGCG GGTCGAACTA CCCCAGGTGC TCTCGCATCT GGCTCTGGCG CTTTCGGACG AGGAGCCGGA GGTACGCCTG GCCGCGGCGC ATGCCCTCTC CGACCGGGGG GGACCCGAAG CGCTGGCCCC GCTTTTGCTC GCCTTGAACG ATAGCGACCC GTGGGTGCAG ACGGCGGCGC TCAAGGGACT TGCCGCCCTG GGCGACGGCC GCGCTCTTTC CGGCGTCCTC GCGCTGCTGG ACCAGGCCAG CGGCCCGGTG CTGATCGCCG CACTTTCGAC GGTCGCGGCA CTTGGGGGGG CAAATGCCCT CGCCCCTGTG GAGAAGGCGC TATCAAACAG CGACGAAGAG GTGGTTGAGG CGGCGATCGA GATCCTGTCA GGTTTCGGCC GCGGCTGGAT CGATGGGCAC TGCGACGCCC TTCTCGCACA TCCGCACTGG GTTGTGCGGC GCAGCTTCGT GCGCGCCCTG GCGAGGCTTC AGGGGGCTCA GTCCGTGGCG ATCCTGGACC GGGCCCTGGC GGGCGAGCCG GACCAGTTGG TGAGAGGCGA GATCGCGGCA TTGCTCGACA GGCTGCGCTG A
|
Protein sequence | MVALRNHPFA GTMQLIFKAM GDESWRVRKE AVSAVLQAQP LEAPVLEALI AALRASENAG LRNSAVEALE LIGAAAVQQL CSHLNDSDPD LRKFIIDILG NIGCEKCLPL LVRALDDDDM NVRVAAAENL GKIGDVGALP HLLTVLEGGE IWLKFTVLDA LALIGAPVPL VSLAPLLQES LLRRATYDCL AALGDAQCLP ILLQGLQEKA KNAREAAAVA LMRVRERLPA EKRTPLVDLR LQEMSGGPVA KKLIDSLHSE DPVVLNALVR IVGTVGDTRA ALPLLHVCRS ERLRDACIDA FRRIGPKLLP ELVDHFPTAV PIERAVIAQL IAEFGDTGHE KLLLDGLLDD SAEVRRSCAL ALGRLKPQGA VTRLAQLLDD GEPQVREAAL EGVRAFSATE PATLSALASE LTHAQLPAKR RNAALILGAL ADGERLSPLA KDEDATVRQA AVSSLARVEL PQVLSHLALA LSDEEPEVRL AAAHALSDRG GPEALAPLLL ALNDSDPWVQ TAALKGLAAL GDGRALSGVL ALLDQASGPV LIAALSTVAA LGGANALAPV EKALSNSDEE VVEAAIEILS GFGRGWIDGH CDALLAHPHW VVRRSFVRAL ARLQGAQSVA ILDRALAGEP DQLVRGEIAA LLDRLR
|
| |