Gene GM21_0761 details


Gene Information

Locus tag: GM21_0761
Symbol:
ID: 8136076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 907850
End bp: 909790
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 68%
IMG OID: 644868378
Product: HEAT domain containing protein
Protein accession: YP_003020593
Protein GI: 253699404
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
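
The start and end coordinates, gene length, and protein length in this record are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(907850, 909790)
protein = protein_length_aa(length)
print(length, protein)
```

With the values from this record, the computed lengths agree with the listed 1941 bp and 646 aa.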


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value: 0.181899
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGGCCC TGAGGAACCA CCCCTTCGCA GGCACTATGC AGCTCATCTT CAAGGCCATG 
GGGGACGAAA GCTGGCGCGT CAGGAAAGAG GCTGTCTCGG CGGTTCTCCA GGCGCAACCG
CTCGAAGCGC CGGTGCTCGA AGCCTTGATC GCCGCGCTGC GCGCCTCGGA AAACGCGGGG
CTCAGGAACT CGGCGGTGGA GGCGCTGGAG CTTATCGGCG CCGCCGCGGT GCAGCAGCTC
TGCTCGCACC TGAACGACTC CGACCCCGAC CTGCGCAAAT TCATCATCGA CATCCTCGGA
AACATCGGCT GCGAAAAGTG CCTCCCGCTC CTGGTCCGGG CGCTCGACGA CGACGACATG
AACGTGCGGG TCGCCGCCGC TGAAAACCTG GGCAAGATAG GGGACGTGGG CGCTCTGCCG
CACCTGTTGA CGGTGCTGGA GGGGGGGGAG ATCTGGCTCA AGTTCACGGT GCTGGATGCC
CTGGCCCTGA TCGGCGCCCC GGTGCCGCTC GTCTCGCTCG CCCCGCTGCT TCAGGAAAGC
CTCTTGAGGC GCGCGACGTA TGATTGCCTC GCGGCCTTAG GGGACGCCCA ATGCCTTCCC
ATACTGCTGC AGGGGCTGCA GGAAAAGGCG AAGAACGCCC GCGAGGCGGC CGCCGTGGCG
CTGATGCGGG TGCGGGAGCG CCTGCCGGCC GAAAAGCGGA CCCCCCTGGT CGACCTTCGC
TTGCAGGAAA TGAGCGGCGG CCCGGTCGCG AAGAAGCTGA TCGATTCGCT GCACAGCGAG
GACCCCGTGG TCCTGAACGC GCTGGTCCGC ATCGTTGGGA CCGTCGGGGA CACGCGGGCG
GCGCTGCCGC TTTTGCACGT CTGCCGCAGC GAACGGCTCA GAGACGCCTG CATCGACGCG
TTCCGGCGCA TCGGCCCGAA GCTTTTGCCG GAGCTCGTGG ACCACTTCCC GACCGCTGTT
CCCATCGAGC GCGCCGTCAT AGCGCAGCTC ATCGCCGAGT TCGGCGACAC CGGCCATGAA
AAGCTCCTCC TCGACGGGCT CCTCGACGAC AGTGCCGAAG TCAGGCGCTC TTGCGCGCTC
GCCTTGGGAA GGCTCAAGCC CCAGGGCGCC GTGACGCGTC TGGCCCAACT GCTAGACGAT
GGCGAGCCGC AGGTGCGCGA GGCGGCCCTG GAGGGGGTGC GGGCGTTTTC GGCCACCGAA
CCCGCCACCC TGAGCGCGCT GGCCTCGGAG TTGACGCACG CGCAACTCCC CGCCAAGAGG
CGCAACGCAG CTCTCATACT CGGGGCACTT GCGGACGGCG AGCGGCTCTC TCCGCTGGCG
AAGGACGAGG ATGCGACGGT GCGTCAGGCG GCGGTGTCGT CGCTTGCGCG GGTCGAACTA
CCCCAGGTGC TCTCGCATCT GGCTCTGGCG CTTTCGGACG AGGAGCCGGA GGTACGCCTG
GCCGCGGCGC ATGCCCTCTC CGACCGGGGG GGACCCGAAG CGCTGGCCCC GCTTTTGCTC
GCCTTGAACG ATAGCGACCC GTGGGTGCAG ACGGCGGCGC TCAAGGGACT TGCCGCCCTG
GGCGACGGCC GCGCTCTTTC CGGCGTCCTC GCGCTGCTGG ACCAGGCCAG CGGCCCGGTG
CTGATCGCCG CACTTTCGAC GGTCGCGGCA CTTGGGGGGG CAAATGCCCT CGCCCCTGTG
GAGAAGGCGC TATCAAACAG CGACGAAGAG GTGGTTGAGG CGGCGATCGA GATCCTGTCA
GGTTTCGGCC GCGGCTGGAT CGATGGGCAC TGCGACGCCC TTCTCGCACA TCCGCACTGG
GTTGTGCGGC GCAGCTTCGT GCGCGCCCTG GCGAGGCTTC AGGGGGCTCA GTCCGTGGCG
ATCCTGGACC GGGCCCTGGC GGGCGAGCCG GACCAGTTGG TGAGAGGCGA GATCGCGGCA
TTGCTCGACA GGCTGCGCTG A
 
Protein sequence
MVALRNHPFA GTMQLIFKAM GDESWRVRKE AVSAVLQAQP LEAPVLEALI AALRASENAG 
LRNSAVEALE LIGAAAVQQL CSHLNDSDPD LRKFIIDILG NIGCEKCLPL LVRALDDDDM
NVRVAAAENL GKIGDVGALP HLLTVLEGGE IWLKFTVLDA LALIGAPVPL VSLAPLLQES
LLRRATYDCL AALGDAQCLP ILLQGLQEKA KNAREAAAVA LMRVRERLPA EKRTPLVDLR
LQEMSGGPVA KKLIDSLHSE DPVVLNALVR IVGTVGDTRA ALPLLHVCRS ERLRDACIDA
FRRIGPKLLP ELVDHFPTAV PIERAVIAQL IAEFGDTGHE KLLLDGLLDD SAEVRRSCAL
ALGRLKPQGA VTRLAQLLDD GEPQVREAAL EGVRAFSATE PATLSALASE LTHAQLPAKR
RNAALILGAL ADGERLSPLA KDEDATVRQA AVSSLARVEL PQVLSHLALA LSDEEPEVRL
AAAHALSDRG GPEALAPLLL ALNDSDPWVQ TAALKGLAAL GDGRALSGVL ALLDQASGPV
LIAALSTVAA LGGANALAPV EKALSNSDEE VVEAAIEILS GFGRGWIDGH CDALLAHPHW
VVRRSFVRAL ARLQGAQSVA ILDRALAGEP DQLVRGEIAA LLDRLR
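
The GC content listed in the gene information can be recomputed from the gene sequence, which is printed here in space-separated blocks of ten bases. The following is a minimal sketch under that assumption: it strips the whitespace, computes the percentage of G and C bases, and checks that a coding sequence is in frame and ends with one of the stop codons of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the sequence length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First two ten-base blocks of the gene sequence above, for illustration only.
first_blocks = clean_sequence("GTGGTGGCCC TGAGGAACCA")
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give the figure to compare against the listed GC content. Note that the gene begins with GTG, an alternative start codon under translation table 11, which is why the protein sequence begins with M.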