Gene GM21_0205 details


Gene Information

Locus tag: GM21_0205
Symbol:
ID: 8135510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 245236
End bp: 246627
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 61%
IMG OID: 644867825
Product: transcriptional regulator, GntR family
Protein accession: YP_003020048
Protein GI: 253698859
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
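
The length fields in this record follow from the coordinates, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 245236
END_BP = 246627


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1392
print(protein_length(length_bp))  # 463
```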


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0000000146418
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCATCC TCGACCATGA CAGCAAGCTG CCGCTGTACA CCCAACTCTA TGATCAGATG 
AAAGCGAGCG TCCTCTCCGG GAAGCTGTCA TCTCACTCAA AGCTGCTCTC GGTCCGGGAG
TTGGCCGCCG AACTCTCCAT CAGCCGCAAC ACCGTTGAAA ACGCCTACCT TGAGCTTTAC
GCGGAAGGTT ACATCTACAG CAAACCGCGC AGCGGGTATT TCGTTTCGGC ACTGACGCCG
GATCTCTTCA GTTTGCCGTC GCCACGAAAA AAGCTGCTGC GCCCCCCCCT CACCGAGCCG
AAGCCTAAGG GGAGCATCGA CTTCCATCCG GCGCGGCTCG ACCCGGACGC CTTTCCCGCA
TCCCTTTGGC GCAGCTGCTA CCTGGAGTCG TTGCGGCAGT GTCGCGGTGC ACTGGTACAG
TACGGCGACC CGCAGGGGGA GTGGGAGCTA CGGTGCGGCA TCGGGCGCTA CCTGGAGCGG
TCACGCGGAG TGGCCTGCGC CCCGGAGCAG ATCGTCATCT GTTCCGGGCT CCAGCAAAGC
CTCGGCATCG TGGCGCAGAT CCATAACGAG CGGCGTCCGG CGGTAGCACT AGAAAATCCC
GGTTTCCATC TCCCCAGGTC CGTTTTTCAA AACCATGGCT TCGAGACAGT TCCCATACCT
GTCGGCTCCG GCGGCCTCGA CCTCGACGCC CTGGCATCAA GCAACGCCAC CATCGTGTAC
GTGACGCCCT CCCATCAGTT CCCTACCGGC TGTGTAATGC CCATTGCGAA CCGGCTCAAA
CTGATCGAAT GGGGCACTTC AGGCGACCGC TTGATCATAG AGGACGATTA CGACAGCGAG
CTCCGCTACC ACGGCAAGCC GATCCCCTCG CTGCAGGGGC TGCACCCCGA CGGGAACATC
GTCTACCTTG GAACCTTTTC CAAGGTGCTG TCGCCGGCCT TGCGCGTCAG TTACCTGGTA
CTCCCTTACC CGCTTATATC CAGCTATCGG CAGCTTTTCC GCGACTACGC CTGCTCCGTC
TCCCTGTTGG AGCAGGCGAC CCTTGCGCGG TTCATGGAGC AGGGGCACTG GGACCGGCAC
CTGCGGCGGA TGCGCACTCT CTACCAAAAG AAGCACGATG CCATGCTGAG GGCGGTCGAG
ATCCGTTTCG GATCCAAAGC CCTTGTACTC GGACAGGGTG CGGGACTGCA CATGGTGCTG
GAACTCTCCG GGCACGACCT AAGCGAAACG GAACTCATCT CCCGAGCCAG AAGCAAGGGG
ATCGAGCTGT TTCCCTATTC CGCCACCCTT GCCGAAGACG GCAGTTGCAG CAGGGTCGTC
TTGGGCTTTG GCGGCCTGAA GCCGGACGTC ATAGACCGGG GGGTCGAACT GTTGTCCCAG
GCATGGTACT GA
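
The GC content reported for this gene could be recomputed from the sequence with a sketch like the one below. It assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace, as shown here; `gc_content` is a hypothetical helper, not part of any database tooling, and only the first row of the sequence is used in the example.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    space-separated blocks from the record can be pasted directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTCATCC TCGACCATGA CAGCAAGCTG CCGCTGTACA CCCAACTCTA TGATCAGATG"
print(f"{gc_content(first_row):.0%}")
```

Passing the complete gene sequence instead of `first_row` gives the value to compare with the GC content field.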
 
Protein sequence
MFILDHDSKL PLYTQLYDQM KASVLSGKLS SHSKLLSVRE LAAELSISRN TVENAYLELY 
AEGYIYSKPR SGYFVSALTP DLFSLPSPRK KLLRPPLTEP KPKGSIDFHP ARLDPDAFPA
SLWRSCYLES LRQCRGALVQ YGDPQGEWEL RCGIGRYLER SRGVACAPEQ IVICSGLQQS
LGIVAQIHNE RRPAVALENP GFHLPRSVFQ NHGFETVPIP VGSGGLDLDA LASSNATIVY
VTPSHQFPTG CVMPIANRLK LIEWGTSGDR LIIEDDYDSE LRYHGKPIPS LQGLHPDGNI
VYLGTFSKVL SPALRVSYLV LPYPLISSYR QLFRDYACSV SLLEQATLAR FMEQGHWDRH
LRRMRTLYQK KHDAMLRAVE IRFGSKALVL GQGAGLHMVL ELSGHDLSET ELISRARSKG
IELFPYSATL AEDGSCSRVV LGFGGLKPDV IDRGVELLSQ AWY
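
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is removed first; unknown codons become 'X' and an
    incomplete trailing codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGTTCATCC TCGACCATGA CAGCAAGCTG"))  # MFILDHDSKL
print(translate("GCATGGTACT GA"))                     # AWY
```

Both fragments agree with the start and end of the protein sequence listed in the record.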