Gene GM21_1839 details

Gene Information

Locus tag: GM21_1839
Symbol:
ID: 8137170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2140364
End bp: 2142199
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 64%
IMG OID: 644869450
Product: MCP methyltransferase, CheR-type with Tpr repeats
Protein accession: YP_003021650
Protein GI: 253700461
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG1352] Methylase of chemotaxis methyl-accepting proteins
TIGRFAM ID:
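
The coordinates and lengths in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2140364
END_BP = 2142199
GENE_LENGTH_BP = 1836
PROTEIN_LENGTH_AA = 611


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus the stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1836 bp, and 1836 / 3 = 612 codons, of which one is the stop codon.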


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.00242701
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
TTGCCATTGC AGATCCTCCT CATAAGCGAC AACGCTTCCT TTGCGGACTA CCTGAACCAT 
CTCCTGGGCG AGGCGGGACA TGCGGTGACG ACGCTCCATG ACCCAAGCCA GGGGTTTCGC
GCGGTCAGGC GGCAGAAGCC GGAGCTGATC ATCCTCGCCG TTGAGCCGGC GAAGCTTGCC
CCTTTGGCGG TTGAGCACCG CGTCCATACG CCGGAACCGA GAGTCATTCC GGTGATAGTA
ATCTCGGAAT GCCTGAGGCT GGAAGCGGAG CTCTTGCACG TCTTCGATTT CATCCCCAAG
CCACTCCAGA TAAAACGGCT TTTCGACGAT CTCACCTTCC TTTCGCAGCA AAACGCGCCG
TCGTGCACGC AGCACGAGAT CGACGAAGAG CTTTGCTGCG TGTTCTCCAA ACACATCCTT
AGTTGCACCG GGCTCCACTT CGAACAGCGC AACCGCGCCG CACTTTTGCG GGGGCTCGCC
AAGCGGATGT ACGCCCTGCG CATCGGCAGC CATCGCGACT ATCTCGCCTA CCTGAAGCTG
CACGGCGAAG ACCGCCACGA ACTGCAGAAA CTCCTGCAGT TCCTCACGGT GGGAGAAACC
TACTTTTTCC GCTATCCCGC CCACTTCGCC GCCCTCAGAG AGCGTTTCAA CCCACCTCCC
CCCGTCGACC GGCCGATAAG GATCTGGTCG GCCGGATGCT CCACGGGGGA GGAGCCGTAC
TCGATTGCGA TCACCCTGAT GGAGGCGCTC CCGGACTGGA GGGACCGCGA CATCAGGATC
GTCGCCACCG ACATCAACAA CCGCTCGCTC AAGCTGGCGC GCGAGGGGGT TTACTCCCCC
TGGTCGCTGC GGATTACGCA GGGGGAGCAG ATCGGGCGCT ACTTCGACCG GGTCGGCCAG
AGCTTCCTGA TCAAGGACGA GGTGAAAAGG CTGGTCCATT TCCGTCACCT GAACCTATCA
GGTCCGGGCC ACGACGAGAT GTGGCACGAA CTCTCGGAGC TGGACGCCAT TTTCTGCCGC
AACGTCCTCA TCTACTTCAC GCCGCAGGCA GCAGACGAGG TGCTGCGGCG CCTAGCCGAC
GCCCTGAAGA TCTCCGGCCA GCTCTTCCTT GGGCACGCCG AGACGCTGCT GCAGCAAGAC
AGCGAGCTGG AGATACGGCG CCAGGGGAAA ACCTTCTTCT ATCTTAAAAG CGGGCCACGC
ACCCCTCAGA CGCCCCAGCC CCCGCCCTGC GTGAAACCGG CGCCGCAAGC CATGCCGGCC
GCTTCGACCG TGCCGGCACA GGCGGCGCCC AAAGTAGTTT CACCTCCCCC GCCGGAGGCC
TGCGCCCCGC CCCCCCCTGC TCCGCTTGAT GTCGAAGCGG CCCGCGAGCT CTTCGACCGG
GAGGAATTCG ACCGGGCACA GGAGCTGCTG GACCGGATAC TCGCGGAAGA TCCTGCCAAC
GCGGCGGCGC TGGTGCTGGT CGCCTTCATC CAGGCGGGAA AGGGGGCGCT GCAGCAAGCG
CTCAAGAGCT GCAGCAGGGC ACTGGAACTG AACGACCTCC TGCCGGAAGC GTACTTTCTC
AAGGGGGTGA TCCTCGACGC CGAGGACCGC CTGGCCGAGG CGGCCGACGA GTACAGAAAG
GCCCTTCTTT TGGAGCACGA GTTCGTCATG CCGCGCTACC ACATGGGGAG GCTGCATCTG
AGACTGGGGC GCCAGGCCGA GGCGGCGCGC GAGATCAGAA ACAGCATCAG GATCCTCGCC
CGGCACGACG ACAACGATAC CGTCCCCTTC TCGGGCGGCC TTACCCGGGC CGTCTGCATG
ATGCAACTGC AAAACGCACT GGCGCAGGTC GCCTGA
 
Protein sequence
MPLQILLISD NASFADYLNH LLGEAGHAVT TLHDPSQGFR AVRRQKPELI ILAVEPAKLA 
PLAVEHRVHT PEPRVIPVIV ISECLRLEAE LLHVFDFIPK PLQIKRLFDD LTFLSQQNAP
SCTQHEIDEE LCCVFSKHIL SCTGLHFEQR NRAALLRGLA KRMYALRIGS HRDYLAYLKL
HGEDRHELQK LLQFLTVGET YFFRYPAHFA ALRERFNPPP PVDRPIRIWS AGCSTGEEPY
SIAITLMEAL PDWRDRDIRI VATDINNRSL KLAREGVYSP WSLRITQGEQ IGRYFDRVGQ
SFLIKDEVKR LVHFRHLNLS GPGHDEMWHE LSELDAIFCR NVLIYFTPQA ADEVLRRLAD
ALKISGQLFL GHAETLLQQD SELEIRRQGK TFFYLKSGPR TPQTPQPPPC VKPAPQAMPA
ASTVPAQAAP KVVSPPPPEA CAPPPPAPLD VEAARELFDR EEFDRAQELL DRILAEDPAN
AAALVLVAFI QAGKGALQQA LKSCSRALEL NDLLPEAYFL KGVILDAEDR LAEAADEYRK
ALLLEHEFVM PRYHMGRLHL RLGRQAEAAR EIRNSIRILA RHDDNDTVPF SGGLTRAVCM
MQLQNALAQV A
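
The gene sequence begins with TTG, while the protein begins with M. This is consistent with translation table 11, in which TTG is one of the permitted alternative start codons and is read as methionine at the first position only. The following is a minimal sketch of that rule, not the pipeline used to produce this record. The codon table is built from the standard amino-acid string shared by NCBI tables 1 and 11, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon is read as Met at position 0 only.
        if i == 0 and codon in START_CODONS_11:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("TTGCCATTGC AGATCCTCCT CATAAGCGAC"))
```

Applied to the first 30 bases of the gene, this yields MPLQILLISD, matching the start of the protein sequence. The second TTG, at codon 3, is internal and is read as leucine.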