Gene GM21_1837 details


Gene Information

Locus tag: GM21_1837
Symbol:
ID: 8137168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2138019
End bp: 2139755
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 63%
IMG OID: 644869448
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_003021648
Protein GI: 253700459
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
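
The start coordinate, end coordinate, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(2138019, 2139755)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1737 578
```

Both values agree with the Gene Length (1737 bp) and Protein Length (578 aa) fields of the record.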


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.00718884
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAATA AGTTATCGGT AAAGATCGTC AGCGTCCTGA TCATGGTCAT GATCGTCATC 
ATGACCGCCT TCTCGGTGTA CTTCGTCCGC TCGCGCAGGC AGAACATGGA GGAGGAGCTC
CTCTCCAAGG GGCGGATCCT GGCGCAGACG GGGGGCAAGT CGATGGAGCG CATCTTGGCC
GAGGCGATAG CCAACCGCGA GCTCACCATG GAGCAGCTCT TCGACGAGCG CTACGTCCCG
ATCCCCAACA CGGAGCCGCA GAAGTACCAC ACCCAGTACG ACAGATTCCT GGACGAGGCG
GTCCAGGCCC TGGAGGACGA ATTCCTCAAG GACGACCAGA TCGTCTTCGC CGCCCTCGTG
GACCGAAACG GCTACCTTCC CACCCACAAC AGCAAGTTCT CCGCCCCGCT CACCGGCGAC
CGGGAGCGGG ACAAGAACGC CAACCGGACC AAGCGCCTGT TCCAGGACGA GGTGGGTCTT
GCCGCCTCCC GCAGTCTGGC CCCGTTCCTG AAGCAGGTAT ACCAGCGCGA CACCGGCGAA
AAGATGTGGG ACCTCTCGGT TCCGGTCTAC GTCCAGGGGA AACACTGGGG GGCGTTCAGG
ATCGGCTTTT CGATGCAGAA GACCGAACAA AAGGGGGCGC AGTTAAGAAA CGAGATCGTG
CTCAGCATGC TGGTCATGCT GATCGCCTGC TCGGTCACCA TCCTCCTCGT GGTAAGCCGG
GCGGTAAAGC CTTTGGCCAA GTTGACCGCG GCCGCCCACC GCATCACCGG CGGCGAGTTG
GATGAGACCA TTCCGGTAGA GAGCAACGAC GAGATCGGCA CGCTGGCCGA GGCCTTCAAC
ACCATGACAA CGGTGATCGT GCGCGACCTG AAGGAGGAGA TCGGCCGCAG CGGGCGCCTG
ATCGCCTCGG TCAAGGAGGC CGTGATCCAG CTCTCCAGCG CCGCGAACGA GATGATGGCG
ATCTCGGCGC AACAGGCCTC GGGTTCGACG CAACAGGCGA GCGCGGTGCA GGAGGTGACC
ACAACTTCTG AGGAGATCGC CATAACCGCC AAGATGATCA CGGCCAACGC CCGCAGCGTA
GAGACCGTGG CCGACGACAC CACCAGCAAC TGCAACAACG GACGCGGGGA CGTCACCAAT
GCCATCGAGG GTATGGGGCG GGTCCGCAGC CAGGTGGAGA GCATCGCCCG CAGCATGCTG
CAATTGGGAG ACAACAGCCA GAAGATCGGC GGCATCGTGG AGATCATCGA CGAGATCAGC
GACCAGACCA ACCTTCTGGC CCTGAACGCA GCCATCGAGG CCGCGGGGGC CGGAGAGGCC
GGGAAGCGCT TCGCCATCGT GGCGCACGAG GTGAAGAGGC TCGCCGACCG CACCGTTGAG
GCGACCCGGC AGATCAAGGG GCTGATCAGC GAGATCCAGA GCGCCACCAA CAACACCATC
ATGGTGACCG AGGAAGGGAC CAAGGCGGTC GACTACGCCT CAAGCCTCGT GGACAAGGTG
CAGCTCTCCT TCGCCTCCAT AGTGGGAACG GCGCAGGAAA CGGCGCGTAC CGCCAAGGAA
ATCTCGCTCT CCACCCAGCA GCAGACCTCC GCCTGCGAGC AGATGGCCGA GACCATGAGC
GAGGTGCGCG ACGTGGCGCA GCAGGTGGCC ATGTCGGCGA CCGAAACCGA GCGGGCCATA
GCCGAAATCC TGGAACTTGC CGAAAGGCTC AAGGAGATCA CGGAGGAAGA GGCGTAG
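
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content can be recomputed. The sketch below shows one way to do that; the helper names are hypothetical, and the example input is only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = clean_sequence("ATGTCCAATA AGTTATCGGT")
print(len(first_blocks), gc_fraction(first_blocks))  # prints: 20 0.35
```

Applying `gc_fraction` to the complete cleaned sequence gives a value that can be compared with the GC content field of the record (63%).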
 
Protein sequence
MSNKLSVKIV SVLIMVMIVI MTAFSVYFVR SRRQNMEEEL LSKGRILAQT GGKSMERILA 
EAIANRELTM EQLFDERYVP IPNTEPQKYH TQYDRFLDEA VQALEDEFLK DDQIVFAALV
DRNGYLPTHN SKFSAPLTGD RERDKNANRT KRLFQDEVGL AASRSLAPFL KQVYQRDTGE
KMWDLSVPVY VQGKHWGAFR IGFSMQKTEQ KGAQLRNEIV LSMLVMLIAC SVTILLVVSR
AVKPLAKLTA AAHRITGGEL DETIPVESND EIGTLAEAFN TMTTVIVRDL KEEIGRSGRL
IASVKEAVIQ LSSAANEMMA ISAQQASGST QQASAVQEVT TTSEEIAITA KMITANARSV
ETVADDTTSN CNNGRGDVTN AIEGMGRVRS QVESIARSML QLGDNSQKIG GIVEIIDEIS
DQTNLLALNA AIEAAGAGEA GKRFAIVAHE VKRLADRTVE ATRQIKGLIS EIQSATNNTI
MVTEEGTKAV DYASSLVDKV QLSFASIVGT AQETARTAKE ISLSTQQQTS ACEQMAETMS
EVRDVAQQVA MSATETERAI AEILELAERL KEITEEEA
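
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a small, dependency-free illustration and not the method used to produce the record; the function name is hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), and it assumes the input begins at the start codon and is in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGTCCAATA AGTTATCGGT AAAGATCGTC"))  # prints: MSNKLSVKIV
```

The output matches the first block of the protein sequence, MSNKLSVKIV. The last three codons of the gene, GAG GCG TAG, translate to E, A, and a stop, consistent with the protein ending in EEA.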