Gene GM21_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1105 
Symbol 
ID8136427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1294500 
End bp1296191 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content61% 
IMG OID644868716 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003020924 
Protein GI253699735 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones111 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGT TCAGCAACCT CAAAGTCGGC ACCAAGCTAA TATCGGCATT CATCGTGGTC 
TCGCTCATCA CCGCCATCGT CGGCTATATC GGCATCCGCA ACATGGGTGC CATCAACGAC
ATGGCTGATG AAATGTATCA GAAGGAACTG CTTGGGATCT CCTACATCAA GGAGGCCAAC
ATCAACCTCA TCTACATCTC GCGGGCCGAG AAGAACTTCC TGCTGGCCAC CAGCGCGCAG
GAGCGGGAGA CGTCCCAGGG CAACATCAAC AAGTACAAGG CGGGATACAA GGAGTGGCTG
GACAAGGCGA GACCGATGTT CACTTCCGAG AAGGGGAAGG AGATCCTGAA GCGGCTGGAG
GCCGCGAACG AAGAGTGGTT CGCGGTACAG CAAAAGGTGA TCGACCTGGG CGCGAAGGAG
GCGCTTAACG ACAGGAAGCA GTCGGTGGAA CTTTCCTTTG GAGAAGCGAG GACCAAGCAG
ATGGCGGTGG ACGATAACCT CACCGAGCTG GCCAGGTTGA AGGAGGGCAA CGCCAAGGAT
GCCTCGGACG AGACCACCAG GATCTACAAG TCGAGCCTCA CCATGATGGT CGGCCTGGTC
GCGGGCGGCG TGCTGATCGG CCTGGCGCTG GGGATATTCA TTGCGCGGAT GATCAGCGTG
CCGCTGCGGC GTGGCGTAGA ATTCGCCACT TCCGTGGCCG GGGGCGATCT GACCAGGAGT
ATCAATCTGG ACCAGAAAGA CGAGGTCGGG CAACTCGCGG CGGCGCTGAA CGACATGGTG
GGAAGGCTGA AGGAAATCGT GGCGGAGGTG AAAAGCGCCT CGGACAACGT CGCCAGCGGC
AGCCAGCAGC TCTCGTCGGG TGCTGAGGAA ATGTCGCAAG GAGCGACCGA GCAGGCGGCC
TCGGCCGAAG AGGCCTCCTC CTCCATGGAA GAGATGACTT CCAACATCCG GCAGAACGCG
GACAACGCCA TGCAGACCGA GAAGATAGCA GTGAAGTCGG CGCAGGACGC GAAGGAAGGG
GGGCAGGCGG TCCAGGCGAC GGTCAACGCC ATGAAGGAGA TAGCAGGCAA GATCACCATC
ATCGAGGAGA TCGCGCGCCA GACGAACCTC TTGGCCTTAA ACGCGGCGAT CGAGGCGGCG
AGGGCCGGCG AGCACGGCAA GGGATTCGCC GTGGTGGCCA GCGAAGTGCG CAAGCTTGCC
GAGAGAAGCC AGAAGGCGGC AGCCGAGATC AGCGACCTCT CCTCCAGCAG CGTGGACGTC
GCGGTGAAGG CCGGGGAGCT CTTGGCCAAG ATGGTTCCCG ATATCCAGAA GACCGCTGAG
CTGGTGCAGG AGATCAGCGC CGCGAGCCGC GAACAGGACA CCGGGGCGGA GCAGATCAAC
AAGGCGATCC AGCAGCTCGA CCAGGTCATC CAGTCAAACG CAGGCGCCTC GGAGGAGATG
GCCTCCACCG CCGAAGAGCT CTCCAGCCAG GCGGAGCAAT TGCAGAGCGC TGTAGCCTTC
TTCAAGATCG GGGATGACCG GTTCGGCAGG GGGGGCAAAG CCGTGGCGGG CTCTCATTCG
GCAGCAAAAC CGAGGGCTTT GACCCGTTCC AAGGCTGCGC CCGGCGCTGC GCCGCTGAAG
AAGGCGGCCG GGCACGACCT TGAGCTTTCG GAGAAAAAGG AGCTACACGA CGGGGACTTC
GAGAAGTACT GA
 
Protein sequence
MKWFSNLKVG TKLISAFIVV SLITAIVGYI GIRNMGAIND MADEMYQKEL LGISYIKEAN 
INLIYISRAE KNFLLATSAQ ERETSQGNIN KYKAGYKEWL DKARPMFTSE KGKEILKRLE
AANEEWFAVQ QKVIDLGAKE ALNDRKQSVE LSFGEARTKQ MAVDDNLTEL ARLKEGNAKD
ASDETTRIYK SSLTMMVGLV AGGVLIGLAL GIFIARMISV PLRRGVEFAT SVAGGDLTRS
INLDQKDEVG QLAAALNDMV GRLKEIVAEV KSASDNVASG SQQLSSGAEE MSQGATEQAA
SAEEASSSME EMTSNIRQNA DNAMQTEKIA VKSAQDAKEG GQAVQATVNA MKEIAGKITI
IEEIARQTNL LALNAAIEAA RAGEHGKGFA VVASEVRKLA ERSQKAAAEI SDLSSSSVDV
AVKAGELLAK MVPDIQKTAE LVQEISAASR EQDTGAEQIN KAIQQLDQVI QSNAGASEEM
ASTAEELSSQ AEQLQSAVAF FKIGDDRFGR GGKAVAGSHS AAKPRALTRS KAAPGAAPLK
KAAGHDLELS EKKELHDGDF EKY