Gene GM21_3453 details


Gene Information

Locus tag: GM21_3453
Symbol:
ID: 8138820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3989571
End bp: 3991169
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 61%
IMG OID: 644871068
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_003023233
Protein GI: 253702044
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
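
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3989571
end_bp = 3991169
reported_gene_length = 1599      # bp
reported_protein_length = 532    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under those assumptions, 3991169 - 3989571 + 1 gives 1599 bp, and 1599 / 3 - 1 gives 532 aa, matching the reported values.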


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value: 0.0235494
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATAA GCAAGAAGAT TCTGATCAGC AACGTGGGCA TGGTGCTCAT CGCGACCATT 
ACCACTTCGG CTATTTCGCT CTACGTCACC AAGAAGGAGA TCACGCGCCA GGTCAACGTT
TCACTCGGCT CGCGGATCAA TGCTTTCCGC GAACTAATCA GCAGGAACGA TGGCAGCATA
CTGCTGGTGG ACGGCAAGCT GCAGGCGAAC GGGGTGACGC TGGACGGCGA CAACGCCCTT
ACCGACAGGA TGAAAGAGAT CTTCGGGGGG GAGGCGACCA TCTTCAGGTA CGACGTCCGG
GTCGCGACCA CGATCAAGAA GGAAGACGGC GCCCGCGCGG TCGGCACCAG GCTCCAGGGG
CCCGCACGCG AGGCGGTCAT CGATCGCGCG GCTCCCTACC AGGGGGAGGC AAGCATTCTG
GGGGTTTCGC ACTTCGCCTC ATATCTCCCC CTGAAAGACG GCAACGGCAA GGTGATCGGC
GCTCTCTTCG TGGGCGAGAA AAAGTCCGAG TACCTTGCGG TTTTCGACCG CCTGAAATAC
CTGATCCTCG CCCTGTCCGC ACTGCTCGGT GCAGTCCTGG CTCTGGCTGG ATACCTGGCT
CTGCACAAGG CGCTGATGCC GTTGCGAGAG TTGATCAGGA CTTTGCAGGA TGTAGCGGAA
GGGGACGGCG ACCTCACCCA CCGCCTAAAC GAATCGACCG ATGAGATAGG TACCGCCAGC
CGCTATTTCA ACCGATTCAT CGACCGGGTC CATACAATCG TGCAGACGGT GGCCGACAAC
GCGAACTCCG TGGCAAGCGC GAGTTCCGAG CTGCACTCCA GCACCGAGAG GCTTGCCGAC
ACCACTGAGG CCGTAGCCGT GCAGACAGAA ACCGTTTCGA CCGCAGGGGA GGAGATGGCT
GCCACTTCCG CGGACATCTC CAAGAACTGC CTGAGCGCGG TCGACAGCGC CCAGCGAGCC
TGCGAGATGG CGCGCTATGG CTCCGCCGAC GTCGAGCGCA CCATCGACGG AATGAAGCTC
ATCAACGAGA AGGTGCGAGC CACCTCTGAG AGCGTCGGCA ATCTGGGGGT AAAGTCGGAA
CAGATCGGCG ACATCATCGG CACCATCCAG GACATCGCGG ACCAGACCAA CCTCCTCGCC
TTGAACGCGG CGATAGAGGC GGCTCGCGCC GGGGAGCAGG GGCGCGGCTT TGCGGTCGTC
GCAGACGAGG TGCGCCGGCT AGCCGAAAGG ACCACCAGCG CCACCAAGGA GATCGAGGTC
AACATAAGGT CGATCCAGGA AGAGACCGCC CGGGCGGTGC AAGTAATGCA CGAAAGCGCC
AGGGAAGCTG CCAAGGGGGC CGAAGATTCC ATCAAATCCG GTGAGAGTCT GGAGGAAATT
CTGAAACAGG TCAACGAGGT GACGCTGCAG ATAGGGCAGA TCGCAACGGC TGCCGAGGAG
CAGAGCGCGA CCAGCCGCGA GATCAGCAAT AACGTGCACC AGATCACAGG GATCATTCAG
GGCGCAGCCA GGGACAACCG TGCATCCATG TCGACTGCGG ACGAGTTGAA CCGGCTCTCG
GAGAGTTTGA AGCTGCAGAT CTGCAGATTC AGGTACTAA
 
Protein sequence
MNISKKILIS NVGMVLIATI TTSAISLYVT KKEITRQVNV SLGSRINAFR ELISRNDGSI 
LLVDGKLQAN GVTLDGDNAL TDRMKEIFGG EATIFRYDVR VATTIKKEDG ARAVGTRLQG
PAREAVIDRA APYQGEASIL GVSHFASYLP LKDGNGKVIG ALFVGEKKSE YLAVFDRLKY
LILALSALLG AVLALAGYLA LHKALMPLRE LIRTLQDVAE GDGDLTHRLN ESTDEIGTAS
RYFNRFIDRV HTIVQTVADN ANSVASASSE LHSSTERLAD TTEAVAVQTE TVSTAGEEMA
ATSADISKNC LSAVDSAQRA CEMARYGSAD VERTIDGMKL INEKVRATSE SVGNLGVKSE
QIGDIIGTIQ DIADQTNLLA LNAAIEAARA GEQGRGFAVV ADEVRRLAER TTSATKEIEV
NIRSIQEETA RAVQVMHESA REAAKGAEDS IKSGESLEEI LKQVNEVTLQ IGQIATAAEE
QSATSREISN NVHQITGIIQ GAARDNRASM STADELNRLS ESLKLQICRF RY
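
The gene and protein sequences above are related by the codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in which codons may act as start codons). The following is a minimal sketch of how the translation and a GC percentage could be reproduced, not the method the database used. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence that begins in frame.

```python
# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_block = "ATGAACATAA GCAAGAAGAT TCTGATCAGC"
print(translate(first_block))  # MNISKKILIS
```

Passing the full gene sequence to `translate` in place of `first_block` gives a string that can be compared against the protein sequence listed above, and `gc_content` on the same input gives a figure to compare with the reported GC content.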