Gene Mmc1_3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_3168 
Symbol 
ID4483272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp3948357 
End bp3951137 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content54% 
IMG OID639723917 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_867064 
Protein GI117926447 
COG category[N] Cell motility
[P] Inorganic ion transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2703] Hemerythrin 
TIGRFAM ID[TIGR02481] hemerythrin-like metal-binding domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000240071 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0523799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATT TAAAAATTGT GTATAAAATT GGGCTGGGTT TCATGGTGCC TGTGGTCTTG 
ATGATCGGTG TCGGTGTTTG GAGTTATCTG GTCAGTGGCA GTGTTGCAGA AAAGTCCTAC
CATGCGCGGG ATACCAGCGT GCGTTTGGCG CTACAAGCGA GCCAACTTGA GCTGGATGTG
GTGCAGGTGC AACAGTGGCT GACCGATATT TCCGCAACCC GAGGTGCCGA GGGTTTTGAT
GATGGGTATG GCGAGGCAGA AAACTACTAT CAAGCAGCCT TGTTCAAGCT GGATGATTTT
GCCAAACATT ATAAGAAGCA GGGTAAAACC GACGAGGTTG CACAGATAAA AAAACTGAGC
AGTGCGTTGA CAGCTTGGTA TGCCGTGGGC AAAAAAATGG CCCAGGGTTA TATCGAAGGC
GGACCGGTGC TGGGTAACCA GCTCATGGGG GGCTTTGATG AACAAGCGGA GATTTTGCAA
GGTCAGCTCA AACCCTTTTT GGATGGACAA CGCCAGCAAA TTTATACTTT GCTGACCATG
GTTGTGGATG AAGTGAACTA TTTTGAGCAT GGGGTGGCGC TGCTTTCCTC GCTGTCGGTG
GTGTTGGTGT TGGTGCTGGG CTACCTGATC TCGCGCACCA TCACGGTGCC GGTGGCTAAA
ATGGTGGGTG CCATCAAACA GATGGCGGGG GGGGATCTGG TGGTGCGTTG CACCATGGAC
GAGCGCCGCG ATGAAGTAGG CCAAATGGCG GTAAACCTAA ACCGTTTGGC CGATGTGCTG
TGCGAAAATA TACGGATGAT TAACCTGCAA GCGGCCAACA TTAACTCCTT TGTGGGGGAG
GTGGTGCAAC TGCGGCAGGG ATTGGGGGAA AATAACCGCG ACCTGAACGA TACCACCGAG
CAGGTGGATG CCAAAAATCA GGGGTTAAGC ATGGAGATCC AAGGGATCCA AAGCCATGTG
GGGGAGACGG TGGTTAACTT GGAAACCCTG TTCCATGCTA TCCAGGAGGT CAGCAGTGGG
GTGAATACCA TCACCCATGG GGTGGACGAG GCCAACAGCA ATGTCAACAC CATGGCTGGT
GCGGCAGAGC AGATGTTGGC CAATGTGGAA GATGTGCATA CACAGATGGC CCAGGTCTAC
ACCTCGGTAG ATCACGTTTC ACGCTCGGTG CTGGAGTTGC AGAACTCACT CAAGGATGTA
CGCAAACGCT GTCAAGGGGC GGCTGCGGAG TCGGAACAGG GCAAAACCTT GGCTCAGGAT
GCCACCACAG TGATGGGGCA ACTGACCGGT TCGGCGCGGG AGATCTCCAA GGTAGTGGAG
GTCATCAATA CCATTGCGGA GCAGACCAAT ATGTTGGCTT TAAACGCCTC CATTGAGGCG
GCTGGGGCCG GTGAGGCTGG TAAGGGGTTT GCGGTGGTGG CCAACGAAGT TAAGGATCTG
GCCAGTCAAA CGGCCGGTGC CACGGAGACC ATTTGGCACC AGATCGACAC CATGCGTGGT
TTGACCGAAC GGGCCGAAAA CAGCACCAAG CGCATTCAGG AGGTGGTGGA TCGCATTGCC
ATGGCCAACA GCGATATCAA TATGGCGGTC GATGAACAAG GTCACGCCAC CAATGGGATT
GCCGAGGCCA CCAATCAGGT CTCCCGGGTG GCCGAAGAGG TGGCCCGCAA TGCCCAGGAG
CTCAGTGCGG CGGCTGGCGA TGTGGCCCGG GCAGCGGCAG AGGCCGCCAG TGGCACCCAG
CGTATCGCCT CCACCAGTGA TGAGATCGCC CAGGCGACCC ATGCCATGGA GCAGCAGGCA
CAGCAGGCAA CCTCCTCGAT TCACGCCATT CAACACGCGG CGCAGAGCAC GGCGGAAGCC
TCCCAGGTGG TGGGGCAGCA GTTGGCGGTT ACCAGAAGTG TGGTGACGGC CATGCGCGGT
TCAGTGCAAC AGTTTAATGC CCTAAGCGAT GTCGCCGCCG GGGTGAGTGA GGCGCTCTAT
GCGGCGCAAT CCCGCATGGA TGTTGGACCA GAGCTGTTTG ATGTGCAATT GATCCGCGAG
GGAATTTTGC AGGTCATGGG TCGCATCGGC CATGCCGCAA CGGTGCGGGA TGTTTCGTTG
GTTAACGATC TGTTGGACAA CCAAAACCAG CCCATGTTGG ACAAATTGCG CCAGGAGCTG
CCAAAATCGG TACAAGCCCT GCCGCTGTTT GTAAAAATGG AGCGTACCGC CGAGCAGTTG
CACAAGAGCG GTCAAGATAT TATTCACTTG GTCTCTGCGG GTGAAGAAGA GGCTCTTGAA
GATGCCATGC GCTTCTACCA CCAGCTACGT AACACTTTGT TTGAGCAGCT TAACCAACTC
TATATGGGCA GTGATAAGGA TGCCAATGCG ATCGAGCCAA AAATTCGCTG GCTCAGCAGT
TATGAGGTTG GTATTGAGAC CATTGATAGC GATCACAAAC GTTTGGTCGG CATGATGAAC
CAGCTCTATG CTGCCATGAA GACCGGTTCT AGCAGTCAAG TGATGGAAGA GCTGTTTGAT
GGCCTAATCA ACTATAGTGT GGTCCATTTT CAACGGGAAG AGCGCCTGTT TGAACAGCAT
CGCTACCCCG ATGCTGCCGG ACACAGCAAA CGGCATGAAG CCTTTAAAAC CTACGTGCTG
GGTAAGCGGG ATGAGTTTGT GGCGGGGAGC AATATTCATC TGAGTCAGGA TATTTTTACC
TACCTGGAAG AGTGGTTGAT CAACCATATT ATCCATGATG ATATGGCTTA CGTACCCTAC
CTGACCAAAA AACGTGCTTA G
 
Protein sequence
MKHLKIVYKI GLGFMVPVVL MIGVGVWSYL VSGSVAEKSY HARDTSVRLA LQASQLELDV 
VQVQQWLTDI SATRGAEGFD DGYGEAENYY QAALFKLDDF AKHYKKQGKT DEVAQIKKLS
SALTAWYAVG KKMAQGYIEG GPVLGNQLMG GFDEQAEILQ GQLKPFLDGQ RQQIYTLLTM
VVDEVNYFEH GVALLSSLSV VLVLVLGYLI SRTITVPVAK MVGAIKQMAG GDLVVRCTMD
ERRDEVGQMA VNLNRLADVL CENIRMINLQ AANINSFVGE VVQLRQGLGE NNRDLNDTTE
QVDAKNQGLS MEIQGIQSHV GETVVNLETL FHAIQEVSSG VNTITHGVDE ANSNVNTMAG
AAEQMLANVE DVHTQMAQVY TSVDHVSRSV LELQNSLKDV RKRCQGAAAE SEQGKTLAQD
ATTVMGQLTG SAREISKVVE VINTIAEQTN MLALNASIEA AGAGEAGKGF AVVANEVKDL
ASQTAGATET IWHQIDTMRG LTERAENSTK RIQEVVDRIA MANSDINMAV DEQGHATNGI
AEATNQVSRV AEEVARNAQE LSAAAGDVAR AAAEAASGTQ RIASTSDEIA QATHAMEQQA
QQATSSIHAI QHAAQSTAEA SQVVGQQLAV TRSVVTAMRG SVQQFNALSD VAAGVSEALY
AAQSRMDVGP ELFDVQLIRE GILQVMGRIG HAATVRDVSL VNDLLDNQNQ PMLDKLRQEL
PKSVQALPLF VKMERTAEQL HKSGQDIIHL VSAGEEEALE DAMRFYHQLR NTLFEQLNQL
YMGSDKDANA IEPKIRWLSS YEVGIETIDS DHKRLVGMMN QLYAAMKTGS SSQVMEELFD
GLINYSVVHF QREERLFEQH RYPDAAGHSK RHEAFKTYVL GKRDEFVAGS NIHLSQDIFT
YLEEWLINHI IHDDMAYVPY LTKKRA