Gene Information |
Locus tag | Mpal_1707 |
Symbol | |
ID | 7271270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1769973 |
End bp | 1773023 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643570322 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_002466738 |
Protein GI | 219852306 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
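The length fields above follow from the coordinates. Assuming Start bp and End bp are 1-based and inclusive (the record does not state this, but the numbers agree with it), the span gives the gene length. Dividing by three and dropping the terminal stop codon gives the protein length. A minimal Python sketch of that check follows; the function names are hypothetical and not part of the record:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(1769973, 1773023)  # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3051 1016
```

The result matches the Gene Length (3051 bp) and Protein Length (1016 aa) fields.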
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.569761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCTA ACGAAAATAC AAAGGAACTT GAAATAACAA AGGAAAAGGC GCAGCAGTTT GAGTTACTCT CCGCCGGACA GAAGAAAGCA GTCACTGATA TTCTTTTCAT CGCAGAGAAG GCGAAGGGCG GAGATCTCTC TGCACGGATT GACCTTGCCA ACCACACCGG AGACTTCAAG ATGATCGCCG AAGGGATCAA CACCCTCCTC GACGTCTTCA CCGAGAAGAA CTCCTGGTAT AAGGCGATCA TCGATGCGGT GCCATTCCCG ATCCATGTCA CCGACCTGGA CATGAAATGG ACGCTGATGA ATGCGGCCTT TGAGAAGACG CTGATCGAGC AGGGAGCAAT TAAGAACCGT GACACCTCCA TGGGACTGCC CTGTTCCACA GCCAACGCGA CGATCTGCAA CACCGAGGGA TGCGGGATCC GGCAGCTGCA CAAAGGGGTC ACCGAGAGTT ACTTCGTCTG GGGTGAGATG AATGGCAAAC AGACCACCTC GTCCCTCCGC GACCAGAAAG GGAACCAAAT TGGATATGTT GAAGTCGTAC AGGACCTCAC TTCAGTCATT GCGGTCCGTG ACTTCACCAA GGCCGAGGTG GACCGAATGG CAACAAATCT CGCCCTGCTC GCAAAAGGAA ACCTTAACTT CGACCTGACC GTCAAGCCGG CTAACAAATT CACCCAGGCA GAGCATGATG ACTTTGTGAA GATGAACGAG AACCTGCAGG AGGCGCAGCT GGCGATCAGT GGCCTCATCA ACGATGCGGA GATGCTTGCC AATGCAGCCG TCGAAGGCAA ACTCTCGATA AGAGCGGATG CAACAAAACA TCAAGGCGAT TACGAGAAGA TTATCGAGGG ATTCAACCAG ACGCTCGATG CGGTCATCGG ACCATTAAAT ATCGCTGCTG ACTTTGTCGC CAAGATCAGC GCAGGGATAG CACCAGAAAA GATCACAGCG GATTATCAGG GTGACTTCAA TGTTCTGATG CATAACCTGA ATCAGCAGAT CGATGTGATT GTGATGCGGA ATACTGAACT TGATATGCTG ATAGAGGCTG CAATCGAGGG TAAACTCGCA ACACGCGCCG ATCCTTCCAA GTTCCAGGGT GGTAACAAAA AGATGATGGT GGGTCTCAAT GCGATGCTCG ATGCCATCAT CGCTCCGTTA AATGTTGCCA TAGATTATTA TAATAAAATC GGTATTGGCG ACATTCCAGA AAAGAGAACC AACAAGGTGA ACGGCGATAT CATCGCGATG CAGAAAAGCA TCAACAACTG TATCGACAAC ATCAATGCAC TCGTTGCTGA TGCCAATATG CTCTCGGTAG CAGCAGTCGA AGGAAAACTC GCGACTAGGG CGGACGCAAC CAAACATCAG GGTGATTACC GGAAGATCAT CGAAGGGGTC AACCAGACCC TCGATGCTGT TATCACACCA TTGAATCTGG CTGCAGAATG CATTGATCGG ATCAGTAAAG GTGATATTCC TGATAAGACA AACCGCCAGC TCAATGGTGA CTTCAACACA CTTCATATCA ACCTGAACAA CCTCATCGAC AACATCAATG CACTCGTCAC CGATGCTAAT CTGCTCGCAG AAGGCGCCAT AGAAGGAAAA CTCGCCAACA GAGCGGATAC AACCAGACAC CAGGGTGATT ATCGGGAGGT TATCGAGGGA TTCAACAGAA CGCTCGATGC CATCATCGCT CCGTTAAATG TTGCCATAAA TGGTTATGAT AAGATCGGGA AAGGCGATAT CCCCGAGAAG AGAACCAACA AGGTGACTGG GGATATCATC 
GGGATGCAGA AGAGCATGAA CGCCTGTATT GACAACATCA ATGCACTCGT CACCGATGCT AATTTGCTCG CAGAAGCCGC CATCGAAGGA AAACTCGCCA ACAGAGCGGA CGCAACCAGA CACCAGGGCG ACTACAGGAA GATCATCGAG GGATTCAATA AGACGCTCGA CGCCGTGATC GAGCCGGTCG ATGAAGCGAT GCGGGTTGCA CATGAGTTCT CCGAGTATAA TTATCAGGCC AGGATGGACA AGAACCTCAG AGTTGCCGGA GACTTTGTCA AATTCAGGGA TGCCCTGGAC AATATCGGGA TCTCGGTCTC AGCCGCAATA GGAGACATCA ATAATCATGT CACCGACCTC GCAGCCTCGG CAGAGGAGGC GAACGCAAGC ATTGAGGAGG TGGTTTCCGG GGCACAGCAG GTCGCTGAAA GTGCCGGCAA GGTCAGTTCC AACGCTGAGA AAGGGAACCA GGGTCTCGAA CAGGTACTCA AGGCGATGGA GGACCTCTCT GCCGCCGTTG AGGAGGTGAC CGCCAGCAGC GAGTCTGTAG CTAACCTTGC AAACAGTGCA AACACCCTCT CCAAGGACGG GGCCGAACTC GCACGGAAGG CCGAACAGGG GATGGTCGGG ATCACCCGAT CCACCACGGA GGTGGACCAG ATCATCGGCG AGATCAAGTC AGAGATGCAG AAGATCGGAA AGATCGTCGG CCTGATCTCG GACCTGGCCA ACCAGACCAA CCTGCTCGCC CTGAACGCGG CCATCGAGGC CGCCCGAGCC GGCGATGCCG GCCGTGGGTT CGCCGTGGTC GCCGCGGAGG TGAAGTCCCT CGCCCAGGAG TCGAGGACCT CTGCTGAGTC GATCGCCGAG ATGATCGGCG GTCTCCAGCA CAAGTCAGAA CTGGCCGCAC AGGCGACCGC ATCCGCCTCA AAAGAGGTCG GTGAGGGGTC AGCGGTCCTC TCTGAGACTC TGAACGTCTT CAACCGGATC GTTGCCGATG TCGAGAAGAT CACCCGCTCG GTCGAAGAGG TTGCAAGCGC CTCCGAAGAG CAGGCAGCCA CCGTCGAGGA GATCACGGCC AGTGTTCATG AGGTGAGTTC CCTGGTGGAC GGAACGGCCA GCGACGCAGG GGATGCCGCA GCAGCCAGCG AGGAGTCCTC GGCGGCGATC GATGAAGTCG GAAAGATCAT CGAGAATGTA AACGTGATTG TCGACTCAGT CTCCAGTGGG ATCAATAAAT TCAGGGTCTG A
|
Protein sequence | MDPNENTKEL EITKEKAQQF ELLSAGQKKA VTDILFIAEK AKGGDLSARI DLANHTGDFK MIAEGINTLL DVFTEKNSWY KAIIDAVPFP IHVTDLDMKW TLMNAAFEKT LIEQGAIKNR DTSMGLPCST ANATICNTEG CGIRQLHKGV TESYFVWGEM NGKQTTSSLR DQKGNQIGYV EVVQDLTSVI AVRDFTKAEV DRMATNLALL AKGNLNFDLT VKPANKFTQA EHDDFVKMNE NLQEAQLAIS GLINDAEMLA NAAVEGKLSI RADATKHQGD YEKIIEGFNQ TLDAVIGPLN IAADFVAKIS AGIAPEKITA DYQGDFNVLM HNLNQQIDVI VMRNTELDML IEAAIEGKLA TRADPSKFQG GNKKMMVGLN AMLDAIIAPL NVAIDYYNKI GIGDIPEKRT NKVNGDIIAM QKSINNCIDN INALVADANM LSVAAVEGKL ATRADATKHQ GDYRKIIEGV NQTLDAVITP LNLAAECIDR ISKGDIPDKT NRQLNGDFNT LHINLNNLID NINALVTDAN LLAEGAIEGK LANRADTTRH QGDYREVIEG FNRTLDAIIA PLNVAINGYD KIGKGDIPEK RTNKVTGDII GMQKSMNACI DNINALVTDA NLLAEAAIEG KLANRADATR HQGDYRKIIE GFNKTLDAVI EPVDEAMRVA HEFSEYNYQA RMDKNLRVAG DFVKFRDALD NIGISVSAAI GDINNHVTDL AASAEEANAS IEEVVSGAQQ VAESAGKVSS NAEKGNQGLE QVLKAMEDLS AAVEEVTASS ESVANLANSA NTLSKDGAEL ARKAEQGMVG ITRSTTEVDQ IIGEIKSEMQ KIGKIVGLIS DLANQTNLLA LNAAIEAARA GDAGRGFAVV AAEVKSLAQE SRTSAESIAE MIGGLQHKSE LAAQATASAS KEVGEGSAVL SETLNVFNRI VADVEKITRS VEEVASASEE QAATVEEITA SVHEVSSLVD GTASDAGDAA AASEESSAAI DEVGKIIENV NVIVDSVSSG INKFRV
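The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is illustrative only: it builds the codon table from the usual compact NCBI layout, and the `translate` helper is a hypothetical name. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Because this gene starts with ATG, the sketch does not handle alternative starts.

```python
from itertools import product

# Compact codon table: codons enumerated with bases in T, C, A, G order,
# third position varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGGATCCTA ACGAAAATAC AAAGGAACTT"))  # MDPNENTKEL
```

The output matches the first ten residues of the protein sequence. Translating the final 21 bases (`ATCAATAAAT TCAGGGTCTG A`) gives `INKFRV` followed by the TGA stop, which matches the end of the protein.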