Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0950 |
Symbol | |
ID | 4284875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1046424 |
End bp | 1049159 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140418 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_756181 |
Protein GI | 114569501 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.666627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG ATCATCGTCC GTCCAAGGCC CCGGCGGCCA AGGACAGGAC GCCGCAATCC CCGTCCCGGG GATCGAAAGG TCGTGACCGC AACCCGCCCA CCGACAAGCC TGCCAAGAAA CGTGGCAGCG GCGTGTCCTG GATCATCCTG GCGATGGTTC TATTCACCCT CGCCTATGGC GGCATCCTTC TGGTCAAGAT CCAGCAGGAG CGCGAGATTC TGATCACCGA GGCCGAACGG TCCCAGGCCA ATGCGGCGGG GTATCTGGCC GAGCGTGTGA CCGCGCGCAT CGCCGAAGCC CGCTACGCTC TCGCCTTCGC CGCAGCAGAT TTGCGCGATG CGTCGCCGCA ATCGGTCGAG GGGCTGACCC GAGCCCGGCT TGCCGCCATC AACGAGTCCG AACTGGTGAC CGATGTCGCC CTGCTTCTGC CCGGCGGCCA GGTCATCGCC GTTGGCACAC CGGCAGACGG GCTGCGCGAC ATCACCGGTG CCGCGCTCGA CGCGCCGTCC GGTCTGACCG CCCAGGTCAC GGGCGGCGTC GCTCATCTCA TTCTCGCTGT TCCCACGCCG CTGAGCGACG GAACAGTGGG GGCATTCGTG GCACGACTGA GTACGGAATC AGCCTTGCCT GACTGGGGAG ATGACCGTGT GGTCGCCTTG GCCGATGCGC AAGGCGGCAT GCTGGCAATC CGCCCGCGCA TGCCCAGCGC CCGCCCCGGC ACCCAGCTGG CCGAGCGTTT CGGCCTGCAG CCCGGCTTTA TCGACTCACT GGCCGCCGGC GGCGGCGGTG CCACCTCCGA CGCCCGCTTC GGTGAGGAAC GCGTGACCTT CGCCGTCGCC CCGATCACGG GCACCGAATT GCGTGTTTAC GCCCTCGGCG CAATGCGCAT CAACCAAGAC GCATGGTACC GCACCATTTC ATTTTACGGG CTGATGTTCA TCGCCCCGAT TTTTGTGGCA CTGGGCCTGT GCGCCCTGGT CTTCATGCAG ATGGGCCGCC TGCGCTCGAC CCGGCAGCAG CTTGAAGACA ATGAGCAACG CTTCCGCGTC GCCATCGAGG GCGCGCGCTG CGGTGTCTGG GACTGGAATC CCGAGGCCGA CACGGTTTTC GTCACCGACA GCCTGGCCCG TATTCTCGGC CTCGACGCAG CGACGGAATG TACCGGTCAG CAATTCCTGC AGCTCTTCTC CAAGCCTGAC CGGGAACGCC TGCGCGCAGC CATGCGCGGG GCTCCCGCCG GCGCCGAGGT CGATCATGAA GTGCTGGCCG CGCGGTCCCC GGTCTGGCTG CAGATGCGCG GACGCATCCT GCCCGGCGGT GAAGCCAATC ATACCCGGAT CATCGGCGTC GCCATTGATG TCACCGAACG CAAGGGCGCC CAGGCCCGGG TGGCCGCGGC CGAAAACCGG CTGCGCGCGG CGCTGGAATC CATGTCCGAG AGCTTCGTGT TGTGGGACAG CCGCCAGCGC CTGGTGCTGT GGAACCGCAA GTTCCGTGAC CTGTTCGACT TCACCGACGG CATGCTCAAG CCGGGCATGA GCTATGACGC CGTCGAGCAG GCCGCCGCCC GCGCCATCAA GACCGTTCAC GGCGGAAGCG AGGGCAAGGC CGCCTATGAG ATCGAGCTGT CGAACGGCCG CTGGCTGCAC TATTCCGACC GTCCGACCGC CGATGGCGGC CTGGTCTCGG TCGGCGCCGA TATCACCGAT CTGAAGCATC ACGAGGCCGC CCTCACCGAG AATGAGAGCC AACTTCGCAA GACCGTCGAC GACGTCAAAC GCTCCCAGGC CCGTATCGCC GATCTGGCCA AGAAGTACGA AGAAGAGAAG ATACGGGCCG AGGAAGCCAA CCGCTCCAAG TCCGAATTCC TCGCCAATAT GAGCCATGAG CTGCGCACAC CACTGAACGC CATCAATGGT TTCTCCGAAA TCATGATGCA GGAAATGTTC GGGCCGCTCG GTGATGATCG CTATGTCGGC TACATGAAGG ACATCCTGTC CTCGGGCCGT CACCTGCTGG AACTGATCAA CGACATTCTC GACATGTCGA AGATCGAAGC CGGCAAGATG CAGCTTCAGC CTGAACCGAC CGATGCCAGC GAGCTGGTCG AGCAGAGCAT CCGCATTGTC CGTGGCCGGG CCGAGGAAAA ACAGTTGAAA TTGCGCGCCG ACGTGTCGGA TCTGCCGGAG ATCGAGGTCG ATCCTCGGGC CTTCAAGCAG GTCATGATCA ATCTGGTGTC CAACGCGGTG AAGTTCACGC CCGAAGGCGG CCGGGTCACG GTGCGCGGTT TCCTGTCTGG CCTGGGCGTG GCCTTCCAGG TCTCCGATAC CGGTATCGGC ATCGCCAAGG ACGACCTGCC ACGCCTCGGC CGCCCGTTCG AACAGATCGA GAGCCAGCAC TCCAAGAGCT TCCAGGGCTC CGGCCTGGGC CTGGCCCTGT CCAAGTCACT GATCGAACTT CATGGCGGCA CATTGTCGAT CGACTCGGTT CTCGGCGAAG GCACCACCGT GTCGGTCGTC CTGCCGATCA GCCAGGACCA GCCCATTCCG CGCGACGCGA TCAAATCGGT AACCGGCGAT ACCGGCCATG ACGATGTCGA CACGATCGAG GACGACGGCA TCGACACACA CGCACCACTT GGCCTGTCAG CCAGCCTGCC GACGGATGAC GGTTTCGATG TCGATGATGA ATTCGTGGAT AGCGAGACGC CGCCCCGGTT CGCCGCCGGC GAATAG
|
Protein sequence | MRFDHRPSKA PAAKDRTPQS PSRGSKGRDR NPPTDKPAKK RGSGVSWIIL AMVLFTLAYG GILLVKIQQE REILITEAER SQANAAGYLA ERVTARIAEA RYALAFAAAD LRDASPQSVE GLTRARLAAI NESELVTDVA LLLPGGQVIA VGTPADGLRD ITGAALDAPS GLTAQVTGGV AHLILAVPTP LSDGTVGAFV ARLSTESALP DWGDDRVVAL ADAQGGMLAI RPRMPSARPG TQLAERFGLQ PGFIDSLAAG GGGATSDARF GEERVTFAVA PITGTELRVY ALGAMRINQD AWYRTISFYG LMFIAPIFVA LGLCALVFMQ MGRLRSTRQQ LEDNEQRFRV AIEGARCGVW DWNPEADTVF VTDSLARILG LDAATECTGQ QFLQLFSKPD RERLRAAMRG APAGAEVDHE VLAARSPVWL QMRGRILPGG EANHTRIIGV AIDVTERKGA QARVAAAENR LRAALESMSE SFVLWDSRQR LVLWNRKFRD LFDFTDGMLK PGMSYDAVEQ AAARAIKTVH GGSEGKAAYE IELSNGRWLH YSDRPTADGG LVSVGADITD LKHHEAALTE NESQLRKTVD DVKRSQARIA DLAKKYEEEK IRAEEANRSK SEFLANMSHE LRTPLNAING FSEIMMQEMF GPLGDDRYVG YMKDILSSGR HLLELINDIL DMSKIEAGKM QLQPEPTDAS ELVEQSIRIV RGRAEEKQLK LRADVSDLPE IEVDPRAFKQ VMINLVSNAV KFTPEGGRVT VRGFLSGLGV AFQVSDTGIG IAKDDLPRLG RPFEQIESQH SKSFQGSGLG LALSKSLIEL HGGTLSIDSV LGEGTTVSVV LPISQDQPIP RDAIKSVTGD TGHDDVDTIE DDGIDTHAPL GLSASLPTDD GFDVDDEFVD SETPPRFAAG E
|
| |