Gene Information |
Locus tag | Maqu_0671 |
Symbol | |
ID | 4654145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008740 |
Strand | - |
Start bp | 771371 |
End bp | 774334 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639810621 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_957956 |
Protein GI | 120553605 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
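The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_summary` is hypothetical and is not part of the source database.

```python
def cds_summary(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3            # includes the stop codon
    protein_length = codons - 1          # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Maqu_0671
print(cds_summary(771371, 774334))
```

For this record the result agrees with the listed Gene Length (2964 bp) and Protein Length (987 aa).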
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.239322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTA GCGCCACCGG CCTGCTCCTG GCCAGCCTGT TTTATCTGAT CGTTCTGTTC GGCATTGCCT GGGGCACCGA ACGGGGCACG TTATTGCGCC GCTGGGTTCG CCACCCACTG ATCTATACCC TTAGCCTTGG GGTTTATGCC GGCATCTGGG CCGTCTACGG CGCCATTGGT ATGGCGGCGG ACTCCGGCTA CGGCTTCCTG GCCTATTACC TGGGCATCAG CGGCGCCTTC CTGCTGGCAC CGGTTCTGCT CAACCCGATT CTCCGCATAG GCCGCGCCTA CCAACTGACG TCGCTGGCCG ATCTTTTCGC CTATCGTTAC CGCAGCCAGT GGGCGGGCAC CCTGGTCACC CTGTGTTCGG CTGCAGCCAT TCTGGCCTTG CTGAGCATGC AGATTCAGGC AGTCTCGGTT TCCGCCAGCC TGCTCGCCCC GGATGCGCCG CCGAAAGTTA TCAGCGTCAT GTTTAGCCTG ATCGTGGTGC TGTTCACCAT GCTGTTTGGT GCCCGTCGCC ATCAGGCCTC CGGCAACCAT CAGGGGCTGG TACTGGCCAT CGCCTTCGAC TCCCTGGTGA AAGTGGTTGC CCTCCTGGTT TTAGGCGGCG TTATCCTGTT CTCGGTTTTC GGTGGTACCA ATGGGCTTGA GGCCTGGTTG GCGGAAACTT CACATCCGGA AGCCACCATG ACCATGGCAA TTAATGACGG AAGCTGGCGG GCACTGCTGC TGATGTCGTT CGCCGCGGCC CTGGTGCTCC CCCACATGTA CCACATGACT TTCAGTGAGA ACCCGTCCCC CCGTGCCCTC GCGAAGGCCA GCTGGGGATT GCCACTGTAC CTGCTGCTGG TCGGCCTGCC GGTGCCGCTG ATTGTCTGGG GTGGCCAGGC GCTGTCGGTT GACGCTCCGC CGGCCTTTTT CAGTATCGGC GTAGCCCAGG CCCTGGAGAG CCCCGTCCTG ACCATGGCCA TGTACATTGC CGGGCTTTCC GCCGCCAGCG GACTGATGAT CGTCAGTACC CTGGCCCTGT CCGGCATGGT GCTTAACCAC GTTGTTCTGC CGCTCAAGAC GCCCCGGGAT CAAGGTGACA TCTATCGCTG GCTGCAGTGG GTCAAACGCC TACTGATTGC CGTCATCCTG TTTCTGGCCC TGCTGTTCCA TGAAACCATT GGCCAGCACC TGGACCTGTC TATTCTGGGC GCCATCTCGC TCTCCGGCAC ACTGCAGCTG CTGCCGGGCG CGCTTGGCGT CATCTACTGG CCCGAAGGCA ACCGCCGTGG CCTGATTGCC GGCCTGTTTA CCGGCCTGTT CATCTGGTTG ATCACCTTGG TGTTGCCGTT CTCGGTGGCG GCAGACATCC TGTCCTGGCT TGAGCTGCCG GTTACGCCCG GCTACGACAA CTGGCACCTG TTCACCTTCG TCAGCCTGAC CGCCAACATC AGTGTGTTCG CGCTGATCTC AATTCTCAGC CCGGCATCAC CGGATGAAAC CAGTGCCGCT CAGGCGTGCT CGCTCGGCGC CCTGTCCCGG CCGCAGCGGC GGGAACTGCT GGCAGCATCC TCCAACGAAT TCGTGCAGCA ACTGGCCGAG CCACTGGGCA TCCGGGTCGC CCGTCGGGAA GTGGAGCGGG CCCTGAACCA GCTCAAGCTC CCCAATGTCG AGTTCCGGCC TTACCAGTTG CGGCGGTTGC GGGATCAAGT GGAAATCAAC CTGTCTGGCC TGTTGGGGCC TTCGGTCGCA CGGGACATCG TCAAGCGCCA CCTGGGCTTC AAACCGCTGA CCCATGGAGG TACCGGACAG 
GATATTCGCT ATGTGGAGCG AGCGCTGGGT GATTACCAGA ATCAGCTCAC TGGCCTGGCC GGCGAACTGG ACAACCTTCG CCGCCACTAC CGGCAGACCC TGCAAAACCT GCCCATCCCC GCCTGTTCAG TGGGTGAAGA CGGCGAAATT CTGATGTGGA ATCATGCCAT GGAGAGCCTC ACCAACATTT CCGCCGAGGA AGTGGTCGGG GCCAAGCTGC TGGCACTGCC GGAGCACTGG CACGCCCTGC TGGACGATTT CAATCGAGGC GACGACCTGC ACCGCTACAA GCACAGGTTG GATCTTCGCG GCAAGCCACA TTGGCTGAAC CTGCACAAGG CGGCGCTAAG CGGCCCGGAT CACACCGAAG GCGGCTCGAT CATCCTGGTG GAAGACCAGA CCGAAACCCG CCTGCTGGAA GATGAATTGA TGCACAGCGA ACGACTGGCC TCCGTAGGGC GCCTCGCGGC CGGCGTCGCT CACGAAATCG GTAATCCCGT CACCGGCATC TCGTCTCTGG CCCAGAACCT GAAACTGGAA ACCGATGATC CCTCCATTCT GGAAACCGCA GACCAGATCC AGCAGCAAAC CCGACGCATA TCCACCATCC TGCAGTCGCT GATGAACTTC GCACGTACCG GCAACCACGC CCACGCCAAC CGTTATGAGC CGGTCAGCCT GCACCGCTGC ATTGATGAAT CCATCAACCT GCTGTCGCTC AGTGACAAGG GGCTCGGCAT CAGTTTTCTG AACGAGTGCC CCGCCAGCCT GCAGGTGCTG GGAGATGAAC AACGGCTGGT GCAGGTATTC ATCAACCTGC TGGCCAACGC CCGGGACGCC TCCCCAGACG GCGGTACCAT CCGGGTTGCT GGAAAGGCCG ACGGCTACTC CGCTATCATC GAGGTCATTG ACGAAGGCTC CGGCATACCG GCAGACCAGC TGGATCATAT TTTCGAACCG TTTTACACCA CCAAGGCCCC CAACAAAGGA ACCGGGCTTG GCCTGTCACT GGTGTACAGC ATCATTGAAG AGCACTATGG CAACATTCAG GTGGAAAGCC CGGCCAATCC GGAAACCGGC CTGGGCACCT GCGTGAGACT CCGCCTGCCT GCCTACGAGC CTGATACCGA CAGCGGTTAC AGCAGCCCGA ACGAAAGGTC GTGA
|
Protein sequence | MSFSATGLLL ASLFYLIVLF GIAWGTERGT LLRRWVRHPL IYTLSLGVYA GIWAVYGAIG MAADSGYGFL AYYLGISGAF LLAPVLLNPI LRIGRAYQLT SLADLFAYRY RSQWAGTLVT LCSAAAILAL LSMQIQAVSV SASLLAPDAP PKVISVMFSL IVVLFTMLFG ARRHQASGNH QGLVLAIAFD SLVKVVALLV LGGVILFSVF GGTNGLEAWL AETSHPEATM TMAINDGSWR ALLLMSFAAA LVLPHMYHMT FSENPSPRAL AKASWGLPLY LLLVGLPVPL IVWGGQALSV DAPPAFFSIG VAQALESPVL TMAMYIAGLS AASGLMIVST LALSGMVLNH VVLPLKTPRD QGDIYRWLQW VKRLLIAVIL FLALLFHETI GQHLDLSILG AISLSGTLQL LPGALGVIYW PEGNRRGLIA GLFTGLFIWL ITLVLPFSVA ADILSWLELP VTPGYDNWHL FTFVSLTANI SVFALISILS PASPDETSAA QACSLGALSR PQRRELLAAS SNEFVQQLAE PLGIRVARRE VERALNQLKL PNVEFRPYQL RRLRDQVEIN LSGLLGPSVA RDIVKRHLGF KPLTHGGTGQ DIRYVERALG DYQNQLTGLA GELDNLRRHY RQTLQNLPIP ACSVGEDGEI LMWNHAMESL TNISAEEVVG AKLLALPEHW HALLDDFNRG DDLHRYKHRL DLRGKPHWLN LHKAALSGPD HTEGGSIILV EDQTETRLLE DELMHSERLA SVGRLAAGVA HEIGNPVTGI SSLAQNLKLE TDDPSILETA DQIQQQTRRI STILQSLMNF ARTGNHAHAN RYEPVSLHRC IDESINLLSL SDKGLGISFL NECPASLQVL GDEQRLVQVF INLLANARDA SPDGGTIRVA GKADGYSAII EVIDEGSGIP ADQLDHIFEP FYTTKAPNKG TGLGLSLVYS IIEEHYGNIQ VESPANPETG LGTCVRLRLP AYEPDTDSGY SSPNERS
|
| |
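The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that the sequence is listed in coding orientation (it begins with ATG) and that the spaces between 10-letter blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record
print(translate("ATGAGTTTTA GCGCCACCGG CCTGCTCCTG"))
# Last 24 bases of the gene sequence, ending in the TGA stop codon
print(translate("AGCAGCCCGA ACGAAAGGTC GTGA"))
```

The two fragments translate to MSFSATGLLL and SSPNERS, which match the first ten and last seven residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.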