Gene Information |
Locus tag | RoseRS_1308 |
Symbol | |
ID | 5208260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1612153 |
End bp | 1614837 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640594923 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001275662 |
Protein GI | 148655457 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
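The coordinate and length fields above are arithmetically linked. The sketch below shows that relationship under two assumptions the record does not state: the Start bp and End bp values are 1-based and inclusive, and the gene span includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(1612153, 1614837)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 2685 894
```

Both results agree with the Gene Length and Protein Length fields. The strand does not affect this calculation, since the length is the same on either strand.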
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.455977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.863404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTC GTCGGTCGTG GCATCTGACA GTACGCACCA GGATAACCCT GGTGGCGCTC GTGCTGGCGC TGGCGCCGCT GCTCTTTGTG AGCGCGCTCA GCCTTTCCAC GCTCGACCGC GCCCGCAGCA TCGCGGTGCA GACCGCATCC GAAGCCTTGC GCGAGCAGAC CGAAAGCGAT CTCGCACGCT GGGTGACCGA TAAAGCCAAC CTCTACGACG CGAAACTCGA CCGCATCTAT CATCAGGTCG AAACAATCGT CACGTACCGA CCGGGAAACT TCTCCAATCC GACCGTCGCA TCAGAGCGCG TCTGGATTGC GCCGGACGGT CCCACGCCAG CTGCGTTGCG TTCCCACGCC GCAACGGTGA CGCTTGCCCG TCAGTACATC CCGCTCCTGC GCGCTTCGGT CGGGCAGGAT GGCATGGTGA GCCTGGGGTA TGTCGCGTTT GAGGATGGCG GGGTGCTCGC TTTCGACCAG GATATCATCG ATGTTCTCGA TGCGATCAAG CCCTTTGACC CGCGCAAACG CAGCTGGTAT ATCGCTGCCC GTGATGCTGG CAGAACCGTA TGGGTCGATA CCTATGTCGA TGCCAATACC AAGAAACTGA CAACCACCTG CGCCGCGCCG CTCTACGACG AACGAGGCGC TTTCATCGGC GTCGTTGGCT TTGATGTGCT GTTGGAAACG ATCCAGCAGG ATATTCTTTC GATTGATATT CCGCCGGGCG GTTCGGCATT TTTGATCAAT CAGCGTGGCG ATGTGCTGCT CCACAACAAC CTGCTGAATC GGCAGGGGCG GTGGGATCAA CCGATTGCAA CCGACAATCT GTTGAACGAT CCCAATCCGC AGTTGCGCGA CGCTGCTGCG CGCATGACGC GCGCGGATCA GGGCGTCGTG CGCATGATGT TGCAGGATGA AGAAGTATAT CTGGCATTCG CGCCGATTGC CAGTGCAGGC TGGAGTGTCG GGATCGTCAT CCCGGTTGCC GAAGTCACCG GGTTCGTTCA GCAGGTCAGT GAAGCGATCA CCAGGCGGCA GGAGACGCTG AGCGGTCAGA TCGTCGCAGT GATTGTGCTG AGCGTTGTTG TGGTGGCGGT GCTCAGCATG CCAGCGGCGC TGATCCTGAC GCGCCCGTTG CGTGAGTTGC AGGCGGCGGC GCAGCGGGTC GCCGCCGGTG ATCTGACCTA TCGCGTACCG GAAGAGGGCG CTCCAGAAAT TGCCAATGTC GGTCGTTCAT TCAACACCAT GACCGACGCC TTGCGCGAGA AGATCGGTGA ACTGGAGCTG AACCTGCGTC AACTGGCGGC GCTGAACGAT ATGTCCAACC GCTTCCGCAC GATTGCATCG GTGCGAGATC AACTCGACGC TATTCCGCGC GGGGTGTGCG AATGTCTGGA TTTCGATCGC GCTGTCCTGT ACCTGATCGA ACAGCGCACG CTGCGCCCGG TGAGCGCATG GGTCGGCGAG GGTGACGCCG ATCAGGTTGC CCGGCTTCTG GCGGATGCGG CGCCAATCTC GCTCGATAGC GACTCCATGG CGGCGGATGT GGTCCGCAGC GGGCAGGCGG TGATCATCGG CGAACCGTCC GATCAGGCGC TATGGGGCGC ATCGGTGCAG GCGCCGCTGT TTGGTCACGA GAAGCGCGTG ATCGGGCTGC TGGCAGCCGC CTTCGACGAA CCAGGACGCA CGCCGACGGC GCGTGACGCG GCACAGTTGA TGACCTATGC CGGTATGTCC GGTCTGGCGC TTGAGAATAC GATGCTCTAC GCCGATCTCG AGCGCCAGGT CGCGCAGCGT 
ACCGCCGAGT TGCGCACCGC CCTGGCGCGC GCCCAGGAGG CGGATCGGCT GAAAGGGCAG TTCCTGGCGG CAGTTTCACA CGAACTGCGC ACGCCGCTCA ATGCCATCAT CGGGTTTTCA ACCGTCATGC TCGATGAGAT CGATGGTCCT GTCACACCGC TGCAACGCGA GGATTTGAAG ATCATCAACC GGAATGGTCG CTTCCTGCTG CACCTGATCG ACGATCTCCT CGATCTGGCG CGTATCGAGG CGGGCAAGAT CGAACTGGAA CTGGCGCCGG TTGACGTGCG CGCCCTGATT GTTGAGGTGA CCGAAACGGT GCAGGGGTTG CTGCACAACC GCCCGATTAC GCTCAACCTG GCGTTGCCTG AACGCCTGCC GTATGCGTAT GCCGACGCCG CCAGAATTCG TCAGGTGTTG CTGAACCTGT TGTCCAACGC GGTCAAGTTT ACGAAGCGGG GAAGTATCGA CATCAGCGCA CGGTGTGTGG CGGCGCCCGA CACACAGCCT GGCACGAAAA GCGCAGGTGC GGTGATTGTG CGCAACGGGC AGCGCCTGCA TCCCTACATC GCCGTCAGTG TTCGCGATAC CGGCGTTGGT ATTGCGCCGG AAGATCTCAC ACGCATCTTC GAGGCGTTCC ATCAGGTGCG TGCCGGCGAC CGACAGCACG GCAGCGGGTT GGGTCTGGCG ATCAGCCGAC GTCTGATCGA AGCGCACGGC GGACGTATCT GGGCAGAGAG CGAACCAGGC AAGGGGAGTG TCTTCACGTT CATTCTTCCA TGCACGTTTG TGAAGCGCAA TGGCCGACTG GAAGGCAACG AGGAACCTGC GGATGCGTCA GGCGTCAGGC ATGTTGAAGT GCAAACGTCA CTCATCGACC AATGA
|
Protein sequence | MASRRSWHLT VRTRITLVAL VLALAPLLFV SALSLSTLDR ARSIAVQTAS EALREQTESD LARWVTDKAN LYDAKLDRIY HQVETIVTYR PGNFSNPTVA SERVWIAPDG PTPAALRSHA ATVTLARQYI PLLRASVGQD GMVSLGYVAF EDGGVLAFDQ DIIDVLDAIK PFDPRKRSWY IAARDAGRTV WVDTYVDANT KKLTTTCAAP LYDERGAFIG VVGFDVLLET IQQDILSIDI PPGGSAFLIN QRGDVLLHNN LLNRQGRWDQ PIATDNLLND PNPQLRDAAA RMTRADQGVV RMMLQDEEVY LAFAPIASAG WSVGIVIPVA EVTGFVQQVS EAITRRQETL SGQIVAVIVL SVVVVAVLSM PAALILTRPL RELQAAAQRV AAGDLTYRVP EEGAPEIANV GRSFNTMTDA LREKIGELEL NLRQLAALND MSNRFRTIAS VRDQLDAIPR GVCECLDFDR AVLYLIEQRT LRPVSAWVGE GDADQVARLL ADAAPISLDS DSMAADVVRS GQAVIIGEPS DQALWGASVQ APLFGHEKRV IGLLAAAFDE PGRTPTARDA AQLMTYAGMS GLALENTMLY ADLERQVAQR TAELRTALAR AQEADRLKGQ FLAAVSHELR TPLNAIIGFS TVMLDEIDGP VTPLQREDLK IINRNGRFLL HLIDDLLDLA RIEAGKIELE LAPVDVRALI VEVTETVQGL LHNRPITLNL ALPERLPYAY ADAARIRQVL LNLLSNAVKF TKRGSIDISA RCVAAPDTQP GTKSAGAVIV RNGQRLHPYI AVSVRDTGVG IAPEDLTRIF EAFHQVRAGD RQHGSGLGLA ISRRLIEAHG GRIWAESEPG KGSVFTFILP CTFVKRNGRL EGNEEPADAS GVRHVEVQTS LIDQ
|
| |
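The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be checked against the GC content and protein fields, the code below strips the whitespace, computes the GC fraction, and translates the coding strand. It assumes the gene sequence is already given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, which is consistent with that). The helper names are hypothetical. The amino acid assignments used here are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter for a gene starting with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (also used by translation table 11). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGGCATCTC GTCGGTCGTG GCATCTGACA"))  # MASRRSWHLT
print(translate("CTCATCGACC AATGA"))                  # LIDQ
```

The two printed fragments match the first ten and last four residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.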