Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3059 |
Symbol | |
ID | 5210027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3842132 |
End bp | 3844057 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640596651 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001277373 |
Protein GI | 148657168 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00716616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGGGGT CTGGTGTGGA TGCAGCTGAA CGGATCAGTC TGCTCCTCAT CGAGGACGAT GAGTCGCATG TTGCGCTGAT CGAGCGCGCC TTCGAGTCAT GGCGATCATT GTTCGATCTG TCGGTTGTGC ACACGTTGCA GGAGGCATAC GCGCTGCTCT CCGGTGATTC GTCCGACTTC GATCTGATCG TGTGCGATTG GCGATTGCCC GACGGCGAAG GGTTGGAACT GCTGGACTTC AACCGTGCGC TGCCGGTCAT TCTGATGACC GGGTATGGCG ATGAACGGGT GGCAGTCGAA GCGATCCGTT CTGGCGCGCT CGATTATATT GTGAAGAGTG ATGCCGCATT CGCCGATCTG CCCCATGTGG CGCAGCGTGC CATCCGCCAG TGGCGGGCGA TCCAGGCGCA ACGTCGCGCC GAACAGGAGT TGCGCGAAAT TGAGGCGCGC TACCGCCTCA TTACCGAGAA CACGTCTGAT CTGATCGCCA TTCTCGACGA CCAGCACCAC TTCCAGTACA TCAGCCCTTC GGTCACGACG CTGTTGGGGC ATACCGCAGA AGCGTTGGTC GGACAGGATG CATTCTTCCT TATCCATCCC GATGACCTTC CGTACAGCGA GGAGTACTGG TGCACCATGC TCCAGCGTAC CCGCGCCACT GCGACGTTTC GCTATCGTCA TGCCAGCGGC GCCTGGCGCT GGATCGAATG CAATATCAGG GCGGTCGAGC AAGGCGGCGG GTTGACGGCG ATTGTCGTGG GGCGCGATAT TACCGAGCGC CGTGAACTGG AAGAACGTTT ACTCCAGATC CAGAAAATGG ATGCGCTTGG TCGGTTGTCG GGCGGCATTG CGCACGATTT CAACAACATG CTGGCAGTGA TTGCCGGTTG CACCGAACTG GCGCGTCAAC TGATCCCTGA TGACCACCCG GCGACCGGAG AACTGATCGA AATCCAGCAT GCGACCGAGC GCGCAACGGC GCTGACCCGT CAGCTGCTGG CTTTTGCGCA CCGACAGCGG TTCGAGCCGC GCCCCATCGA TCTGAACACG CTTATTCTCG ACATGCAGAA ACTGCTGCGT CGCCTGATCC GCGAGAATAT CACCCTGAAC ACCCGTCTTG CCCCCGATCT CTGGCTTGTG CGCGCCGATC CTGGTCAGAT CGAGCAGGTG CTGGTCAATC TGGCGATCAA CGCGCGTGAT GCGATGGTCG ATGGCGGTGT GTTGACGATT ACAACCGAAA ATGCTGTGAT CGACGATGCG TTCGACAACC GCCATCCTTC GCTGAATCCG GGCATGTATG TCCGGTTGAC GGTCAGTGAC ACCGGCGTCG GCATGGACGA AGAAACGCGC AGGCGCGCCT TCGAGCCGTT CTTTACCACG AAGAAGCCCG GCGAGGGAAC CGGTCTCGGT CTGGCGACCT GCTACGGGAT TGTCGTCCAG CACGGCGGCG CCATCGAACT GACGAGCGAA CCGGGATGCG GCACAACGGT CATTATCTAT CTGCCGCGCG CCTATCCCTC CGAAGAGACA GTCGCCCTGG AGACTGGCGA GCACATCGAT CTCAAAGGCT CGGAAACGAT CCTGCTGGTC GAGGATGATC CGGCGGTGCG CACGCTGACA GCGCGGGTGC TGCGCACCCA TGGGTATACC GTTCTGGAAG CGGGTGACGG ACATGAAGCG TTGACGCTCG CCGGTGAACG TCCGATCCAC CTTCTCTTGA GTGACCACGT TATTCCGCAT ATGAGTAGTG AGTCGCTGGC ACACTATCTG ACCGCTCTGA TCCCGCAGAT GAAGGTGCTC TTTATGAGCG GGTATGTCGT TGAGCCGCGC CCCGATGTTC AGGCGCTCCA GGAAGCGACG GTTCTTCAGA AGCCATTCAC TCCAACTGCG CTCCTTCAGA GGGTTCGCGC TGTCCTGGAT TCGTGA
|
Protein sequence | MMGSGVDAAE RISLLLIEDD ESHVALIERA FESWRSLFDL SVVHTLQEAY ALLSGDSSDF DLIVCDWRLP DGEGLELLDF NRALPVILMT GYGDERVAVE AIRSGALDYI VKSDAAFADL PHVAQRAIRQ WRAIQAQRRA EQELREIEAR YRLITENTSD LIAILDDQHH FQYISPSVTT LLGHTAEALV GQDAFFLIHP DDLPYSEEYW CTMLQRTRAT ATFRYRHASG AWRWIECNIR AVEQGGGLTA IVVGRDITER RELEERLLQI QKMDALGRLS GGIAHDFNNM LAVIAGCTEL ARQLIPDDHP ATGELIEIQH ATERATALTR QLLAFAHRQR FEPRPIDLNT LILDMQKLLR RLIRENITLN TRLAPDLWLV RADPGQIEQV LVNLAINARD AMVDGGVLTI TTENAVIDDA FDNRHPSLNP GMYVRLTVSD TGVGMDEETR RRAFEPFFTT KKPGEGTGLG LATCYGIVVQ HGGAIELTSE PGCGTTVIIY LPRAYPSEET VALETGEHID LKGSETILLV EDDPAVRTLT ARVLRTHGYT VLEAGDGHEA LTLAGERPIH LLLSDHVIPH MSSESLAHYL TALIPQMKVL FMSGYVVEPR PDVQALQEAT VLQKPFTPTA LLQRVRAVLD S
|
| |