Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1803 |
Symbol | |
ID | 4895916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1900373 |
End bp | 1903057 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112397 |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_001043682 |
Protein GI | 126462568 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.27583 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCG ATCTCCTCGT TCTCTCCTGC CTCTGCTACG TCATCCTGCT CTTCCTCGTG GCCTTCCTCG TCGAGCGCCG GGCGGTGGGC GAGGCGCTGC CGTGGCTGCG CTCGCCGCTC GTCTATACGC TCTCGCTCTC GATCTACTGC ACCGCCTGGA CCTTCTACGG CGCGGTGGGA TATGCGGCGC GCTCGGGGCT CGAATATCTC ACGATCTATC TCGGGCCGAC GCTCGTCTTC GTCGGCTGGT GGTGGGTGCT GCGCAAGCTG GTGCGGATCG GGCGGCAGCA GCGCGTGACC TCGATCGCGG ACCTGATCTC CTCGCGCTAT GGCAAATCGA ACCTGCTCGG CGTGCTCGTC ACGCTCCTCT GCGTGGTGGC CGCCACGCCC TATATCGCGC TCCAGCTCCA GTCCGTCACG CTGAGCTTCA GCGTCTTCGC CAGCGGCCCG CACGAGGGCT GGCAGATGCC CGACCGCGAC AGCACCGCGA TCTGGGTCGC GGGGGGGCTT GCGCTCTTCA CCATCCTCTT CGGCACGCGG AAGCTCGATG CGAACGAGCA GCATACGGGC ATCGTCACCG CCATTGCCGT CGAGGGAATC GTGAAGCTGG TCGCGCTGGT GGCGGTGGGC AGCTACGTCG TCTGGGGCGT GGCCGACGGG CCCGCCGACG TGATCGCCCG CATCGACCGC TCGGCCATCG CCGACTGGGA ACTCGCGCCC TCGCGCTGGG CGGGCCTGAC CTTCCTGTCG GCCGCAGCGG TGGTCTGCCT GCCGCGCATG TTCCAGGTGC TGGTGGTCGA GAACGGGGAC GAGCGCCACC TCGCCACGGC GAGTTGGGCC TTCCCGCTCT ACATGTTCCT GATGAGCCTC TTCGTGGTGC CGATCGCGGT CGTGGGGCTC GAGCTTCTGC CCGAGGGGGC CAACCCGGAC CTGTTCGTCC TGACCGTGCC GCTCGACCGC GGGCAGGGCG GGCTGGCCAT GCTGGCCTTC CTCGGCGGCT TTTCCTCGGC CACGGCGATG GTGATCGTGG CCGCGGTGGC GCTGGCCACG ATGGTGTCGA ACCATATCGT CATGCCGCTC TGGCTGTCGC TGCGCCCGAT GCCGGCCGTC AGCGGCGACG TGCGCCGCCT TGTGCTGCTC TCCCGGCGGA TCTCCATCGC GGGCGTTCTG GCGCTGGGCT ACGGCTATTA CCGCTTCTCG GGCGGCTCGA GCGCGCTGGC CGCGATCGGC CTCATCTCCT TCGTGGGCGT GGCGCAGATC CTGCCCGCGA TGCTGGGCGG CATCTTCTGG CGCGGCGCCA CCCGGACGGG CACGCTCTGG GGCCTCAGCC TCGGCTTTGC GGTCTGGGTC TATACGCTCT TCCTGCCCTC CTTCGGCCCG GATGCGATCT TCTCGGCCCG CCTCCTCGCC GAGGGACCGT TCGGGATCGG CTGGCTCCGT CCGCAGGCGC TGTTCGGGGT CGAGGGGATG GATCCGCTGA TCCATTCGTT GTTCTGGTCG CTCATCCTGA ACGGCACCGT CTTCTGCGGC GCCTCGCTCG TGACCTTCCC CGGCCCGCTC GAGCGGTTGC AGGGCGCGGC CTTCGTCAAT GTGTTCGAGA CCGGCAACCG CGGCCCGCAG GGATGGGCGC AGGGCAAGGC CGAATCCGAG GAGCTGCTGG TGATGGCGCA GCGGATCCTC GGCGAGGAGC CGGCGCTGGC GCTCTTCCAG TCCGAGACGC GCGCGCAGGG AAAGGAGGGC TATCTGCCCG ATCCGGTCCC CGGCTTCGTC GAGCGGCTCG AGCGGCGGCT TGCGGGCTCG GTCGGCGCGG CCACGGCCCA TGCGATGCTG GCCCAGCTGG CCGGGCGCGC CGCGGTCTCG GTCGAGGATC TGATGGCGGT GGCGAACGAG ACCGCACAGA TCATGGAATA TTCCGCGCGG CTCGAGGCCC AGCAGGACGA GCTGACCCGC ACCGCGCGCC AGCTGCGCGA GGCCAACGAG AAGCTCACGC AGCTGTCGGT GCAGAAGGAC GCCTTCCTCA GCCAGATCAG CCACGAGCTG CGCACCCCCA TGACCTCGAT CCGCGCCTTC TCCGAGATCC TGATGGACGG CGATCTGCCG CCCGAGATGG CCGCGCGCCA CGCCCGCATC ATCCATGACG AGGCCATCCG GCTCACCCGG CTGCTCGACG ACCTGCTCGA CCTTTCCGTG CTCGAGAACG GATCGGTGCA GCTCGACCTC GGGCTCGCCA ATCTCCAGCA GATGATCGAC CGCGCCCTGA GCTCGGCCGC CCAAACCCGC CCCGAGCGCG GCTTCACCAT CCATCGCGAT CCGGCGGCCG AGAACATCTT CCTGTTCACC GATGGCGACC GGCTGGCGCA GGTCTTCATC AACCTGATCT CGAACGCGCG CAAATACTGC GACGCGGACT ATCCCGAACT GCGGATCTCG GTGCGCCAGA AGGGCGGGCG CGTGACGGTC GATTTCATCG ACAACGGTTC CGGGATCTCG AAGGAGAGCC AGGAGCTGAT CTTCGAGAAG TTCGCCCGGC TTTCGGACCA GACCCGGGCC GGCGGGGCGG GCCTCGGCCT CGCGATCTGC CGCGAGGTGA TGGCCAACCT CGGCGGCACC ATCACCTACC TGCCGGGACA GGGGGGCGCT GCCTTCCGCG TGACCCTGCC TCTGCGGCTC CAGCGCGCCG CCTGA
|
Protein sequence | MPFDLLVLSC LCYVILLFLV AFLVERRAVG EALPWLRSPL VYTLSLSIYC TAWTFYGAVG YAARSGLEYL TIYLGPTLVF VGWWWVLRKL VRIGRQQRVT SIADLISSRY GKSNLLGVLV TLLCVVAATP YIALQLQSVT LSFSVFASGP HEGWQMPDRD STAIWVAGGL ALFTILFGTR KLDANEQHTG IVTAIAVEGI VKLVALVAVG SYVVWGVADG PADVIARIDR SAIADWELAP SRWAGLTFLS AAAVVCLPRM FQVLVVENGD ERHLATASWA FPLYMFLMSL FVVPIAVVGL ELLPEGANPD LFVLTVPLDR GQGGLAMLAF LGGFSSATAM VIVAAVALAT MVSNHIVMPL WLSLRPMPAV SGDVRRLVLL SRRISIAGVL ALGYGYYRFS GGSSALAAIG LISFVGVAQI LPAMLGGIFW RGATRTGTLW GLSLGFAVWV YTLFLPSFGP DAIFSARLLA EGPFGIGWLR PQALFGVEGM DPLIHSLFWS LILNGTVFCG ASLVTFPGPL ERLQGAAFVN VFETGNRGPQ GWAQGKAESE ELLVMAQRIL GEEPALALFQ SETRAQGKEG YLPDPVPGFV ERLERRLAGS VGAATAHAML AQLAGRAAVS VEDLMAVANE TAQIMEYSAR LEAQQDELTR TARQLREANE KLTQLSVQKD AFLSQISHEL RTPMTSIRAF SEILMDGDLP PEMAARHARI IHDEAIRLTR LLDDLLDLSV LENGSVQLDL GLANLQQMID RALSSAAQTR PERGFTIHRD PAAENIFLFT DGDRLAQVFI NLISNARKYC DADYPELRIS VRQKGGRVTV DFIDNGSGIS KESQELIFEK FARLSDQTRA GGAGLGLAIC REVMANLGGT ITYLPGQGGA AFRVTLPLRL QRAA
|
| |