Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1378 |
Symbol | |
ID | 4021855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1547688 |
End bp | 1548695 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637961571 |
Product | thiosulphate-binding protein |
Protein accession | YP_568517 |
Protein GI | 91975858 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGTT TCGCCCTTGC ATTGTTCGCT GGTGTCGGCG CGCTCGTAAC CGCGCTGCCT GCGCAGGCGC AGACGTCGAC CACGCTGCTC AACGTCTCCT ACGATATCTC CCGCGAGCTC TACGCCGAGA TCAACGCGGC GTTCATTCCG CAATGGAAGG CGAAAACCGG ACAGGACATC GCCATCAACC AGTCGCATGG CGGCTCGTCC CGGCAGGCGC GCTCGATCCT CGAAGGGCTC GAAGCCGACG TGGTGACCTT CAACCAGGTC ACCGACGTCC AGGTGCTGCA TGACAAAGGC AAGCTGATCC CGGCCGACTG GGCGAAGCGG CTACCGAACA ATTCCTCGCC GTATTATTCG CTGCCGGCGT TTCTGGTTCG CGCCGGCAAC CCGAAAGGGA TCAAGGATTG GGACGATCTG GTGAAGCCGG ACGTCAAGGT GATCTTCCCG AATCCGAAGA CCTCGGGCAA CGGCCGCTAT ACCTATCTGG CGGCCTATGC CTTCGCGAAG CAGAAATACG GCAACGAGGC GGAGGCCGAC GCGTTCATCA AGAAGCTGTT CGCCAATGTG CCGGTGTTCG ACACCGGCGG CCGCGCCGCG ACGACGACCT TCGTCGAGCG TCAGACCGGC GACGTGCTGA TCACTTTCGA GGCCGAAACC AGCTCGATCC GCGACCTCGC CGGAGCCGAC AAGTATCAAG TCGTGGTGCC GCCGACCAGC CTGCTGGCCG AATTCCCGGT CAGCGTGGTC GACAAATACG CCGACAAGCA CGGTACCCGC GCGCTCGCCA CCGCCTATCT CGAATATCTG TATTCGCCCG AGGGCCAGAC CATCCTCGCC AAGGCGTATA ACCGCGTGCA AGACAAGGCC GTGATCGAGA AGTTCAAGGA CAAGTTCCCG GAGGTGAAGC TGTACCGGGT CGAGGACGAA TTCGGCGGCT GGGACAGGCT CAACGCCGCG CACCTCGCCT CCGGCGCCAA GCTCGATCAG CTGTTCGGCG GACGGTGA
|
Protein sequence | MNRFALALFA GVGALVTALP AQAQTSTTLL NVSYDISREL YAEINAAFIP QWKAKTGQDI AINQSHGGSS RQARSILEGL EADVVTFNQV TDVQVLHDKG KLIPADWAKR LPNNSSPYYS LPAFLVRAGN PKGIKDWDDL VKPDVKVIFP NPKTSGNGRY TYLAAYAFAK QKYGNEAEAD AFIKKLFANV PVFDTGGRAA TTTFVERQTG DVLITFEAET SSIRDLAGAD KYQVVVPPTS LLAEFPVSVV DKYADKHGTR ALATAYLEYL YSPEGQTILA KAYNRVQDKA VIEKFKDKFP EVKLYRVEDE FGGWDRLNAA HLASGAKLDQ LFGGR
|
| |