Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4018 |
Symbol | |
ID | 3969208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4467088 |
End bp | 4468095 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637927122 |
Product | thiosulphate-binding protein |
Protein accession | YP_533863 |
Protein GI | 90425493 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.206129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.132982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGTC TTGCCCTCGC CTTCATCGCC GGTCTCGGCG CCCTCACCGC GGTGTCCGCG GCGTGGGCGC AGACGCCGGC CACGCTGCTC AACGTCTCCT ACGACATTTC GCGCGAGCTC TATGTCGAGA TCAACGCCGC CTTCACCAAG CAATGGAAGG CCAAGACCGG CCAGGACGTC ACCATCAACC AGTCGCACAA CGGCTCGTCG CGGCAGGCCC GCTCGATCCT CGAAGGGCTC GAGGCCGACG TCGTGACCTT CAACCAGGTC ACCGACGTGC AGGTGCTGTA CGACAAGGGC AAGCTGATCC CGGCGGACTG GGCCAAGCGG CTGCCGAACA ATTCCTCGCC GTATTACTCG CTGCCGGCAT TCCTGGTGCG CGCCGGAAAT CCCAAGGCCA TCAAGGATTG GGACGATCTG GTGAAGCCCG GCGTGCAGGT GATCTTCCCC AACCCGAAGA CCTCGGGCAA TGCCCGCTAC ACCTATCTCG CCGCCTATGC CTTCGCGAAG CACAAGTACG GCAACGAGGC CGAGGCTGAT GCCTTCATCA AGAAGCTGTT CGGCAACGTG CCGGTGTTCG ACACCGGCGG TCGCGCCGCC ACCACCACCT TCATCGAGCG GCAGACCGGC GACGTGCTGA TTTCCTTCGA AGCCGAGACC AGCGCGATCC GCGACATCGC CGGCAAGGAC AAGTACCAAG TGGTGGTGCC GCCGACCAGC CTCTTGGCGG AATTCCCGGT CAGCGTGGTC GACAAATACG CCGACAAGCA CGGCACCAGG CCGCTCGCCA CCGCCTATCT GGAATATCTT TACTCGCCGG AGGGACAGAC CATTTTGGCC AAGGCCTATA ACCGGGTGAA CGACAAGGCC GTGGCGGAGC AATTCAAGGA CAAGTTCCCC GAGGTCAAGC TGTACCGGGT CGAGGACGAA TTCGGCGGCT GGGACAAGCT CACCGCCGAT CACCTCGCCT CCGGCGCCAA GCTCGATCAA TTGTTCGGCG GACGCTAG
|
Protein sequence | MNRLALAFIA GLGALTAVSA AWAQTPATLL NVSYDISREL YVEINAAFTK QWKAKTGQDV TINQSHNGSS RQARSILEGL EADVVTFNQV TDVQVLYDKG KLIPADWAKR LPNNSSPYYS LPAFLVRAGN PKAIKDWDDL VKPGVQVIFP NPKTSGNARY TYLAAYAFAK HKYGNEAEAD AFIKKLFGNV PVFDTGGRAA TTTFIERQTG DVLISFEAET SAIRDIAGKD KYQVVVPPTS LLAEFPVSVV DKYADKHGTR PLATAYLEYL YSPEGQTILA KAYNRVNDKA VAEQFKDKFP EVKLYRVEDE FGGWDKLTAD HLASGAKLDQ LFGGR
|
| |