Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2183 |
Symbol | |
ID | 4022668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2437866 |
End bp | 2438936 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637962378 |
Product | putative taurine transport system protein |
Protein accession | YP_569319 |
Protein GI | 91976660 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR03427] ABC transporter periplasmic binding protein, urea carboxylase region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.746954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAAC CTAGCTTGAT CTATTCCCTG ATCATCGCCG GCTCGATCGT CGCGCTCTCC GCGGCCGGGG CCTCGGCTGC CCCGAAGAAG GACTTCAAGG TCGCCTGGTC GATCTATGTC GGCTGGATGC CGTGGGGTTA CGCCGCCGAC ACCGGCATCG TCAAAAAATG GGCCGACAAA TACGGCATCA CCATCGAGGT GAAGCAGTTC AACGACTACG TCGAGTCGAT CAACCAATAC ACCGCCGGCG CCTATGACGC GGTGACGATC ACCAACATGG ACGCATTGTC GATCCCCGCC GCGGGTGGCG TCGATACCAC CGCCGTGGTG ATGGGGGACT TCTCCAACGG CAACGACGCG GTGATCCTGA AGGGCAAGGC CGATCTCGCC GCGATCAAGG GGCAGAAGGT CAACCTGGTC GAATTCTCGG TCTCGCACTA TCTGCTGGCG CGCGCGCTGG AGAGCAAGCA GCTCAGCGAG AAGGACATCA AGGTCGTCAA CACCTCCGAC GCCGACCTCG CCGCGGCTTA CAAGACGGCG GACGTCACCG CTGTGGTTAC CTGGAATCCG ATCGTCTCGG AAATCCTGTC CGCTCCCGAC GCCAAGAAGG CATTCGACTC CTCGCAAATC CCCGGCGAGA TCATGGATCT GATGGTCGCC AACAGCGCAG TGGTGAAGGA CAATCCGGAC TTCGCCAAGG CTCTGGTCGG GATCTGGTAC GAGACTATCG CCAAGATGAA CGCGACCGGC GACGAGGGCA AGGCCGCCAA GGAAGCGATG GCGAAAGCCT CCGGCACTGA TCTCGCCGGC TTCGACAGCC AGCTCGCCTC GACCAAATTG TTCGACAAGG CTGCGGAGGC GGAAGCGTTC ACCAGGAGCG CCACGGTCGG CACGACCATG GATCGCGTCC GCAAATTCCT GTTCGAGAAG GACCTGCTCG GCAAGGGCGC GAAGTCGGCT GACGCGGTCG GCATCGAACT GGCGGACAAG ACGGTGCTCG GCGACAAGTC GAACGTAAAG CTGCGCTTCG ACGCGACCTA CATGGACGCC GCCGCCAAGG GCAAGCTGTG A
|
Protein sequence | MRKPSLIYSL IIAGSIVALS AAGASAAPKK DFKVAWSIYV GWMPWGYAAD TGIVKKWADK YGITIEVKQF NDYVESINQY TAGAYDAVTI TNMDALSIPA AGGVDTTAVV MGDFSNGNDA VILKGKADLA AIKGQKVNLV EFSVSHYLLA RALESKQLSE KDIKVVNTSD ADLAAAYKTA DVTAVVTWNP IVSEILSAPD AKKAFDSSQI PGEIMDLMVA NSAVVKDNPD FAKALVGIWY ETIAKMNATG DEGKAAKEAM AKASGTDLAG FDSQLASTKL FDKAAEAEAF TRSATVGTTM DRVRKFLFEK DLLGKGAKSA DAVGIELADK TVLGDKSNVK LRFDATYMDA AAKGKL
|
| |