Gene Information |
Locus tag | RPD_4292 |
Symbol | |
ID | 4024815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4753184 |
End bp | 4754422 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637964500 |
Product | putative urea/short-chain amide transport system substrate-binding protein |
Protein accession | YP_571410 |
Protein GI | 91978751 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03669] urea ABC transporter, substrate-binding protein, archaeal type |
|
|
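The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length) and that the CDS ends in a stop codon that is not counted in Protein Length. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4753184
end_bp = 4754422
gene_length_bp = 1239
protein_length_aa = 412


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 4754422 - 4753184 + 1 = 1239 and 1239 / 3 - 1 = 412.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The same two checks are a quick way to spot off-by-one coordinate conventions when comparing records from different sources.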
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0103855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATCGA GACGACTGCT CCAAACTGCT TTTCTCGGCC TCGCGCTCGG CGCCATTGCG CCGTTGGCGA CGCAGGCGGC CGAACCGCCG CTGAAGGTCG GCCTGCTCGA AGACATCTCC GGCGATCTCG CCTTCATGGG TATGCCGAAG TTGCACGGCT CGCAGCTCGC GGTCGAGGAA ATCAACAAGA GCGGCGGCAT CCTCGGCCGG CAGATCGAGC TGATCCATCT CGACCCGCAG GGCGACAACG CCCGTTACCA GGAGTTTGCC CGGCGGCTGC TCAATCGCGA CAAGGTCGAC GTCCTGATCG GCGGCATCAC CTCGGCGGCG CGCGAAGCAT TGCGTCCGAT CGTCGGCCGC ACCTCGACGC CGTATTTCTA CACGAACCAG TATGAAGGCG GCGTCTGCGA CGCCAGCATG ATCAGCATGG GCGCGGTGCC CGAGCAGCAG TTCTCGACGC TGGTTCCCTG GATGGTGGAG AAGTTCGGCA AGAAGGTCTA CGTCGTCGCC GCCGACTACA ATTTCGGCCA GATCTCGGCG GAATGGAACC GCAAGATCAT CAAGGATCTC GGCGGGCAGG TGGTCGGCGA GGAGTTCATC CCACTCGGAG TCTCGCAATT CGCGCAGACC ATCCAGAACA TCCAGAAGGC GAAGCCCGAC TGGTTGCTGA CGATCAATGT CGGCGCCGCG CAGGATTCGT TCTTCGAACA GGCGGCCGCG GCCAATCTCA ATCTGCCGAT GGGGTCGTCG ATCAAGGTGA TGCTCGGCTT CGAGCACAAG CGCTTCAAGC CGCCGGCGCT CAACAACATG CACGCCACCG CGAACTGGTT CGAGGAAATC GCCACGCCCG AGGCGGAGGC TTTCAAGAAG CGCTGGCGCG CCAAGTTCCC CGACGAAACC TACATCAACG ACATGGGCTA CAACGCCTAC AACGCGCTGT ACATGTACAA GACGCTGGCG GAAAAGGCGA AGTCGACCAA GCTCGAAGAC CTCCGCAAGG TGATCGCGAC CGGCGAAGCC TGCATCGATG CGCCCGAAGG CAAGGTCTGT ATCGATCCGA AGAGCCAGCA CACGTCGCAC CGGATGCGTC TGATCTCGGT CGGACCCAAG CACGACGTCA CGGTCGTCAA GGACTACGGC ACGATCCAGC CCTACTGGCT CGGCGAGGTC GGCTGCGACC TCACCAAGAA GAACGACAAG GAACAGTACA CGCCCAATCA GCTGCCGAAG AAGTCGTGA
|
Protein sequence | MRSRRLLQTA FLGLALGAIA PLATQAAEPP LKVGLLEDIS GDLAFMGMPK LHGSQLAVEE INKSGGILGR QIELIHLDPQ GDNARYQEFA RRLLNRDKVD VLIGGITSAA REALRPIVGR TSTPYFYTNQ YEGGVCDASM ISMGAVPEQQ FSTLVPWMVE KFGKKVYVVA ADYNFGQISA EWNRKIIKDL GGQVVGEEFI PLGVSQFAQT IQNIQKAKPD WLLTINVGAA QDSFFEQAAA ANLNLPMGSS IKVMLGFEHK RFKPPALNNM HATANWFEEI ATPEAEAFKK RWRAKFPDET YINDMGYNAY NALYMYKTLA EKAKSTKLED LRKVIATGEA CIDAPEGKVC IDPKSQHTSH RMRLISVGPK HDVTVVKDYG TIQPYWLGEV GCDLTKKNDK EQYTPNQLPK KS
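The gene and protein sequences above can be checked against each other with a small translation routine. This is a minimal sketch using only the Python standard library, not the pipeline the database itself uses. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code, differing only in which start codons are permitted. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces that separate 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first three blocks of the gene sequence give the first ten residues of the protein.
assert translate("ATGCGATCGA GACGACTGCT CCAAACTGCT") == "MRSRRLLQTA"
# The last four codons of the gene sequence give the final residues, then stop.
assert translate("AAGAAGTCGTGA") == "KKS"
```

Passing the full gene sequence to `translate` and `gc_fraction` allows comparison with the Protein sequence and GC content fields of the record.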