Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3020 |
Symbol | |
ID | 3973627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3316463 |
End bp | 3317701 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637926131 |
Product | putative urea/short-chain amide transport system substrate-binding protein |
Protein accession | YP_532884 |
Protein GI | 90424514 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03669] urea ABC transporter, substrate-binding protein, archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.692302 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAA CTGTAGTTCG GGGACTGCAT GCCGCAGTCC TCGCGGGGAC GCTTGTCCTC GCCTCAAATA TGGCCTTCGC GGAGACGCCG ATCAAGCTCG GCGTGCTGGA AGATCAATCC GGCGACTTCG CGGTGGCAAC GATCGGAAAG GTGCACGCGA TTCAGCTCGC CGCGGAGGAG ATCAACAAGT CCGGCGGCAT CATGGGCCGT CCGCTCGAAC TGGTGATCTA CGACACGCAG TCGGACAATA CGCGCTACCA GGAATTCATG CGGCGCGTGC TGCAGCGCGA CAAGGCCGAC GTGGTGTTCG CGGGATTCTC CTCGGCATCG CGCGAAGCCT ATCGCCCGAT CGTCGATCAG CTCAACGGCT TTGCCTTCTA CAACAACCAG TATGAGGGTG GCGTCTGCGA CGGACATATG ATCGTCACCG GCGCGGTCCC GGAGCAGCAA TTCTCGACGC TGATCCCCTA CATGATGCAG GCCTACGGCA AGAAGGTCTA CACGCTCGCC GCCGACTATA ATTTCGGTCA AATCTCGGCC GAGTGGGTCC GCAAAATCGT CAAGGAAAAC GGCGGCGAAA TGGTCGGCGA GGAATTCATC CCGCTCGGCG TCTCGCAATT TTCGCAGAGC ATCCAGAATA TCCAGAAAGC CAAGCCGGAT TTCGTGGTGA CGCTGCTGGT AGGCACCGCG CAAGCCTCGT ACTACGAGCA GGCGGCCTCG GCCAACGTCA ACCTGCCGAT GGCGTCGTCG GTCAATGTCG GCCAGGGCTA CGAGCACAAG CGCTTCAAGG CGCCAAGCCT GAAGGACATG TACGTCACCA CCAACTACAT CGAGGAGATC GACTCCCCGA CGGCCAAGGC GTTCCTGGCA AAGTTCAAGG CCAAATTCCC CAACGAGCCC TATGTCAATC AAGAAGCCGA GAACTCCTAT CTCGCGGTCT ATCTGTACAA GCAAATGGTC GAGCGGGCGA AGTCGACCAA GCGCGACGAC ATCCGCAAGG TGATCGCGCA AGGCGACGTC TGCATGGACG CGCCTGAAGG CAAGGTCTGC ATCGACCCGA AGAGCCAACA CATGTCGCAC ACCATCTATC TGGCCAAGGT CGGCGCCGAT CATTCCATCA CCTTTCCGAA GGTCTGGGAG GGCATCAAGC CGTATTGGCT CGGCGACGCC GGGTGCGACC TGACCAAGAA GGATCCGACG GCGCAGTACA CGCCGTCGAA TCCGCCGCCG AAGCCGTAA
|
Protein sequence | MNRTVVRGLH AAVLAGTLVL ASNMAFAETP IKLGVLEDQS GDFAVATIGK VHAIQLAAEE INKSGGIMGR PLELVIYDTQ SDNTRYQEFM RRVLQRDKAD VVFAGFSSAS REAYRPIVDQ LNGFAFYNNQ YEGGVCDGHM IVTGAVPEQQ FSTLIPYMMQ AYGKKVYTLA ADYNFGQISA EWVRKIVKEN GGEMVGEEFI PLGVSQFSQS IQNIQKAKPD FVVTLLVGTA QASYYEQAAS ANVNLPMASS VNVGQGYEHK RFKAPSLKDM YVTTNYIEEI DSPTAKAFLA KFKAKFPNEP YVNQEAENSY LAVYLYKQMV ERAKSTKRDD IRKVIAQGDV CMDAPEGKVC IDPKSQHMSH TIYLAKVGAD HSITFPKVWE GIKPYWLGDA GCDLTKKDPT AQYTPSNPPP KP
|
| |