Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3877 |
Symbol | |
ID | 3911681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4431943 |
End bp | 4433163 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885778 |
Product | urea/short-chain binding protein of ABC transporter |
Protein accession | YP_487481 |
Protein GI | 86750985 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.188449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCCA AACCATTGGC GGTGGCGATA ATGACGGCCG CATGGCTGTC ACCGTCGGCC GCGTTCGCAC AAGCCTCGCC CCAGATTTCC GACGACGTCG TCAAGATCGG CGTGCTCACC GACATGAACG GCCCGGCCTC GACGCCGACC GGCCAGGGCT CGGTCACCGC CGCGCAGATG GCGGTCGAGG ATTTCGGCGG CAGCGTGCTC GGCAAGCCGA TCAGCATCAT CGTCGGCGAT CATCAGCTCA AGCCCGATAT CGGCGCCGCC CTGGCGCGGC GCTGGTACGA CGTCGAACAG GTCGATCTGA TCGTCGACGT GCCGGTCTCC GCCGTCGGCC TCGCGGTGCA GAACATCGCG GGCGAGAAGA AGCGGATGTT CATCACCCAT TCGACCGGCG CCGCCGATTT TCACGGCAAG TTCTGCTCGC CTTACGCGAT GCAATGGGTG TTCGACACCC GCGCGCTCGC GGTCGGCACC GCGCAGGAAG TGGTCAAGCG CGGCGGCGAC AGTTGGTTCT TCATCACCGA CGACTACGCT TTCGGCCAGT CGCTGGAGCG CGACGCCGCC GCGGTCGTCA CCAAGTCCGG CGGCAAGGTG CTGGGCGCGG TGCGGCCTCC TTTCGCGACG CCGGATCTAT CGTCCTTCGT GCTGCAGGCG CAGGCCTCGA AGGCCAAGAT CATCGGCATC GCCGGCGGCC CGCCGAACAA TATCAATGAA ATTAAAACCG GCGCCGAATT CGGCGTGTTC AAGGGAGGCC AGCAGATGGC GGCGCTGCTG GCGCTGATCA CCGACATCCA CTCGCTCGGC CTGCCGGCCG CGCAAGGCCT GCTGCTGACG ACGTCGTTCT ATTGGGACAT GGACGACAAG ACCCGGGAAT GGTCGAAGCG CTATTTCGCC AAGATGAACC GGATGCCGAC GATGTGGCAG GCCGGCGTGT ATTCGTCGAC CATGCACTAT CTGCAGGCGA TCAAGGACGC CGGCACCGAC GAGCCGCTGC AGGTCGCGGC CAAGATGCGC GAGAAGCCGA TCGAGGATTT CTTCTCCCGC AACGGCCGGC TGCGCGAGGA CGGGTTGATG GTGCACGATC TGATGCTGGT GCAGGTGAAG TCCCCGGAAG AGTCGAAATA TCCATGGGAC TATTACAAGA TCCTCGCCAA AATCTCCGGC GCCGAGGCGT TCGGTCCGCC CGACCCGGCC TGCCCGCTGG TCAAGAAATA G
|
Protein sequence | MVAKPLAVAI MTAAWLSPSA AFAQASPQIS DDVVKIGVLT DMNGPASTPT GQGSVTAAQM AVEDFGGSVL GKPISIIVGD HQLKPDIGAA LARRWYDVEQ VDLIVDVPVS AVGLAVQNIA GEKKRMFITH STGAADFHGK FCSPYAMQWV FDTRALAVGT AQEVVKRGGD SWFFITDDYA FGQSLERDAA AVVTKSGGKV LGAVRPPFAT PDLSSFVLQA QASKAKIIGI AGGPPNNINE IKTGAEFGVF KGGQQMAALL ALITDIHSLG LPAAQGLLLT TSFYWDMDDK TREWSKRYFA KMNRMPTMWQ AGVYSSTMHY LQAIKDAGTD EPLQVAAKMR EKPIEDFFSR NGRLREDGLM VHDLMLVQVK SPEESKYPWD YYKILAKISG AEAFGPPDPA CPLVKK
|
| |