Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1556 |
Symbol | |
ID | 3908755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1754050 |
End bp | 1755330 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883452 |
Product | branched chain amino-acid ABC transporter substrate-binding protein |
Protein accession | YP_485177 |
Protein GI | 86748681 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA AATCACTGTT GAGCACGGCG TCGCTGGCGC TGCTGATCGC CGCGACGTCG GCCACCGCGC AGGCGCAGAT CGCGATCGGC CATCTCGCCG ATTATTCCGG CGGCACCTCG GACGTCGGCA CGCCCTACGG CCAGGCCGTC GCCGACACCT TCGCTTGGGT CAACAAGAAC GGCGGCGTCG GCGGCAAGCA GCTCAATGTC GACACCAACG ACTACGGCTA CCAGGTGCCG CGCGCGATCG CGCTGTACAA GAAATGGTCG GGCGGCGACA AGGTCGCGGC GATCATGGGC TGGGGCACCG CCGACACCGA GGCGCTGACC GGCTTCCTCG CCCAGGACAA GATCCCCGAC ATGTCGGGCT CCTACGCCGC GGCGCTGACC GACCCCGAAG GCACCAGCGG CAAGGCCAAG CCGGCGCCGT ACAACTTCTT CTATGGCCCG TCCTATTCCG ATGCGGTGCG CGCCGAACTG ATGTGGGCCG CCGAGGACTG GAAGGCCAAG GGCAAGACCG GCGCGCCGAA ATTCGTCCAC ATGGGCGCCA ATCATCCCTA CCCCAACGCG CCCAAGGCCG CCGGCGAAGC GCTCGCCAAG GAGCTCGGCT TCGAGGTGCT GCCGCCGCTG GTGTTCGCGC TGTCGCCGGG CGACTACAGC GCCCAGTGCC TCAGCCTGAA GAGCTCGGGC GCCAACTACG CCTATCTCGG CAACACCGCG GCCTCGAATA TTTCGGTGAT GAAGGCCTGC AAGGCCGCCG GCGTCGACGT GCAGTTCATG AGCAACGTCT GGGGCATGGA CGAGAACGCC GCCAAGGCCG CGGGCGACGC CGCCGATGGC GTGATCTTCC CGCTGCGGAC TGCGGTCGCC TGGGGCGGCA ACGCGCCCGG CATGAAGACG GTGGAGACGA TCTCCAAGAT GTCGGACCCG TCCGGCAATG TGTATCGGCC GGTGCATTAC GTCGCCGCGG TGTGCTCGGC GATGTACATG AAGGAAGCGC TCGACTGGGC CGCCAAGAAC GGCGGCGCCA CCGGCGAGAA CGTCGCCAAG GGCTTCTACC AGAAGAAGGA CTGGGTGCCG GCCGGGATGG AGGGCGTCTG CAACCCGTCG ACCTGGACCG CCAAGGACCA CCGCGGCACG ATGAAGATCG ACCTGTATCG CGCCAAGGTG TCGGGCGCGA CCGATGGCGA CCTCAAGGAC CTGATCGCGA AGGGCACGAT CAAGCTCGAA AAGGTCAAGA CCGTCGACCT GCCGCGCAAG CCGGAATGGT TCGGCTGGTG A
|
Protein sequence | MTMKSLLSTA SLALLIAATS ATAQAQIAIG HLADYSGGTS DVGTPYGQAV ADTFAWVNKN GGVGGKQLNV DTNDYGYQVP RAIALYKKWS GGDKVAAIMG WGTADTEALT GFLAQDKIPD MSGSYAAALT DPEGTSGKAK PAPYNFFYGP SYSDAVRAEL MWAAEDWKAK GKTGAPKFVH MGANHPYPNA PKAAGEALAK ELGFEVLPPL VFALSPGDYS AQCLSLKSSG ANYAYLGNTA ASNISVMKAC KAAGVDVQFM SNVWGMDENA AKAAGDAADG VIFPLRTAVA WGGNAPGMKT VETISKMSDP SGNVYRPVHY VAAVCSAMYM KEALDWAAKN GGATGENVAK GFYQKKDWVP AGMEGVCNPS TWTAKDHRGT MKIDLYRAKV SGATDGDLKD LIAKGTIKLE KVKTVDLPRK PEWFGW
|
| |