Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4575 |
Symbol | |
ID | 6412259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4928690 |
End bp | 4929970 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714455 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_001993544 |
Protein GI | 192292939 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATGA AATCCCTGCT CAGCAGCGCC GCCCTGGCGC TGGTGATCGC CGGCACCTCA GCGACCGCGC AGGCGCAGAT CGCGATCGGC CACCTCGCCG ACTATTCGGG CGGCACCTCG GACGTCGGCA CGCCCTACGG CCAAGCGGTC GCCGACACCT TCGCCTGGGT CAACAAGAAC GGTGGCGTCG CCGGCAAGCA GCTCAACGTC GACACAAACG ACTACGGCTA CCAGGTGCCG CGCGCGATCG CGCTGTACAA GAAGTGGTCG GGTGGAGACA AGGTCGCGGC GATCATGGGC TGGGGCACCG CCGACACGGA GGCGCTGACC GGCTTCCTCG CCAACGACAA GATCCCCGAC CTGTCCGGCT CCTACGCAGC GGCGCTGACC GACCCCGAAG GCACCAGCGG CAAGGCCAAG CCCGCCCCCT ACAACTTCTT CTACGGCCCG TCCTATTCCG ACGCGCTACG CGCCGAGCTG ATGTGGGCGG CGGAAGACTG GAAGGCCAAG GGCAAGTCCG GCGCGCCGAA ATTCGTCCAC ATGGGCGCGA ACCATCCCTA CCCCAACGCT CCGAAGGCCG CCGGCGAAGC GATAGCCAAG GAGCTTGGCT TTGAAGTGCT ACCGCCGCTG GTGTTCGCGC TGGCGCCCGG CGACTACTCG GCGCAATGCC TGTCGCTGAA GAGCTCGGGC GCCAACTACG CCTACCTCGG CAACACCGCG GCCTCCAACA TCTCGGTGAT GAAGGCCTGC AAGGCGGCCG GCGTCGACGT CCAGTTCATG AGCAACGTCT GGGGCATGGA CGAGAACGCC GCCAAGACCG CCGGCGATGC CGCCGACGGC GTGATCTTCC CGCTGCGCAC AGCCGTGGCC TGGGGCGGCA AGGCGCCCGG CATGAAGACC GTGGAGGAAA TCTCGAAGAT CTCCGATCCG TCCGGCAACG TCTATCGCCC GGTGCACTAC GTCGCAGCAG TGTGCTCGGC GCTGTACATG AAGGAGGCGA TCGAGTGGGC GGCGAAGAAC GGCGGCGCCA CCGGTGAGAA CGTCGCCAAG GGCTTCTATC AGAAGGCCGA CTGGGTGCCG GCCGGCATGG AAGGCGTCTG CAACCCGTCG ACCTGGACCG CCAAGGACCA CCGCGGCACG ATGAAGATCG ACCTCTACCG CGCCAAGGTG TCGGGCCCGA CCGACGGCGA CCTCAAGGAC CTGATCGCCA AGGGCACCAT CAAGCTCGAG AAGGTCAAGA CCGTCGACCT GCCGCGCAAG CCGGAATGGG CCGGGTGGTG A
|
Protein sequence | MTMKSLLSSA ALALVIAGTS ATAQAQIAIG HLADYSGGTS DVGTPYGQAV ADTFAWVNKN GGVAGKQLNV DTNDYGYQVP RAIALYKKWS GGDKVAAIMG WGTADTEALT GFLANDKIPD LSGSYAAALT DPEGTSGKAK PAPYNFFYGP SYSDALRAEL MWAAEDWKAK GKSGAPKFVH MGANHPYPNA PKAAGEAIAK ELGFEVLPPL VFALAPGDYS AQCLSLKSSG ANYAYLGNTA ASNISVMKAC KAAGVDVQFM SNVWGMDENA AKTAGDAADG VIFPLRTAVA WGGKAPGMKT VEEISKISDP SGNVYRPVHY VAAVCSALYM KEAIEWAAKN GGATGENVAK GFYQKADWVP AGMEGVCNPS TWTAKDHRGT MKIDLYRAKV SGPTDGDLKD LIAKGTIKLE KVKTVDLPRK PEWAGW
|
| |