Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4662 |
Symbol | |
ID | 3912480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5273404 |
End bp | 5274612 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637886567 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_488256 |
Protein GI | 86751760 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.841519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGA TCCGACGTTT GCCGCCGCTA TCCCGGCGTG CGCTGCTGAC CGGCGCCGCG GGCGCCGCCA CCATTGCAGT CGCGCCGCGC TTCGCCACGC CCGCCATCGC GCAGACCTCG CCGCTCAAGG TCGGGCTGAT GCTGCCCTAC ACCGGCACGT TCGCGAAGCT CGGCCAGTTC ATCGACGACG GCTTCCGGCT GCGCGTCGAG CAGGCCGGCG GTAAGCTCGG CGGGCGTGAT GTGACCTTCG TGCAGGTCGA CGACGAGTCC AAGCCGGAGG CCGCCACCGA CAACATGAAT CGCCTGGTCG GCCGTGAGAA GGTCGACGTC GTGGTCGGCA CCGTGCATTC CGGCGTCGCG ATGGCGATGG TCAAGGTCGC GCGCGATAGC GGCACGCTGC TGATCATTCC CAACGCCGGC GCCAACGACG CCACCGGACC GGCCTGCGCG CCGAACATCT TCCGCACCTC GTTCTCGAAC TGGCAGACCA CCTTCCCGAT GGGCAAGGTG ATGGCGGACG CGGGCATCAA GAATGTCGTC ACCATCACCT GGAAGTACAC CGCCGGCGCC GAAATGGTCG GCGCCTTCGC GGAGAACTTC ACCAGGAACG GCGGCAAGAT CGTCGAGGAT CTGACGCTGC CGTTCCCGCA GGTCGAATTC CAGGCGCTGA TCACGCGCAT CGCGCAGCTC AAACCCGACG CGGTGTTCAG CTTCTTCGCC GGCGGCGGCG CGGTGAAATT CGTCAAGGAC TACGCCGCGG CGGGCCTCAA CAAGACGATT CCGCTGTATG GCGCGGGCTT TCTCACCGAC GGCACCATCG AGGCGCAGGG CGAGGCGGCC AACGGGATCA AGACGACGCT GCACTACGCC GACAATCTCG ACAACCCCGC CAACGTCGCC TTCCTCAAGG CGTTCAAGGC CAAGACCCAG AAGGACGGCG ACATCTACGC GGTGCAGGGC TTTGACGCCG CCGCGCTGCT CGATATCGGC CTCGGCGCGG TGAAGGGCGA TGCCGGCGCG CGCGACACGA TGATCAAGGC GATGGCGGCG GCCAAGATCG ACAGTCCGCG CGGGCCGCTG TCGTTCAACA AGGCGCACAA CCCGATCCAG AATATCTATC TGCGCGAGGT GAAGAACGGC CGCAACGAAA TGGTGTCGAT CGCGCAAGCC GCTGTCGACG ACCCGGCGCG CGGCTGCAAG ATGACGTGA
|
Protein sequence | MSTIRRLPPL SRRALLTGAA GAATIAVAPR FATPAIAQTS PLKVGLMLPY TGTFAKLGQF IDDGFRLRVE QAGGKLGGRD VTFVQVDDES KPEAATDNMN RLVGREKVDV VVGTVHSGVA MAMVKVARDS GTLLIIPNAG ANDATGPACA PNIFRTSFSN WQTTFPMGKV MADAGIKNVV TITWKYTAGA EMVGAFAENF TRNGGKIVED LTLPFPQVEF QALITRIAQL KPDAVFSFFA GGGAVKFVKD YAAAGLNKTI PLYGAGFLTD GTIEAQGEAA NGIKTTLHYA DNLDNPANVA FLKAFKAKTQ KDGDIYAVQG FDAAALLDIG LGAVKGDAGA RDTMIKAMAA AKIDSPRGPL SFNKAHNPIQ NIYLREVKNG RNEMVSIAQA AVDDPARGCK MT
|
| |