Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3572 |
Symbol | |
ID | 3911374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4095246 |
End bp | 4097015 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885474 |
Product | branched-chain amino acid transport system ATP-binding protein |
Protein accession | YP_487178 |
Protein GI | 86750682 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component [COG4177] ABC-type branched-chain amino acid transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.317873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGTT GGCTTCCCAT TCTGCTGTTC GCCGCCGTGA TGACGGCGCT GCCGCTGATC CCTGGCATGC CGCCGTTCTG GATCGTGCTC TTGGACAATA TCGGCCTCGC CGCTCTGGTG GCGATGGGCC TGGTGCTGCT GACCGGCGTC GGCGGCCTCA CCTCGTTCGG CCAGGCGGCG TTCTGCGGCT TCGGCGCCTA CACCACCGCC TATCTGGCCA CGGTCTACGG CGTCTCGCCG TGGCTGACGC TGCCGCTCTC GCTGTTGGTC GCCGGCACGG CCGCGGTGCT GCTCGGGCTG ATGACGGTGC GGCTCAGCGG CCATTATCTG CCGCTCGGCA CCATCGCCTG GGGCATCTCG CTGTACTACC TGTTCAGCAA GCTGGATTTT CTCGGCCGCA ATGACGGCAT CTCGCAAGTG CCGCCGCTGT CGATCGGATC GCTGCAAATG TTCGATCCCG AGACGATCTA TTTCGTGATC TGGGGCATGG TACTGATCAG CGCGGTGCTG ACCATGAACC TGCTCGACAG CCGCACCGGC CGCGCCATCC GCGCGCTGCG CCGCGGGCAC ATTGCGGCGG AAGCCTTCGG CGTGCAGACC GCGCGGGCGA AGCTGCTGGT GTTCATCTAT GCGGCGGTGC TGGCCGGCCT GTCCGGCTGG CTCTACGCGC ACTTCCAGCG CAACGTGAAC CCGACCCCGT TCGGCCCGCA GGCCGGCATC GAGTATCTGT TCATCGCCGT GGTCGGCGGC GCCGGCTATG TCTGGGGCGG CGTGCTCGGC GCTGCGATCG TGATCATCCT GAAAGAAGTT CTGCAGGGCT ATCTGCCGAT GCTGTTCGGC GGCCAGGGTC AGCTCGAGAT CATCGTGTTC GGCATCCTGC TGGTGTTGCT GCTGCAGCTC GCGCCCGGTG GCGTCTGGCC GTGGCTGAGC AGTCTGATCC CGATCAAGAT CCGACGCTCC CGGGTGGACG CCAGCGCCGC GCTCACGCCC CGCGAACGCA CGCCGGGCGC GACCGGCCCG CTGCTCAAAG TCGAACGCGC GCGAAAGCAG TTCGGCGGCG TGGTCGCGGT CAACGACGTC TCGTTCGAGG TCGATGCGCG CGAGATCGTC GCGCTGATCG GACCGAACGG CGCCGGCAAG TCGACCACCT TCAACCTGAT CACCGGCATT CTCACCACCA CCGGCGGCAG GATCGAGCTG CACGGCAAGC CGATCGACAA CGCGCCGCCG CAGGACGTGG TGCAGCTCGG CATCGCCCGC ACCTTCCAGC ACGTCAAGCT GGTGCCGGAC ATGACCGTAC TGGAGAACGT CGCGATCGGC GCGCATCTGC GCGGCAATGC CGGCGCGCTG ACCAGCATGC TGCGGCTCGA CCGCGCCGAC GAGGCCAAGC TGCTCGCCGA AGCCACCCGG CAGATCGAGC GCGTCGGGCT CGGCGGCGAG ATCGATCAGC TCGCAGGCAG TCTGTCGCTC GGTCAGCAGC GCATCGTCGA GATCGCGCGC GCGCTGTGCG CCGATCCGAT GCTGCTGCTA CTCGACGAAC CCGCCGCGGG CCTGCGCCAC ATGGAGAAAC AGCGCCTCGC CGCATTGCTG CGCCAGTTGC GCGACGGCGG CATGTCGGTG CTGCTGGTCG AACACGACAT GGGCTTCGTG ATGGATCTCG CCGACCGCAT CGTCGTGCTG GACTTCGGCA CGCGGATCGC CGAGGGCACG CCCGACGCGA TCAAGACCAA TCCCGAAGTC ATCAAGGCCT ATCTCGGAGC CCTGGCATGA
|
Protein sequence | MPRWLPILLF AAVMTALPLI PGMPPFWIVL LDNIGLAALV AMGLVLLTGV GGLTSFGQAA FCGFGAYTTA YLATVYGVSP WLTLPLSLLV AGTAAVLLGL MTVRLSGHYL PLGTIAWGIS LYYLFSKLDF LGRNDGISQV PPLSIGSLQM FDPETIYFVI WGMVLISAVL TMNLLDSRTG RAIRALRRGH IAAEAFGVQT ARAKLLVFIY AAVLAGLSGW LYAHFQRNVN PTPFGPQAGI EYLFIAVVGG AGYVWGGVLG AAIVIILKEV LQGYLPMLFG GQGQLEIIVF GILLVLLLQL APGGVWPWLS SLIPIKIRRS RVDASAALTP RERTPGATGP LLKVERARKQ FGGVVAVNDV SFEVDAREIV ALIGPNGAGK STTFNLITGI LTTTGGRIEL HGKPIDNAPP QDVVQLGIAR TFQHVKLVPD MTVLENVAIG AHLRGNAGAL TSMLRLDRAD EAKLLAEATR QIERVGLGGE IDQLAGSLSL GQQRIVEIAR ALCADPMLLL LDEPAAGLRH MEKQRLAALL RQLRDGGMSV LLVEHDMGFV MDLADRIVVL DFGTRIAEGT PDAIKTNPEV IKAYLGALA
|
| |