Gene RPB_3572 details


Gene Information

Locus tag: RPB_3572
Symbol:
ID: 3911374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4095246
End bp: 4097015
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 67%
IMG OID: 637885474
Product: branched-chain amino acid transport system ATP-binding protein
Protein accession: YP_487178
Protein GI: 86750682
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
        [COG4177] ABC-type branched-chain amino acid transport system, permease component
TIGRFAM ID:
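
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4095246
end_bp = 4097015
reported_gene_length = 1770      # bp
reported_protein_length = 589    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values agree with the reported gene and protein lengths.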


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.317873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCGTT GGCTTCCCAT TCTGCTGTTC GCCGCCGTGA TGACGGCGCT GCCGCTGATC 
CCTGGCATGC CGCCGTTCTG GATCGTGCTC TTGGACAATA TCGGCCTCGC CGCTCTGGTG
GCGATGGGCC TGGTGCTGCT GACCGGCGTC GGCGGCCTCA CCTCGTTCGG CCAGGCGGCG
TTCTGCGGCT TCGGCGCCTA CACCACCGCC TATCTGGCCA CGGTCTACGG CGTCTCGCCG
TGGCTGACGC TGCCGCTCTC GCTGTTGGTC GCCGGCACGG CCGCGGTGCT GCTCGGGCTG
ATGACGGTGC GGCTCAGCGG CCATTATCTG CCGCTCGGCA CCATCGCCTG GGGCATCTCG
CTGTACTACC TGTTCAGCAA GCTGGATTTT CTCGGCCGCA ATGACGGCAT CTCGCAAGTG
CCGCCGCTGT CGATCGGATC GCTGCAAATG TTCGATCCCG AGACGATCTA TTTCGTGATC
TGGGGCATGG TACTGATCAG CGCGGTGCTG ACCATGAACC TGCTCGACAG CCGCACCGGC
CGCGCCATCC GCGCGCTGCG CCGCGGGCAC ATTGCGGCGG AAGCCTTCGG CGTGCAGACC
GCGCGGGCGA AGCTGCTGGT GTTCATCTAT GCGGCGGTGC TGGCCGGCCT GTCCGGCTGG
CTCTACGCGC ACTTCCAGCG CAACGTGAAC CCGACCCCGT TCGGCCCGCA GGCCGGCATC
GAGTATCTGT TCATCGCCGT GGTCGGCGGC GCCGGCTATG TCTGGGGCGG CGTGCTCGGC
GCTGCGATCG TGATCATCCT GAAAGAAGTT CTGCAGGGCT ATCTGCCGAT GCTGTTCGGC
GGCCAGGGTC AGCTCGAGAT CATCGTGTTC GGCATCCTGC TGGTGTTGCT GCTGCAGCTC
GCGCCCGGTG GCGTCTGGCC GTGGCTGAGC AGTCTGATCC CGATCAAGAT CCGACGCTCC
CGGGTGGACG CCAGCGCCGC GCTCACGCCC CGCGAACGCA CGCCGGGCGC GACCGGCCCG
CTGCTCAAAG TCGAACGCGC GCGAAAGCAG TTCGGCGGCG TGGTCGCGGT CAACGACGTC
TCGTTCGAGG TCGATGCGCG CGAGATCGTC GCGCTGATCG GACCGAACGG CGCCGGCAAG
TCGACCACCT TCAACCTGAT CACCGGCATT CTCACCACCA CCGGCGGCAG GATCGAGCTG
CACGGCAAGC CGATCGACAA CGCGCCGCCG CAGGACGTGG TGCAGCTCGG CATCGCCCGC
ACCTTCCAGC ACGTCAAGCT GGTGCCGGAC ATGACCGTAC TGGAGAACGT CGCGATCGGC
GCGCATCTGC GCGGCAATGC CGGCGCGCTG ACCAGCATGC TGCGGCTCGA CCGCGCCGAC
GAGGCCAAGC TGCTCGCCGA AGCCACCCGG CAGATCGAGC GCGTCGGGCT CGGCGGCGAG
ATCGATCAGC TCGCAGGCAG TCTGTCGCTC GGTCAGCAGC GCATCGTCGA GATCGCGCGC
GCGCTGTGCG CCGATCCGAT GCTGCTGCTA CTCGACGAAC CCGCCGCGGG CCTGCGCCAC
ATGGAGAAAC AGCGCCTCGC CGCATTGCTG CGCCAGTTGC GCGACGGCGG CATGTCGGTG
CTGCTGGTCG AACACGACAT GGGCTTCGTG ATGGATCTCG CCGACCGCAT CGTCGTGCTG
GACTTCGGCA CGCGGATCGC CGAGGGCACG CCCGACGCGA TCAAGACCAA TCCCGAAGTC
ATCAAGGCCT ATCTCGGAGC CCTGGCATGA
 
Protein sequence
MPRWLPILLF AAVMTALPLI PGMPPFWIVL LDNIGLAALV AMGLVLLTGV GGLTSFGQAA 
FCGFGAYTTA YLATVYGVSP WLTLPLSLLV AGTAAVLLGL MTVRLSGHYL PLGTIAWGIS
LYYLFSKLDF LGRNDGISQV PPLSIGSLQM FDPETIYFVI WGMVLISAVL TMNLLDSRTG
RAIRALRRGH IAAEAFGVQT ARAKLLVFIY AAVLAGLSGW LYAHFQRNVN PTPFGPQAGI
EYLFIAVVGG AGYVWGGVLG AAIVIILKEV LQGYLPMLFG GQGQLEIIVF GILLVLLLQL
APGGVWPWLS SLIPIKIRRS RVDASAALTP RERTPGATGP LLKVERARKQ FGGVVAVNDV
SFEVDAREIV ALIGPNGAGK STTFNLITGI LTTTGGRIEL HGKPIDNAPP QDVVQLGIAR
TFQHVKLVPD MTVLENVAIG AHLRGNAGAL TSMLRLDRAD EAKLLAEATR QIERVGLGGE
IDQLAGSLSL GQQRIVEIAR ALCADPMLLL LDEPAAGLRH MEKQRLAALL RQLRDGGMSV
LLVEHDMGFV MDLADRIVVL DFGTRIAEGT PDAIKTNPEV IKAYLGALA
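
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how one might do that and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and it is an assumption that the GC content field in this record was derived from the gene sequence in this way. Only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCGCGTT GGCTTCCCAT TCTGCTGTTC GCCGCCGTGA TGACGGCGCT GCCGCTGATC"
seq = clean_sequence(first_line)

# Print the cleaned length and the GC percentage of this fragment.
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence instead of `first_line` would give the GC fraction for the whole CDS.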