Gene RPB_4662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4662 
Symbol 
ID3912480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5273404 
End bp5274612 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID637886567 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_488256 
Protein GI86751760 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.841519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGA TCCGACGTTT GCCGCCGCTA TCCCGGCGTG CGCTGCTGAC CGGCGCCGCG 
GGCGCCGCCA CCATTGCAGT CGCGCCGCGC TTCGCCACGC CCGCCATCGC GCAGACCTCG
CCGCTCAAGG TCGGGCTGAT GCTGCCCTAC ACCGGCACGT TCGCGAAGCT CGGCCAGTTC
ATCGACGACG GCTTCCGGCT GCGCGTCGAG CAGGCCGGCG GTAAGCTCGG CGGGCGTGAT
GTGACCTTCG TGCAGGTCGA CGACGAGTCC AAGCCGGAGG CCGCCACCGA CAACATGAAT
CGCCTGGTCG GCCGTGAGAA GGTCGACGTC GTGGTCGGCA CCGTGCATTC CGGCGTCGCG
ATGGCGATGG TCAAGGTCGC GCGCGATAGC GGCACGCTGC TGATCATTCC CAACGCCGGC
GCCAACGACG CCACCGGACC GGCCTGCGCG CCGAACATCT TCCGCACCTC GTTCTCGAAC
TGGCAGACCA CCTTCCCGAT GGGCAAGGTG ATGGCGGACG CGGGCATCAA GAATGTCGTC
ACCATCACCT GGAAGTACAC CGCCGGCGCC GAAATGGTCG GCGCCTTCGC GGAGAACTTC
ACCAGGAACG GCGGCAAGAT CGTCGAGGAT CTGACGCTGC CGTTCCCGCA GGTCGAATTC
CAGGCGCTGA TCACGCGCAT CGCGCAGCTC AAACCCGACG CGGTGTTCAG CTTCTTCGCC
GGCGGCGGCG CGGTGAAATT CGTCAAGGAC TACGCCGCGG CGGGCCTCAA CAAGACGATT
CCGCTGTATG GCGCGGGCTT TCTCACCGAC GGCACCATCG AGGCGCAGGG CGAGGCGGCC
AACGGGATCA AGACGACGCT GCACTACGCC GACAATCTCG ACAACCCCGC CAACGTCGCC
TTCCTCAAGG CGTTCAAGGC CAAGACCCAG AAGGACGGCG ACATCTACGC GGTGCAGGGC
TTTGACGCCG CCGCGCTGCT CGATATCGGC CTCGGCGCGG TGAAGGGCGA TGCCGGCGCG
CGCGACACGA TGATCAAGGC GATGGCGGCG GCCAAGATCG ACAGTCCGCG CGGGCCGCTG
TCGTTCAACA AGGCGCACAA CCCGATCCAG AATATCTATC TGCGCGAGGT GAAGAACGGC
CGCAACGAAA TGGTGTCGAT CGCGCAAGCC GCTGTCGACG ACCCGGCGCG CGGCTGCAAG
ATGACGTGA
 
Protein sequence
MSTIRRLPPL SRRALLTGAA GAATIAVAPR FATPAIAQTS PLKVGLMLPY TGTFAKLGQF 
IDDGFRLRVE QAGGKLGGRD VTFVQVDDES KPEAATDNMN RLVGREKVDV VVGTVHSGVA
MAMVKVARDS GTLLIIPNAG ANDATGPACA PNIFRTSFSN WQTTFPMGKV MADAGIKNVV
TITWKYTAGA EMVGAFAENF TRNGGKIVED LTLPFPQVEF QALITRIAQL KPDAVFSFFA
GGGAVKFVKD YAAAGLNKTI PLYGAGFLTD GTIEAQGEAA NGIKTTLHYA DNLDNPANVA
FLKAFKAKTQ KDGDIYAVQG FDAAALLDIG LGAVKGDAGA RDTMIKAMAA AKIDSPRGPL
SFNKAHNPIQ NIYLREVKNG RNEMVSIAQA AVDDPARGCK MT