Gene RPB_3877 details

Gene Information

Locus tag: RPB_3877
Symbol:
ID: 3911681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4431943
End bp: 4433163
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 65%
IMG OID: 637885778
Product: urea/short-chain binding protein of ABC transporter
Protein accession: YP_487481
Protein GI: 86750985
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
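
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4431943
end_bp = 4433163

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not code for a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1221, matching "Gene Length"
print(protein_length)  # 406, matching "Protein Length"
```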


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.188449
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCGCCA AACCATTGGC GGTGGCGATA ATGACGGCCG CATGGCTGTC ACCGTCGGCC 
GCGTTCGCAC AAGCCTCGCC CCAGATTTCC GACGACGTCG TCAAGATCGG CGTGCTCACC
GACATGAACG GCCCGGCCTC GACGCCGACC GGCCAGGGCT CGGTCACCGC CGCGCAGATG
GCGGTCGAGG ATTTCGGCGG CAGCGTGCTC GGCAAGCCGA TCAGCATCAT CGTCGGCGAT
CATCAGCTCA AGCCCGATAT CGGCGCCGCC CTGGCGCGGC GCTGGTACGA CGTCGAACAG
GTCGATCTGA TCGTCGACGT GCCGGTCTCC GCCGTCGGCC TCGCGGTGCA GAACATCGCG
GGCGAGAAGA AGCGGATGTT CATCACCCAT TCGACCGGCG CCGCCGATTT TCACGGCAAG
TTCTGCTCGC CTTACGCGAT GCAATGGGTG TTCGACACCC GCGCGCTCGC GGTCGGCACC
GCGCAGGAAG TGGTCAAGCG CGGCGGCGAC AGTTGGTTCT TCATCACCGA CGACTACGCT
TTCGGCCAGT CGCTGGAGCG CGACGCCGCC GCGGTCGTCA CCAAGTCCGG CGGCAAGGTG
CTGGGCGCGG TGCGGCCTCC TTTCGCGACG CCGGATCTAT CGTCCTTCGT GCTGCAGGCG
CAGGCCTCGA AGGCCAAGAT CATCGGCATC GCCGGCGGCC CGCCGAACAA TATCAATGAA
ATTAAAACCG GCGCCGAATT CGGCGTGTTC AAGGGAGGCC AGCAGATGGC GGCGCTGCTG
GCGCTGATCA CCGACATCCA CTCGCTCGGC CTGCCGGCCG CGCAAGGCCT GCTGCTGACG
ACGTCGTTCT ATTGGGACAT GGACGACAAG ACCCGGGAAT GGTCGAAGCG CTATTTCGCC
AAGATGAACC GGATGCCGAC GATGTGGCAG GCCGGCGTGT ATTCGTCGAC CATGCACTAT
CTGCAGGCGA TCAAGGACGC CGGCACCGAC GAGCCGCTGC AGGTCGCGGC CAAGATGCGC
GAGAAGCCGA TCGAGGATTT CTTCTCCCGC AACGGCCGGC TGCGCGAGGA CGGGTTGATG
GTGCACGATC TGATGCTGGT GCAGGTGAAG TCCCCGGAAG AGTCGAAATA TCCATGGGAC
TATTACAAGA TCCTCGCCAA AATCTCCGGC GCCGAGGCGT TCGGTCCGCC CGACCCGGCC
TGCCCGCTGG TCAAGAAATA G
 
Protein sequence
MVAKPLAVAI MTAAWLSPSA AFAQASPQIS DDVVKIGVLT DMNGPASTPT GQGSVTAAQM 
AVEDFGGSVL GKPISIIVGD HQLKPDIGAA LARRWYDVEQ VDLIVDVPVS AVGLAVQNIA
GEKKRMFITH STGAADFHGK FCSPYAMQWV FDTRALAVGT AQEVVKRGGD SWFFITDDYA
FGQSLERDAA AVVTKSGGKV LGAVRPPFAT PDLSSFVLQA QASKAKIIGI AGGPPNNINE
IKTGAEFGVF KGGQQMAALL ALITDIHSLG LPAAQGLLLT TSFYWDMDDK TREWSKRYFA
KMNRMPTMWQ AGVYSSTMHY LQAIKDAGTD EPLQVAAKMR EKPIEDFFSR NGRLREDGLM
VHDLMLVQVK SPEESKYPWD YYKILAKISG AEAFGPPDPA CPLVKK
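
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below illustrates that mapping and a GC fraction calculation using only the Python standard library. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the sketch makes a simplifying assumption: it applies the codon assignments of table 11 (which are the same as the standard code) to every codon, without the special handling of alternative start codons that table 11 also defines. That does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGGTCGCCA AACCATTGGC GGTGGCGATA"))  # MVAKPLAVAI
print(translate("TGCCCGCTGG TCAAGAAATA G"))           # CPLVKK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be compared against the record.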