Gene RPB_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0222 
Symbol 
ID3909464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp250038 
End bp251102 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID637882104 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_483844 
Protein GI86747348 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAG CCGCAGAACT CAGCCACGGC CTCGAGCCGA TCGAGGATAT CGGCGGCGCC 
GCGCAGCCGC TGCTGGACGT CAGAGGTCTC ACCAAGCACT TCCCGGTGCG CGGCGGGCTG
TTCAGCGCCG CCAAGACGGT GCGGGCGGTC GACGACGTTT CGTTCGCGAT CGCCAAGGGC
GAAACCGTCG GCATCGTCGG CGAATCCGGC TGCGGCAAGT CGACCACGGC GCGGCTGTTG
ATGCATCTGA TGCCGCGCAA CGCCGGCGAC ATCATCTATG ACGGCCGCGC CGTCGGCCGC
GAATTATCGC TGCGCGAGCT GCGGCGCGGG ATGCAGATGG TGTTTCAGGA CAGCTACGCG
TCGCTCAACC CGCGCCTCAC CATCGAGGAG TCGATCGCGT TCGGTCCGAA GGTGCACGGC
ATGGCCGATG GCACCGCGCG AGCGCTGGCG CGCGAGCTGC TCGGCAAGGT CGGGCTGCGG
CCGGAGAATT TCGCCAATCG CTACCCGCAC GAAGTCTCCG GCGGCCAGCG CCAGCGCGTC
AACATCGCCC GCGCGCTGGC GCTGTCGCCG CGGCTGGTGA TCCTCGACGA AGCGGTGTCG
GCGCTCGACA AATCGGTCGA GGCGCAGGTG CTGAATCTGC TGGTCGACCT CAAGCGCGAA
TTCGGCCTGA CCTATCTGTT CATCAGCCAC GATCTCAACG TGGTGCGCTA CATCTCGGAT
CGCGTGCTGG TGATGTATCT GGGCGAAGTC GTCGAGCTCG GGCCGGTCGA CAAGGTCTGG
GATCAGCCGG CGCATCCCTA TACGCGCGCG CTGCTGGCGG CGATGCCGTC GTCGGATCCC
GACCGCCGCA CCGAAGTGCC GCCGATTTCC GGCGATCCGC CGAACCCGAT CGATCCGCCG
TCCGGCTGCC GGTTTCACAC CCGCTGCCCG TTCGCGGAGC CTTTGTGCGG CGGCGACGCG
CCGAAGCTGA CCGCGCTGGA CCCGAGCGGC CACCAGGCGG CATGCTACAT GGCGATCCCC
GGCTCGGGCC ACAGCCGGGC GCCGAAAGTG ATGGAGACGA CATGA
 
Protein sequence
MTQAAELSHG LEPIEDIGGA AQPLLDVRGL TKHFPVRGGL FSAAKTVRAV DDVSFAIAKG 
ETVGIVGESG CGKSTTARLL MHLMPRNAGD IIYDGRAVGR ELSLRELRRG MQMVFQDSYA
SLNPRLTIEE SIAFGPKVHG MADGTARALA RELLGKVGLR PENFANRYPH EVSGGQRQRV
NIARALALSP RLVILDEAVS ALDKSVEAQV LNLLVDLKRE FGLTYLFISH DLNVVRYISD
RVLVMYLGEV VELGPVDKVW DQPAHPYTRA LLAAMPSSDP DRRTEVPPIS GDPPNPIDPP
SGCRFHTRCP FAEPLCGGDA PKLTALDPSG HQAACYMAIP GSGHSRAPKV METT