Gene RPB_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0100 
Symbol 
ID3909686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp107004 
End bp108446 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID637881981 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_483723 
Protein GI86747227 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCA AGCGACTATT CGCCCGGTCG ATGGACGAGC CCGAGCCGGT GGACCCTTTG 
CTGCATATGT CTGCGCGCCC GGTGCGACGC AAGACGACGC TGATTTCGCT GGCCGCGATC
CTGGCGCTGG TCGGCGCCAT CAGTGCTGCG TATTATTTCG CGATGCGGCC GACGACGCTG
AAGATCGCCG TCGGTCCGCA GGCCAGCGAC GATCTGCGGC TGATCCAGGC GCTGGCGCAG
GCGTTCTCGC GCGAGCGCAA CATCGTCCGG ATGCGGCCCA TCGTCACCGA CGGCCCGGCC
GCCAGCGCCG CGGCGCTGAA ATCAGACACC ACCGATCTTG CGGTGATCCG CGGCGATCTG
CCGGTGCCGC GCAACGCCCG CTCGGTCGCG GTGCTGCACA AGAACGTCGC CGTGCTGTGG
GCGCCCGGCC GGCCCGCGAG CGGCAAACGC AGGAAAGCCG CTGTCGCCGG GGTCACTACC
ATCACCCAGC TGGTCGGCAA GCGCGTCGGC GTGATCGGCC GCACCGAAGC CAATGCGGGG
CTGCTCGCGG TGATCCTGCG CCAATACGGC ATCGACCCCG CCAAGGTCGA GAGCGTGCAG
CTCACCGCGG CCGACGTCGC AGAGGCGGTG CGGACCGGCA AGGCCGACGC GTTTCTCGCG
GCCGGACCGC TCAACAGCAA GGTGATCGGC GAGGCTTTGG CGGCCACTGC GAGTTCCGGC
CGGGAGCCGG TGTTCCTCGG CATCGATTCG TCCGAAGCGC TGGCCGCCAA CCATCCGTCG
TATGAATCGG CGTCGATCCC CGCCGGCGCC TTCGGCGGCG CCCCGGCGCG GCCGGGCGAC
GACGTCAAGA CCATCAGCTT CTCGCATTAC ATCGTGGCGC GCGACGGCGT CTCCGACGCC
ACCATCGCGA GTTTCACCCA ACAATTGTTC ACGGCGCGCC AGACCGTGAT GACGGAGAAT
CCGCTGGCCG CGAAGATCGA GACGCCCGAC ACCGACAAGG ACGCGGTGAT CCCGGTGCAG
GCGGGTGCCG CTGCCTATGT CGACGGCGAG CAGCGCAGCT TTCTCGACCG CTACAGTGAC
CTGATCTGGT TTTCGCTGAT GGGGCTGTCG GCGACCGGCT CGCTCGGCGC CTGGTTCGCG
AGCTATCTGC GGAAAGACGA ACGCAACACC AACGCCTCGC AGCGCGACCG GCTGCTCGAC
ATGCTGGCGG CGGCGCGGCG CTGCGACGCG CAGGACGAAC TCGACGCGAT GCAGACCGAG
GCCGATGCGA TCCTGCGCGA CGCGCTGAAC TGCTACGAGA ACGGCGCGAT CGACAGCGCG
GCGCTGACCG CCTTCAGCAT CGCGCTGGAG CAGTTCCACA ACGCCGTGGT CGATCGCAAG
ATGCTGCTGG CGGCGATCCC GCCGGCGCCA CCGATCCGCC CGGCGCGGCC GCAGGTGGTG
TGA
 
Protein sequence
MDFKRLFARS MDEPEPVDPL LHMSARPVRR KTTLISLAAI LALVGAISAA YYFAMRPTTL 
KIAVGPQASD DLRLIQALAQ AFSRERNIVR MRPIVTDGPA ASAAALKSDT TDLAVIRGDL
PVPRNARSVA VLHKNVAVLW APGRPASGKR RKAAVAGVTT ITQLVGKRVG VIGRTEANAG
LLAVILRQYG IDPAKVESVQ LTAADVAEAV RTGKADAFLA AGPLNSKVIG EALAATASSG
REPVFLGIDS SEALAANHPS YESASIPAGA FGGAPARPGD DVKTISFSHY IVARDGVSDA
TIASFTQQLF TARQTVMTEN PLAAKIETPD TDKDAVIPVQ AGAAAYVDGE QRSFLDRYSD
LIWFSLMGLS ATGSLGAWFA SYLRKDERNT NASQRDRLLD MLAAARRCDA QDELDAMQTE
ADAILRDALN CYENGAIDSA ALTAFSIALE QFHNAVVDRK MLLAAIPPAP PIRPARPQVV