Gene RPB_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2110 
Symbol 
ID3908524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2400635 
End bp2401795 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID637884003 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_485727 
Protein GI86749231 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.417803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0323515 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGAC GGTTCGGTCC GCTGGCGTCC ATCGCCGTCA CGTGCGCGAC TGTGATTGCA 
ACTCCAGCGC GTGCCGACGA AGCGAAACTG CCGGCGACGT TGGCGTTTAC CGCCTATGAC
ACCGGTACCG CGGGCTTCAA CATCGCGGTC GCCGTCGGCA AGATGATGAA GGACAAGCTC
GCCACCGACG TGCGCGTGCT GCCGGCCGGC AACGACGTCG CCCGCCTCGC GCCGCTGCGC
GCCGGCCGCG CGCAACTGGC GTGGATGGGA TCTGGCACCT ACTTCGCGCA GGAAGGCGTG
TTTGAATTCG GCGCCAAGGA ATGGGGCCCG CAGCCGCTGC AGATCACCCT CAGCGTGGTC
GATTGCAACG GCTCGTCGCT CGGCGTCGCC AAGGACGCCG GCGTCAAGGA GATCAAGGAC
CTGCGCGGCA AGCGCGTCGG CTTCGTGGTC GGCTCGCCGG CGCTGAACCA GAACGCGCTC
GGCATCATCG CCTTCGGCGA TCTCACCCAG AAGGACGTCA AGATCGTCGA ATTCGCCAGC
TACGGCGCGA TGTGGAAAGG CCTTGTCAAC AACGACGTCG ACGCCGCCTT CGGCACCACC
ATCACCGGGC CGGCGAAGGA AGCCGAGAAC TCGCCGCGCG GGCTGATCTG GCCGCCGATG
CCGCACGCCG ACAAGGCGGC GTGGGAGCGC GTCAAGAAGG TCGCGCCGTT CTTCAATCCG
CACATGTCGA CCTGTGGCGC CGCGCATCAG CCGGGCAAGC CGATCGAACT GTCGAACTAC
CCCTACCCGA TCGCGACCGT CTATGCGACG CAGCCGGCGG ATCAGGTCCA CGCCATCACC
AAGGCGATGA TCGTGAACTA CGACGTCTAC AAGGACAGCG CCCCGGGCGC CACCGGCCTC
GCGGTGAAGA CCCAGACCAT GAAATGGGTG GTGCCGTTCC ACCCCGGCGC GGTGAAGGCG
CTGCAGGAGG CCGGGCAATG GAGCCCCGAG GATCAGGCGC ACAATGACGC CCTGATCAAG
CGCCAGGCCG TGCTCGCCGC AGCATGGAAG GACTATACCG CCTCCGCGCC GTCCGGCGAC
AAGGAGTTCG TCGACGGCTG GACGGCCGCG CGCGCCGCGG CGCTGCAGAA GGCGAAGATG
CCGAACGGAT TCGCGCAGTA G
 
Protein sequence
MIRRFGPLAS IAVTCATVIA TPARADEAKL PATLAFTAYD TGTAGFNIAV AVGKMMKDKL 
ATDVRVLPAG NDVARLAPLR AGRAQLAWMG SGTYFAQEGV FEFGAKEWGP QPLQITLSVV
DCNGSSLGVA KDAGVKEIKD LRGKRVGFVV GSPALNQNAL GIIAFGDLTQ KDVKIVEFAS
YGAMWKGLVN NDVDAAFGTT ITGPAKEAEN SPRGLIWPPM PHADKAAWER VKKVAPFFNP
HMSTCGAAHQ PGKPIELSNY PYPIATVYAT QPADQVHAIT KAMIVNYDVY KDSAPGATGL
AVKTQTMKWV VPFHPGAVKA LQEAGQWSPE DQAHNDALIK RQAVLAAAWK DYTASAPSGD
KEFVDGWTAA RAAALQKAKM PNGFAQ