Gene RPB_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1794 
Symbol 
ID3908875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2054427 
End bp2055749 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content63% 
IMG OID637883688 
Producturea short-chain amide or branched-chain amino acid uptake ABC transporter periplasmic solute-binding protein precursor 
Protein accessionYP_485413 
Protein GI86748917 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.326487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTG GATCAAAGCA TCATCAGGCG ACGCCGGTCA GCCGGCGCAA ATGGCTGGCG 
GCCGCCGCCG GTCTGGCCAT CGGCCTGTCG ACCTTCGCGC CCGCCAAGGC TGCCGACGAC
ACCATCAAGG TCGGCGTGCT GCACTCTCTG TCCGGCACCA TGGCGATCAG CGAAACCACG
CTGAAGGACA CGGTGCTGTT CCTGATCGAC GAGCAGAACA AGAAGGGCGG CGTGCTCGGC
AAGAAGCTCG AGGCGGTCGT GGTCGATCCG GCGTCGAACT GGCCGCTGTT CGCCGAGAAG
GCCCGCGAGC TGATCACCAA GGACAAGGTC TCGGTGGTGT TCGGCTGCTG GACCTCGGTG
TCGCGCAAAT CGGTGCTGCC GGTGTTCAAG GAGCTGAATT CGATCCTGTT CTATCCGGTG
CAGTACGAAG GCGAGGAGAG CGAGCGCAAC GTGTTCTACA CCGGCGCCGC GCCGAACCAG
CAGGCGATCC CGGCGGTCGA TTATCTCGCC AAGGAAGAAA AGGTGCAGCG CTGGGTGCTG
GCCGGCACCG ACTACGTCTA TCCGCGCACC ACGAATAAGA TTCTGGAAGC CTACTTGAAG
TCGAAAGGCG TCAAGCAGGA AGACATCATG ATCAATTACA CGCCGTTCGG TCATTCCGAC
TGGCAGACCA TCGTCGCCGA CATCAAGAAG TTCGGCTCGG CCGGCAAGAA GACCGCCGTG
GTCTCGACCA TCAACGGCGA CGCCAACGTG CCGTTCTACA AGGAACTCGG CAACCAGGGC
ATCAAGGCCA CCGACATTCC GGTGGTGGCG TTCTCGGTCG GCGAAGAAGA ACTCGCCGGC
ATCGACACCA AGCCGCTGCT CGGCCATCTC GCCGCCTGGA ACTACTTCCA GTCGATCAAG
GACCCCGAGA ACGAGAAGTT CATCAAGGCC TGGCAGGCCT ACACCAAGAA CCCGAAGCGC
GTGACCAACG ACCCGATGGA AGCCACCGTG ATCGGCTTCA ACATGTGGGT GAAGGCGGTC
GAGAAGGCCG GCACCGTCGA CGCCGACAAG GTGATCGACA CGCTGCCGGG CACCAAGGCG
CCGAACCTGA CCGGCGGCAC CTCGGAAATG CTGCCGAACC ACCACATCAC CAAGCCGGTG
TTCATCGGCG AGATCAAGGG TGACGGCCAG TTCGACGTGG TCTGGAAGAC CCCGGGCCTG
GTCCCCGGCG ACGCCTGGTC GAAGGAGCTC GACGGCTCCA AGGACCTGAT CGGCGACTGG
GTGACGCTGA AGTGCGGCAA CTACAACACC GTGACCAAGA AGTGCGGCGG CCAGGGGACC
TGA
 
Protein sequence
MRIGSKHHQA TPVSRRKWLA AAAGLAIGLS TFAPAKAADD TIKVGVLHSL SGTMAISETT 
LKDTVLFLID EQNKKGGVLG KKLEAVVVDP ASNWPLFAEK ARELITKDKV SVVFGCWTSV
SRKSVLPVFK ELNSILFYPV QYEGEESERN VFYTGAAPNQ QAIPAVDYLA KEEKVQRWVL
AGTDYVYPRT TNKILEAYLK SKGVKQEDIM INYTPFGHSD WQTIVADIKK FGSAGKKTAV
VSTINGDANV PFYKELGNQG IKATDIPVVA FSVGEEELAG IDTKPLLGHL AAWNYFQSIK
DPENEKFIKA WQAYTKNPKR VTNDPMEATV IGFNMWVKAV EKAGTVDADK VIDTLPGTKA
PNLTGGTSEM LPNHHITKPV FIGEIKGDGQ FDVVWKTPGL VPGDAWSKEL DGSKDLIGDW
VTLKCGNYNT VTKKCGGQGT