Gene RPB_4204 details


Gene Information

Locus tag: RPB_4204
Symbol:
ID: 3912012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4776167
End bp: 4777369
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 64%
IMG OID: 637886107
Product: extracellular ligand-binding receptor
Protein accession: YP_487806
Protein GI: 86751310
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
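
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4776167
END_BP = 4777369
GENE_LENGTH_BP = 1203
PROTEIN_LENGTH_AA = 400


def span_length(start: int, end: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1203
print(expected_protein_length(GENE_LENGTH_BP))  # 400
```

Both results agree with the Gene Length and Protein Length fields of this record.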


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.669867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCATC TTTCCATTGT TGCAGCTGCA GCATTCGTCG CTGCCGCCGT CCCGGCGCTG 
ACCGCATCGG TCGCCGCTCG CGCCGACGAT CTCAAGATCG CGCTGATCTA CGGCAAGACC
GGTCCGCTGG AGGCCTACGC CAAGCAGACC GAAACCGGCC TGATGATGGG GCTCGAATAC
GCGACCAAGG GCACGATGAC GCTCGACGGC CGCAAGATCA AGGTGATCAC CAAGGACGAC
CAGAGCAAGC CCGACCTCTC CAAGGCGGCA CTTGCGGAAG CGTACCAGGA CGACGGCGTC
GACATCGCGA TCGGCACCTC GTCGTCGGCC GCAGCACTCG CCGACCTGCC GGTCGCCGAG
GAAAACAAGA AGATCCTGAT CGTCGAGCCG GCGGTAGCCG ATCAGATTAC CGGCGAGAAG
TGGAATCGCT ACATCTTCCG CACCGGCCGC AACTCGTCGC AAGATGCGAT CTCCAACGCG
GTCGCGATCG GCAAGCCGGG CGTCACCATC GCCACGCTGG CACAGGACTA CGCGTTCGGC
CGCGACGGCG TCGCCGCCTT CAAGGAGGCG CTGGCCAAGA CCGGCGCGAC GCTCGCCGCC
GAGGAATACG TCCCGACCAC CACCACCGAC TTCACCGCGG TCGGCCAACG ACTGTTCGAC
ACGCTGAAGG ACAAGCCCGG CAAGAAGATC ATCTGGGTGA TCTGGGCCGG CGGCGGCGAT
CCGCTGACCA AGCTGCAGGA CATGGACCCG AAGCGCTACG GCATCGAGCT GTCGACCGGC
GGCAACATCC TGCCGGCGCT CGCCGCCTAC AAGCGCCTGC CCGGCATGGA AGGCGCGACC
TATTACTATT ACGACATCCC CAAGAACCCG ATCAACGACT GGCTGGTGAC CGAGCATCAG
AAGCGCTTCA ACGCACCGCC GGACTTCTTC ACCGCGGGCG GCTTCTCGGC CGCGATGGCG
GTGGTCACCG CCGTGCAGAA GGCGAAGTCG ACCGACACCG AGAAGCTGAT CGCGGCGATG
GAAGGCATGG AGTTCGACAC GCCGAAGGGC AAGATGATGT TCCGCAAGGA AGATCACCAG
GCGCTGCAGA GCATGTATCA CTTCAAGGTC AAGGTCGACC CGAACGTCGC CTGGGCCGTG
CTCGAGCCGG TGCGCGAACT GAAGATCGAG GACATGAATA TCCCGATCAA GAACAAGCGG
TGA
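
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is the percentage of G and C among the unambiguous bases; how the source database rounds or handles ambiguous bases is not stated, and `gc_percent` is a hypothetical helper name. The usage line runs on the first ten bases only; pasting in the full sequence works the same way, since spaces and line breaks are ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; other characters are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence above.
first_block = "GTGCGTCATC"
print(gc_percent(first_block))  # 60.0
```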
 
Protein sequence
MRHLSIVAAA AFVAAAVPAL TASVAARADD LKIALIYGKT GPLEAYAKQT ETGLMMGLEY 
ATKGTMTLDG RKIKVITKDD QSKPDLSKAA LAEAYQDDGV DIAIGTSSSA AALADLPVAE
ENKKILIVEP AVADQITGEK WNRYIFRTGR NSSQDAISNA VAIGKPGVTI ATLAQDYAFG
RDGVAAFKEA LAKTGATLAA EEYVPTTTTD FTAVGQRLFD TLKDKPGKKI IWVIWAGGGD
PLTKLQDMDP KRYGIELSTG GNILPALAAY KRLPGMEGAT YYYYDIPKNP INDWLVTEHQ
KRFNAPPDFF TAGGFSAAMA VVTAVQKAKS TDTEKLIAAM EGMEFDTPKG KMMFRKEDHQ
ALQSMYHFKV KVDPNVAWAV LEPVRELKIE DMNIPIKNKR
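
The gene sequence begins with GTG while the protein begins with M, which is the expected behavior for an alternative start codon under translation table 11. The sketch below illustrates this, assuming the standard codon assignments (which table 11 shares for elongation) and a simplified set of start codons (ATG, GTG, TTG); the table lists further rare initiators that are omitted here. `translate_cds` is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGCGTCATC TTTCCATTGT TGCAGCTGCA GCATTCGTCG CTGCCGCCGT CCCGGCGCTG"
print(translate_cds(first_row))  # MRHLSIVAAAAFVAAAVPAL
```

The output matches the first two ten-residue blocks of the protein sequence listed above.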