Gene RPB_0141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0141 
Symbol 
ID3908112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp154064 
End bp155284 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content64% 
IMG OID637882023 
Productextracellular ligand-binding receptor 
Protein accessionYP_483764 
Protein GI86747268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.202322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCACG CGTGGGTTGC TCGCTGCGTT CGCCTCTCGA TCCCGCTCGG CATTGCCGCG 
CTGGCATCCG GAATCCAGAT ATCCGCAGCG TCGGCCGACG ACGTCATCAA GATCGGCGCG
CCGCTGCCGA TCACCGGGCC GCTGGCGCCG GAAGCCATCA AGCAGCAGCA GGGCTACAAT
CTGTGGGCCG AACAGGCCAA CAAGGCCGGC GGCATTTCGG TCGGCGGCAA GAAGTACAAG
GTCGAGATCG TCTACACCGA CTACCAGTCG AACACGCCGC GCGCGGTGCA GGCGACCGAG
CAGCTGATCA CCCAGAACAA CGTCAACTTC GTGTTCTCGC CGTTCGGCTC CGGTGCGGCG
AAGGCGGCCA GCACGGTGTC GGAAAAGCAC AAGGTGCCGA CGCTGGCGGC GACCGCCTCG
TCGTCCCAGG TCTACGACCA GGGCTACAAA TATCTGTTCG GCACCTTCAC CCCGAACGAC
ACCCTGACCA CGCCGCTGAC CGAAATGATC AAGGCCAAGG TGCCCGAGGT CAAGAAGGTC
GCGATCCTCG CCCGCAACGA TCTGTTCCCG CTGGCGATCG CGCAGGAGAT GGAGAAGTCG
GCCAAGGCCA ACGGCCTCGA GGTGGTGTAT TTCGAGAAAT ACGCGATCGG CACGCTCGAC
CATTCCGCCA CGCTGTCGTC GATCAAGGCG CAGTCGCCGC AGTGGATCTT CGTCACCGGC
TACACCAACG ACCTGCTGCT GGTGCGCAAG CAGATGATCG ACCAGCAGAT GAAGGCCCCG
GTGGTCTCGA TGATCGCCGG CCCGGCCTAT CAGGAGTTCA TCGACGCGCT CGGCAAGGGG
GCCGAGAACG TCTCGAGCGC CGCCTGGTGG CATCCGGCCG CGCGCTATGA CGGCAAGGAC
ATCTTTGGCT CCACCGCCAA TTTCGTGAAG CTGTTCAAGG ACAAGTACAA CGCCGAACCG
GACTACGCGC ATGCTTCGGC GGCGCTGTGC GGCGCGCTGT TCCAGATCGC GATCGAGAAG
GCCGGTTCGA TCGATCGCGA CAAGGTGCGC GACGAACTCG CCAAGATGGA CGTCGTCACC
TTCTTCGGCC CGGTCAAGTT CGGCGCCAAC GGCCAGATCA ACTCGCTCGA CCCGCCGGTC
TTCCAGATCC AGGGCGGCAA GCCGGTGGTG CTGTTCCCGC AGGCGATCAA GCAAGGCGAC
CTCAAGATCG GCCTCGAGTA A
 
Protein sequence
MLHAWVARCV RLSIPLGIAA LASGIQISAA SADDVIKIGA PLPITGPLAP EAIKQQQGYN 
LWAEQANKAG GISVGGKKYK VEIVYTDYQS NTPRAVQATE QLITQNNVNF VFSPFGSGAA
KAASTVSEKH KVPTLAATAS SSQVYDQGYK YLFGTFTPND TLTTPLTEMI KAKVPEVKKV
AILARNDLFP LAIAQEMEKS AKANGLEVVY FEKYAIGTLD HSATLSSIKA QSPQWIFVTG
YTNDLLLVRK QMIDQQMKAP VVSMIAGPAY QEFIDALGKG AENVSSAAWW HPAARYDGKD
IFGSTANFVK LFKDKYNAEP DYAHASAALC GALFQIAIEK AGSIDRDKVR DELAKMDVVT
FFGPVKFGAN GQINSLDPPV FQIQGGKPVV LFPQAIKQGD LKIGLE