Gene RPB_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0038 
Symbol 
ID3909721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp37966 
End bp39186 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID637881919 
Productextracellular ligand-binding receptor 
Protein accessionYP_483661 
Protein GI86747165 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCC TGATTCAGGC CGCCACGGCG GCTGTTGCTG CGATTGTGCT CACTGCCGCC 
CCGGCTGCGG CACAGAAGAA ATACGACACC GGCGCCACCG ATACCGAGAT CAAGATCGGC
CAGACCGTGC CGTTCTCCGG TCCCGCCTCG GCCTATGCGG GCATCGGCAA GACCCAGGCG
GCCTATATGC GGATGATCAA CGATGCCGGC GGCATCAACG GCCGCAAGAT CAACCTGATC
CAGTACGACG ACGCCTATTC GCCGCCGAAA GCGGTCGAGC AGGTGCGCAA GCTGGTCGAA
AGCGACGAGG TGCTGCTGAC CTTCCAGATC ATCGGCACCC CGTCCAACGC CGCGGTGCAG
AAATATCTCA ACCAGAAGAA GGTGCCGCAA CTGCTCGCGG CGACCGGCGC GACACGGTTC
ACCGATCCGA AGAATTTCCC CTGGACGATG GGCTACAACC CGAACTACCA GACCGAGGGT
CGGATCTACG CGCGCTACAT CCTGAAGAAC CACCCCGACG CCAAGATCGG CGTGCTGTTC
CAGAACGACG ATCTCGGCCG CGACTACGTC ACCGGCCTGC GGGCCGGCCT CGGCGACAAG
GCCGACAAGA TGATCGTGGC GGAGACGTCC TATGAACTCA CCGACCCGAC CGTCGACTCG
CAGATCGTCA AGCTGAAATC CGCCGGCGCC ACGCTGCTGT ACGACGCATC GACGCCGCGC
TTCGCCGCGC AGGCGATCAA GAAAGTCGCC GATCTCGGCT GGAATCCGGT GCACATCCTC
GACATCAACG CCAGCCCGGT GTCGGCGACG CTGAAGCCCG CCGGCCTCGA CATCTCCAAG
GGCATCATCA GCGTCAATTA CGGCAAGGAC CCCGCCGACC CGCAATGGGC CGACGACCCG
GGCGTGAAGA AGTACTTCGC CTTCATGGAC AAGTACTATC CCGAGGGCGA CAAGATGAAC
ACCGTCAACA GCTACGGCTA TTCCACCGCG GAGCTGCTGA TCACCATCCT GAAGCAGTGC
GGCGACAATC TCACCCGCGA CAACATCATG AAGCAGGCCG CCAATCTGAA GAACGTCACG
CTCGACCTGT CGCTGCCGGG CATGTCGATC AATACCTCGC CGACCGACTT CCGCGTCAAC
AAGCAGTTGC GGATGATGAA GTTCAACGGC GAGCGCTGGG AGCTGTTCGG CCCGATCATT
GAGGACGACG CCGCGATGTG A
 
Protein sequence
MKSLIQAATA AVAAIVLTAA PAAAQKKYDT GATDTEIKIG QTVPFSGPAS AYAGIGKTQA 
AYMRMINDAG GINGRKINLI QYDDAYSPPK AVEQVRKLVE SDEVLLTFQI IGTPSNAAVQ
KYLNQKKVPQ LLAATGATRF TDPKNFPWTM GYNPNYQTEG RIYARYILKN HPDAKIGVLF
QNDDLGRDYV TGLRAGLGDK ADKMIVAETS YELTDPTVDS QIVKLKSAGA TLLYDASTPR
FAAQAIKKVA DLGWNPVHIL DINASPVSAT LKPAGLDISK GIISVNYGKD PADPQWADDP
GVKKYFAFMD KYYPEGDKMN TVNSYGYSTA ELLITILKQC GDNLTRDNIM KQAANLKNVT
LDLSLPGMSI NTSPTDFRVN KQLRMMKFNG ERWELFGPII EDDAAM