Gene RPB_3339 details


Gene Information

Locus tag: RPB_3339
Symbol:
ID: 3911141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3819862
End bp: 3821094
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 63%
IMG OID: 637885242
Product: extracellular ligand-binding receptor
Protein accession: YP_486946
Protein GI: 86750450
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
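
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3819862
END_BP = 3821094


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1233 410
```

Under those assumptions the result agrees with the listed Gene Length (1233 bp) and Protein Length (410 aa).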


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.154352
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.135284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATGA CGACGACTTC CGTGGCCGCG TTCGCGGCTG CGATCGCGAT GCTGGCCGCG 
AGCCCGGCAG CGGCCCAGAA GAAATACGGC CCCGGCGCCA GCGACACCGA GATCAAGCTC
GGCAACACAG TGCCCTATAG CGGCCCAGCC TCGGCCTACG GCATTCTCGG CAAGACCTAT
GCCGCGTATT TCGCAAAGAT CAACGAGGAA GGCGGCATCA ACGGCCGCAA GATCGTCCTG
ATCTCTTATG ACGACGCCTA TTCGCCGCCG AAGACCGTGG AACAGACCCG CAAGCTGGTC
GAAAGCGACG AGGTGCTGGC GATCGTCGGC AATGTCGGTA CCGCCTCCAA CATCGCGATC
CAGAAATATC TGAACGCCAA GAAGACCCCG CAATTGTTTC TCGCCACCGG CGCGACGCGC
TGGAACGATC CGAAGCAGTT TCCGTGGACC ATGGGCTGGC TGCCGAGCTA CCAGGCCGAG
GCCACGGCCT ATGCGAAATA TCTGCTGAAG GAGAAGCCCG ACGCCAAGAT CGGCGTGTTC
TACCAGAACG ACGATTTCGG CAAGGACTAC GTGCGCGGCC TGAAGGAGGG GCTCGGCGAC
AAGGCGGCGA CGATGATCGT CGCCGAATCC AGCTACGAGG TCTCCGAGCC GACGGTGGAT
TCCCACATCG TCAAGCTGAA GGCGGCCGGC GCCGACACGC TGCTCACCTT CGCGACCGGC
AAGTTCGCCG CGCAGGCGAT CAAGAAGGTC GCCGAACTCG GCTGGAAGCC GCTGCACATC
GTGCCCAACG CCAGTTCGTC GCTCGGCAGC GTGCTGCGCC CGGCCGGCCT CGACAATGCG
CAGGACCTGG TGTCCGCGAC CTTCGCCAAG GACCCGACCG ATCCGCAGTG GAACGAGGAT
CCGGGGATGA AGAAATTCCA CGCCTTCGTC GAGAAATACA TTCCCGAAGG CAAGGCGATG
GAGAGCACCG TGCTGTCCGG CTACAGCATC GCCCAGACCA TGGCGGAGGC GCTGCGGATG
TGCGGCGATG ATCTGACCCG CGACAACCTG ATGAAGCAGG CGGCGAACAT GAAGGACGTC
AAGCTCGACG GCCTCTTGCC GGGCGTCACC GTCAACACCA GCGCCACCGA CTTCGCGCCG
ATCGACCAGT TTCAGATGAT GGTGTTCAAG GGCGAGCGCT GGGGGCGGTT CGGCGACGTC
ATCAAGGGCG AACTGGCCGT GGCCGGACGG TAA
 
Protein sequence
MRMTTTSVAA FAAAIAMLAA SPAAAQKKYG PGASDTEIKL GNTVPYSGPA SAYGILGKTY 
AAYFAKINEE GGINGRKIVL ISYDDAYSPP KTVEQTRKLV ESDEVLAIVG NVGTASNIAI
QKYLNAKKTP QLFLATGATR WNDPKQFPWT MGWLPSYQAE ATAYAKYLLK EKPDAKIGVF
YQNDDFGKDY VRGLKEGLGD KAATMIVAES SYEVSEPTVD SHIVKLKAAG ADTLLTFATG
KFAAQAIKKV AELGWKPLHI VPNASSSLGS VLRPAGLDNA QDLVSATFAK DPTDPQWNED
PGMKKFHAFV EKYIPEGKAM ESTVLSGYSI AQTMAEALRM CGDDLTRDNL MKQAANMKDV
KLDGLLPGVT VNTSATDFAP IDQFQMMVFK GERWGRFGDV IKGELAVAGR
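
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard codon-to-amino-acid mapping. Translation table 11 assigns the same amino acids to internal codons and differs only in which start codons it permits, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCGTATGA CGACGACTTC CGTGGCCGCG"))  # prints: MRMTTTSVAA
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows comparison with the listed protein sequence and the reported GC content.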