Gene RPB_1740 details

Gene Information

Locus tag: RPB_1740
Symbol:
ID: 3909727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1987652
End bp: 1988959
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 67%
IMG OID: 637883634
Product: extracellular ligand-binding receptor
Protein accession: YP_485359
Protein GI: 86748863
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
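
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1987652, 1988959)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Both results match the Gene Length and Protein Length fields reported in the record.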


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.398941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.157953
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCTCG GCGCGCGATC CGGCATATAC AGGGCGGATT CCGCCGCGCC ACGCGGCCCG 
ATGCGAGACC CGATCATGCT TTCCGATGCC TTCCGGTGCA CGCGATGGAC CGCGTTGCTC
ACCGTTCTCT TCGCCGTCGT CGCTGCGTCG CCGGCGCTTT CCCAGAAGCG CTACGATCCC
GGCGCCAGCG ACACCGCGAT CCGGATCGGC AATCTGATGC CCTATTCCGG CCCGGCCTCG
GCCTATGCCA TCGTCGGCCG CATCGAGCAG GCCTATTTCC GGATGATCAA CGACCAGGGC
GGCATCAACG GCCGCCGGAT CGAGTTCATT TCCTATGACG ACGCCTACAG CCCGGCCAAG
GCGGTGGAGC AGACGCGCAG ACTGGTCGAA AGCGACGAGG TGCTGCTGGT GTTCAGCGCG
ATGGGCACGC CCTCGAACAC CGCGATCCAG AAATATCTCA ACGCCAAGGG CGTCCCGCAA
CTGTTCGCCG CGAGCGGCGC GACGCGGTTC GGCGACCCGA AGGGCTTTCC CTGGACGATG
GGCTGGCAGC CGCCCTACCA GGTCGAGGGC CGCGTCTACG CCAAATACAT CCTCGCCAGC
AGGCCCGAGG CCCGGATCGC GGTGCTGTAT CAGAACGACG ACCTCGGACG CGACCTGCTG
AAGGGGCTGA AGGACGGGCT CGGCGACCAC GCGACGCAGA TCGTCGCCGA GGAGAGCTAC
GAAGTCGCCG AGCCTTCCGC CGATAACCAC ATCGCCCGGC TGAAGGCGTC GGGCGCCGAC
GCGTTCGTCA GCATCACCAC GCCGAAATTC GCGGCGCAGA GCATCCGCAC GGCCGCCGAG
ATGCAATGGC GTCCGCTGTA TCTGCAGGCG CTGGTGTCGG CCTCGATCGG CGCGGTGCTG
CGGCCGGCCG GGCTCGACCA CGCGCAGGGA CTCATTTCCG CGGCCTACAA CAAGGACGCC
GCCGATCCGC AATGGACCGA CGACCCCGGC ATGAAGCGGT TCCATGCCTT CCTCGATACC
TACGCGCCGG ACGTCAACCG CGGCGACAAT TCGGTGATCT ACGGCTACGG TGCGGCGCAA
TGCCTCGTCG AGGTCCTGCG CCGTGCCGGC GACACGCTGA CGCGCGCCAA TGTGATGCGC
GAAGCGGCCA GTCTCGAAGG CTACGCGCCC GACACGCTGC TGCCGGGCAT CACCATCACC
ACCGCGGCGA ACGACTTTCA TCCGATCGAA CAGCTGCGCT TGATGCGTTT CGAGGGCGAC
CACTGGCGCT TGTTCAGACC GGTGATCGAC GCCGACCTGC GCAACTGA
 
Protein sequence
MGLGARSGIY RADSAAPRGP MRDPIMLSDA FRCTRWTALL TVLFAVVAAS PALSQKRYDP 
GASDTAIRIG NLMPYSGPAS AYAIVGRIEQ AYFRMINDQG GINGRRIEFI SYDDAYSPAK
AVEQTRRLVE SDEVLLVFSA MGTPSNTAIQ KYLNAKGVPQ LFAASGATRF GDPKGFPWTM
GWQPPYQVEG RVYAKYILAS RPEARIAVLY QNDDLGRDLL KGLKDGLGDH ATQIVAEESY
EVAEPSADNH IARLKASGAD AFVSITTPKF AAQSIRTAAE MQWRPLYLQA LVSASIGAVL
RPAGLDHAQG LISAAYNKDA ADPQWTDDPG MKRFHAFLDT YAPDVNRGDN SVIYGYGAAQ
CLVEVLRRAG DTLTRANVMR EAASLEGYAP DTLLPGITIT TAANDFHPIE QLRLMRFEGD
HWRLFRPVID ADLRN
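
The sequences above are laid out in space-separated blocks of ten residues, so they need to be cleaned before any analysis. The following is a minimal sketch of that step plus a GC-content calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any library, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGGGCCTCG GCGCGCGATC CGGCATATAC AGGGCGGATT CCGCCGCGCC ACGCGGCCCG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna) * 100))
```

Running the same two functions over the full gene sequence gives a figure that can be compared with the GC content field in the record.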