Gene RPB_1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1919 
Symbol 
ID3907998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2192585 
End bp2194234 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID637883813 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_485538 
Protein GI86749042 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCC AGTTCGTTTA CTTCATGCAG GGCCTGACCA AGGCCTATCC GACCCGCAAG 
GTGCTGGATA ACGTCCATTT GTCGTTCTAT CCCGACGCCA AGATCGGCGT GCTCGGCGTC
AACGGCGCCG GCAAATCGAC CCTGCTGAAG ATCATGGCGG GGATCGACAA GGAATACACC
GGCGAGGCCT GGGTCGCCGA CGGCGCCCGC GTCGGCTATC TCGAACAGGA ACCGCAGCTC
GATGCCGCGC TGAACGTGCG CGAGAACGTC ATGCTCGGCG TCGCCAAGCA GAAGGCGATC
CTCGATCGCT ACAACGAGCT GGCGATGAAC TACTCCGAGG AAACCGCCGA CGAGATGACG
GCGCTGCAGG ACCAGATCGA GTCCGCCGGG CTGTGGGATC TCGACAGCCA GGTCGATCAG
GCGATGGACG CGCTGCGCTG CCCGCCGGAC GACGCCGACG TCACCAAGCT GTCGGGCGGC
GAGCGCCGCC GCGTCGCGCT GTGCAAGCTG CTGCTCGACC GGCCCGAACT GCTGCTGCTG
GACGAGCCGA CCAACCATCT CGACGCCGAG AGCGTGTCGT GGCTCGAGAA CCATCTGCGC
AATTATCCGG GTGCGATCCT GATCGTCACC CATGATCGCT ACTTCCTCGA CAACGTCACG
TCCTGGATCC TCGAACTCGA CCGCGGCAAG GGCATTCCCT ACGAGGGCAA TTACTCGTCC
TGGCTGGTGC AGAAGCAGAA GCGGCTGCTG CAGGAGGGGC GCGAGGACGC GGCCCATCAG
AAGACGCTGG AGCGCGAGCA GGAGTGGATC GCCTCATCGC CCAAGGCTCG GCAGGCCAAG
TCCAAGGCGC GCTACCAGCG CTACGATGAA CTGCTCGCCA AGGCCAGCGA GAAGCAGACC
CAGACCGCGC AGATCATCAT CCCGGTGGCC GAGCGGCTCG GCAACAACGT CGTCGAGTTC
GACCACCTGA CCAAGGGCTT CGGCGACAAG TTGCTGATCG ACGATCTGAC CTTCAAGCTG
CCGCCCGGCG GCATCGTCGG CGTGATCGGC CCGAACGGCG CCGGCAAGAC CACATTGTTC
CGGATGATCA CCGGGCAGGA GAAGCCCGAC GGCGGCACCA TCACCGTCGG CGAGACCGTG
CATCTCGGCT ATGTCGATCA GTCGCGCGAC AGCCTCGACG CCAAGAAGAC GGTGTGGGAG
GAGATTTCCG GCGGCAACGA GCAGATCCTG CTCGGCAAGA AGGAAGTCAA TTCGCGCGGC
TATTGCTCGT CGTTCAATTT CAAGGGCGGC GACCAGCAGA AGAAGGTCGG CTCACTGTCG
GGCGGTGAGC GCAACCGCGT CCACCTCGCC AAGATGCTGA AGTCGGGCGC CAACGTGCTG
CTGCTCGACG AACCGACCAA CGACCTCGAC GTCGACACGC TGCGCGCGCT CGAAGAGGCG
CTGGAGGATT TCGCCGGCTG CGCCGTGATC ATCAGCCATG ACCGCTGGTT CCTCGACCGC
ATCGCCACGC ATATCCTGGC GTTCGAGGAC GACAGCCATG TCGAGTGGTT CGAGGGTAAC
TTCCAGGACT ACGAGAAGGA CAAGATGCGG CGGCTGGGGC AGGACGCCAT CATCCCGCAC
CGCGCGAAGT ACAAGAAGCT GACGCGTTGA
 
Protein sequence
MARQFVYFMQ GLTKAYPTRK VLDNVHLSFY PDAKIGVLGV NGAGKSTLLK IMAGIDKEYT 
GEAWVADGAR VGYLEQEPQL DAALNVRENV MLGVAKQKAI LDRYNELAMN YSEETADEMT
ALQDQIESAG LWDLDSQVDQ AMDALRCPPD DADVTKLSGG ERRRVALCKL LLDRPELLLL
DEPTNHLDAE SVSWLENHLR NYPGAILIVT HDRYFLDNVT SWILELDRGK GIPYEGNYSS
WLVQKQKRLL QEGREDAAHQ KTLEREQEWI ASSPKARQAK SKARYQRYDE LLAKASEKQT
QTAQIIIPVA ERLGNNVVEF DHLTKGFGDK LLIDDLTFKL PPGGIVGVIG PNGAGKTTLF
RMITGQEKPD GGTITVGETV HLGYVDQSRD SLDAKKTVWE EISGGNEQIL LGKKEVNSRG
YCSSFNFKGG DQQKKVGSLS GGERNRVHLA KMLKSGANVL LLDEPTNDLD VDTLRALEEA
LEDFAGCAVI ISHDRWFLDR IATHILAFED DSHVEWFEGN FQDYEKDKMR RLGQDAIIPH
RAKYKKLTR