Gene RPB_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3891 
Symbol 
ID3911695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4444352 
End bp4447216 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content68% 
IMG OID637885792 
ProductABC transporter ATPase component 
Protein accessionYP_487495 
Protein GI86750999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCTCG TCGTCGCCCT GTTTCTGGCT CAGGATGGCA TCGCCAACGG CGTGGTCTAC 
GCGCTGCTGG CGGTGGCGAT CGTGGTGCTG TTCTCGGTCA CTCGCATCAC TTTCGTGCCG
CAGGGCGAGT TCGTCTCCTA TGGCGCATTG ACCTTCGTCA CCCTGCAGGA CGGCCAGCGC
CCGGGCACGT TCGCGCTGCT CGGCCTCCTG ATCGCGCTGC ACCTCGTCGT CGACATCGCC
GCCGGCTGGC GCAGGCAGCG GCTCGCGACG CTGCTCGTGT CGAGCCTGCG CAGCGTCGGC
GCGGCGCTGA TCGTGCTGGC GCTGGCCTAT GCCGCGACGA GTTTCGAATT GCCGCTCATC
CTGAAGGCGG TGATCGCGAT CGCGGTGATC ACCGCCGCCG GGCCGCTGCT GTACCGCATC
GTCTATCAGC CGCTGGCCGA TGCTTCGGTG CTGGTGCTGC TGATCGTCTC GGTGGCGCTG
CATTTCGTCA TGACGGGGCT CGCGCTGTAC ATCTTCGGTC CCAGCGGCCA GCGCTCCGCG
CCGCTGATCG ACTTCACCAT GCCGCTGATG GGCGTTCCGA TCGCCGGGCA GACGCTGCTG
ATCCTCGGCG TCACCATCGT CGTCATCGTG GCGCTGTATG TCGCTTCGCA GCACACGCTG
TACGGCAAGG CGATGCAGGC GGTGGCGCTG AACCGCACCG GCGCGCGGCT GATGGGGATC
TCGACGACGC TGGCGGGCCG CACCTCGTTC CTGCTCGCCG CCTTCATCGG CGCGCTGGCC
GGCGTGCTGA TCTCGCCGAC CACCACCATC CTGTACGACA CCGGCTTCCT GCTCGGCCTC
AAGGGCTTCG TCGGCGCCAT TCTCGGCGGT CTCGCGAGCT ATCCGATCTC GGCGCTCGGC
GCGCTGCTGA TCGGCCTCTT GGAGAGCTAC TCGTCGTTCT TCGCCTCGAC CTACAAGGAA
GTCATCGTCT TCAGCCTGAT CATCCCGATC CTGCTGGTGC AGACGCTGCG CCAGCACCGC
ATCAAGGAGG ACGACGACCA CGGCGGCGAA GCGATCCCGA CGCAACTGAG CCTGTCGCCG
GAGGCGCTGC GCCGCCGCCG GCAGATCAGG ACCGGCCTCG GCGCCGCCTT CGTGCTCGCC
GTCGCCGCGG CGCCGCTGCT GCTGTCGAAC TACGAGATCG CGCTCTTGAA CTATGTCGGG
CTCGCCGCGA TCGTGGTGCT TGGCCTGGTG CTGCTCACTG GCGTTGCCGG GCTGACTTCG
TTCGCGCAGG CGGCCTATGT GGGCATCGGC GCCTACATCA CCGGCTATGT CTCGTCGGTC
TATGGCCTGT CGCCGTGGCT GACGCTGCCG ATGGCGCTGG CGGTCGCCTT CGTGCTGGCG
CTGTTCGGCT CGCTGATCAC GGCGCGGCTG TCGGGGCACT ATCTGCCGCT GGCGACGCTG
TCGCTGGCGG TGGTCGCCTA TTATCTGTTC GCCGCTTTGC CGCAGACCGG CGGCCAGGCC
GGCATGACCA ACATTCCGCC GCTGACCATC TTGGGCATCG CGATCGTGAG CCCGAAGGCG
TGGTATGTCG GGATCTGGGG CGCGCTGCTG CTGCTGCAAC TGGCGATGGC CAATCTGCTC
GACTCGCGCC CGGGCCGCGC CATCCGCGCG CTGAAATCCG GCACCATCAT GGCCGAGAGC
CTCGGCGTCG ACACCTGGCG GGCGAAGATC GCCGCCTTCG TGATCGCATG CCTGCTCGCC
GCACTCGCCG GCTGGTTCTA CGCGCACTAC CAGCGCTTCC TCAATCCGTC GCCGTTCTCG
TTCAACCAGG GCATCGAATA TCTGTTCATG GCGGTGATCG GCGGCGCCGG CTCGATCGGC
GGCGCGCTGG TCGGCTCCGG CATCGTCGTG CTCGCCAATC AGTGGCTGCA GACCAACCTG
CCGCTGGTGC TCGGGATGCA GGGCGATTTC CAGGTGATCA TCTTCGGCGT CGCCGCGATG
GCAATGCTGC AATTGCTGCC GCGCGGCATC TGGCCGGCGC TGCTCGATCT GTTCCAGATG
CGGCTGCCGA AATCCTGGAC GTTCTCGACC GACGCGGCCC CCTTGCCGGC GCGCCCCAAG
CCCGCGCCGG GCGAGACCGT GCTGGCGGTG CGCGGGGTCT GCAAGAATTT CGGCGCGATC
GCCGCCAACC GCAACATCAG CCTCGACGCC AAAGCCGGCG AAATTCTCGC GCTGATCGGC
CCCAACGGCG CCGGCAAGAG CACGCTGTTC GACCTGATTT CCGGGGTGCA GCGTCCGTCG
AGCGGCACGG TGCATTTCCT CGGCCACGAA AGCAGCTACT TCCCGCGCAC GCTGTCGCGC
GGCGGCATGG GCCGCACCTT CCAGCACGTC CGCATCCTTC CGGAAATGAC GGTGGTGGAG
AACACCGCGC TCGGCGCCCA TGCGCGCGCC GACATCTCGT TCCTGTCGTC CGCGCTGCGG
CTGGACCGCG GCAAGGAAGC GATGCTGATC GCCGAGGCGC TGCGCCAGCT CGAGCGCGTC
GGGCTCGCGA GCAGCGCGAC GCAGATCGCC GGCAGCCTCG CGCTCGGCCA GCAGCGCGTG
CTCGAAATCG CCCGCGCGCT CGCCTCCGAC CCCTGCCTGC TGCTGCTCGA CGAGCCGGCG
GCCGGGCTGC GCCATCTCGA AAAGCAGGCG CTGGCGCGGC TCTTGAAGCA GCTCAAGGCC
GAAGGCATGG CGGTGATCGT CGTCGAGCAC GACATGGATT TCGTGATGAA TCTCGCCGAC
CGCATCGTCG TCATGCAATT CGGTGAGAAG CTCGCCGAGG GCACGCCGTG CGAGATCCAG
CGTAACCCGG CCGTGATCGA AGCCTATCTC GGAGGCGCCG AATGA
 
Protein sequence
MNLVVALFLA QDGIANGVVY ALLAVAIVVL FSVTRITFVP QGEFVSYGAL TFVTLQDGQR 
PGTFALLGLL IALHLVVDIA AGWRRQRLAT LLVSSLRSVG AALIVLALAY AATSFELPLI
LKAVIAIAVI TAAGPLLYRI VYQPLADASV LVLLIVSVAL HFVMTGLALY IFGPSGQRSA
PLIDFTMPLM GVPIAGQTLL ILGVTIVVIV ALYVASQHTL YGKAMQAVAL NRTGARLMGI
STTLAGRTSF LLAAFIGALA GVLISPTTTI LYDTGFLLGL KGFVGAILGG LASYPISALG
ALLIGLLESY SSFFASTYKE VIVFSLIIPI LLVQTLRQHR IKEDDDHGGE AIPTQLSLSP
EALRRRRQIR TGLGAAFVLA VAAAPLLLSN YEIALLNYVG LAAIVVLGLV LLTGVAGLTS
FAQAAYVGIG AYITGYVSSV YGLSPWLTLP MALAVAFVLA LFGSLITARL SGHYLPLATL
SLAVVAYYLF AALPQTGGQA GMTNIPPLTI LGIAIVSPKA WYVGIWGALL LLQLAMANLL
DSRPGRAIRA LKSGTIMAES LGVDTWRAKI AAFVIACLLA ALAGWFYAHY QRFLNPSPFS
FNQGIEYLFM AVIGGAGSIG GALVGSGIVV LANQWLQTNL PLVLGMQGDF QVIIFGVAAM
AMLQLLPRGI WPALLDLFQM RLPKSWTFST DAAPLPARPK PAPGETVLAV RGVCKNFGAI
AANRNISLDA KAGEILALIG PNGAGKSTLF DLISGVQRPS SGTVHFLGHE SSYFPRTLSR
GGMGRTFQHV RILPEMTVVE NTALGAHARA DISFLSSALR LDRGKEAMLI AEALRQLERV
GLASSATQIA GSLALGQQRV LEIARALASD PCLLLLDEPA AGLRHLEKQA LARLLKQLKA
EGMAVIVVEH DMDFVMNLAD RIVVMQFGEK LAEGTPCEIQ RNPAVIEAYL GGAE