Gene RPB_2999 details


Gene Information

Locus tag: RPB_2999
Symbol:
ID: 3910798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3413803
End bp: 3415029
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 70%
IMG OID: 637884905
Product: major facilitator transporter
Protein accession: YP_486612
Protein GI: 86750116
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
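
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes that the start and end positions are 1-based and inclusive and that the terminal stop codon is counted in the gene length but not in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3413803
end_bp = 3415029

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp;", protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported in the record.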


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.548657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0674552
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGACCAGC GCGCATCGCA GCGGACCGGC CTCGTCGTGC TCGGCGTGTG CTTCGTGCTG 
TCGCTGCTCG GCCGCGGCCT GGGCGAGAGC TTCACCGTCT TCCTGCTGCC GATCTCGCAA
TCGTTCGGCT GGGACCGCGC CGAGGTGGTG TCGATCTATT CGCTGACTGC GCTGTGCAGC
GGCATCGCCT CGCCCTTCGT CGGCCGTCTG TTCGACCGCT CCGGCCCGCG CCTCGTCTAT
ATGCTCGGAC TGCTGCTGCT CGGCGGCGCC TTCCTCGGTG CGGCCATCGC GCAGCAGCTC
TGGCAATTGC AGCTCGCCGT CGGCCTCTGC GTCGGGCTCG GCATCGCCTT CACCGGCACC
GTGCCGAACT CGATCCTGCT CGGCCGCTGG TTCGGACCGC GGCTGCCAAC CGCGATGGCC
GTGGTGTATT CCGCGACCGG CGCCGGCGTG CTGCTGCTGC TGCCGATCGC CCAGCTGCTG
ATCGAGCGTT CCGGCTGGCG CGGCGCCTAC GAGTTGCTGG GCGCGGCGAT GCTGCTGCTG
CTGGTGCCGC TGCTGATGCT GCCGTGGCGG CGCTATGCCC AGGGTGCGCC GGGCGGCATC
GCGGCGCACG CCGCCTCGCT CGACGCACCC GACGACGGCT GGACGCTGCG CGCGGCGATG
CGGCACCACG CGTTCTGGGC GCTGTTCGCG ACGTTCTTCT TCACCGCGAT CGGGATGTAC
GCGATCGCAG CCCAAGTCGT CGCCTATCTG ATCGACGCCG GCTTTCCGCC GCTGCAGGCG
GCGACCGCCT GGGGCTTCTC CGGCGTGGTG CTGGTGATCG GCATGCTCGG CGTGAGCTGG
CTCGACGGCG TGATCGGCCG CCGGCCCTCG ATCCTTTTCT CCTATGCGGT CTCGATCGCC
GGCATCGTGA TGCTGTGGCT GCTGAAATCC TATCCCGACT ACGTCCTGCT GACCGGCTTC
GTCGTCTGCT TCGGCAGCAT GATCGGCTCG CGCGGCCCGC TGATCACCGC GACCGCGATG
AAGCTGTTTC GCGGCCGGCA CGTCGGCCTG ATCTACGGCA CGATCGCGAT CGGCAGCGGG
CTCGGCTCGG CGTTCGGCTC CTGGTGCGGC GGCCTGATCC ACGACCTCAG CGGCAGCTAC
GACCCGGTGA TCGGCTTCGC GCTGGTCGCC GTGCTGCTCG GGATGATTCC GTTTCTGGTG
GTGCCGGCGC TGCGCGAGCG GTCCTGA
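
The GC content field in the record can be recomputed from the gene sequence. The Python sketch below shows one way to do that, using only the first 60-base line of the sequence as sample input; the function name `gc_percent` is hypothetical, and the choice to ignore whitespace and any non-ACGT characters is an assumption about how the sequence should be cleaned.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Assumption: spaces, line breaks and any non-ACGT symbols are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases in blocks of 10).
first_line = "TTGGACCAGC GCGCATCGCA GCGGACCGGC CTCGTCGTGC TCGGCGTGTG CTTCGTGCTG"
print(round(gc_percent(first_line), 1))
```

Passing the full 1227-base sequence instead of `first_line` gives the value to compare with the GC content field.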
 
Protein sequence
MDQRASQRTG LVVLGVCFVL SLLGRGLGES FTVFLLPISQ SFGWDRAEVV SIYSLTALCS 
GIASPFVGRL FDRSGPRLVY MLGLLLLGGA FLGAAIAQQL WQLQLAVGLC VGLGIAFTGT
VPNSILLGRW FGPRLPTAMA VVYSATGAGV LLLLPIAQLL IERSGWRGAY ELLGAAMLLL
LVPLLMLPWR RYAQGAPGGI AAHAASLDAP DDGWTLRAAM RHHAFWALFA TFFFTAIGMY
AIAAQVVAYL IDAGFPPLQA ATAWGFSGVV LVIGMLGVSW LDGVIGRRPS ILFSYAVSIA
GIVMLWLLKS YPDYVLLTGF VVCFGSMIGS RGPLITATAM KLFRGRHVGL IYGTIAIGSG
LGSAFGSWCG GLIHDLSGSY DPVIGFALVA VLLGMIPFLV VPALRERS
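
The gene begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon that is read as methionine. The Python sketch below illustrates this on the first 30 and last 27 bases of the gene sequence. The function `translate_cds` and its `has_start` parameter are hypothetical, and the start-codon set is the one listed for table 11 as best I recall it, so treat it as an assumption to check against the NCBI genetic code tables.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str, has_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


head = "TTGGACCAGC GCGCATCGCA GCGGACCGGC"  # first 30 bases of the gene
tail = "GTGCCGGCGC TGCGCGAGCG GTCCTGA"     # last 27 bases, in frame
print(translate_cds(head))
print(translate_cds(tail, has_start=False))
```

The `has_start=False` call matters for the tail fragment: its first codon, GTG, is also a possible initiator, but in the middle of the gene it should be read as valine.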