Gene RPB_0118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0118 
Symbol 
ID3908089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp129405 
End bp131093 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content72% 
IMG OID637882000 
Producthypothetical protein 
Protein accessionYP_483741 
Protein GI86747245 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCA GGCCGAAGCG TGGCGCATTT TCCGCCGGCG GCGTCGCGGC GCCGCTGCGC 
TATGCGCTGT TCCGGCGGAT CTGGCTGGCC AGCCTGCTGT CCAATCTCGG CCTGATGATC
AACGGCGTCG GCGCCGCCTG GGCGATGACC CAGATGACGT CCTCGGCCGA CAAGGTGGCC
TTGGTGCAGA CCGCCCTGAT GCTGCCGATC ATGCTGGTGG CGATGCCGGC CGGCGCGATC
GCCGACATGT ATGACCGCCG GCTGGTCGGG CTGGCGTCGC TGACGCTCGG CCTCGGCGGC
GCCACGGCGC TCGCGGTGCT GGCGCATCTC GGCCTGGTGA CGCCGGAGAT CCTGCTCGCC
TTCTGCTTCG TGATCGGCAC CGGCATGGCG CTGTTCGGCC CCGCCTGGCA GGCCTCGGTC
AGCGAGCAGG TGCCGGGCGA GGCGTTGCCG GCCGCGGTGG CGCTGAACGG TATCAGCTAC
AACATCGCCC GCAGCTTCGG CCCCGCGGTC GGCGGCATCG TGGTGGCAAG CGCCGGCGCG
GTCGCGGCAT TCGCGGCGAA TGCGGTGCTG TATCTGCCGC TGCTGGCGGT GCTGTTGCTG
TGGCGGCGGG TCAGCGAGCC GCCGCGGTTG CCGCCGGAGC GGCTCAACCG CGCGATCGTG
ACCGGCGTGC GCTACATCGC CAACTCGCCG TCGATCCGGA TCGTGCTGGC GCGAACGCTG
ATCACCGGCA TCGCCGGCTC GTCGGTGCTG GCGCTGATGC CGCTGGTGGC GCGCGACCTG
CTGAAGAGCG GCGCCGAGAC CTACGGCATC CTGCTCGGCG CGTTCGGGGT CGGCGCGGTG
ATCGGCGCGC TCTATGTCGG GATGGTGCGC GAGCGGATGA GCAGCGAGGC CGCGATCCGC
AGCTGCGCGC TGATCATGGG CGTGGCGATG GCGGCGGTGG CGATGAGCCG CTGGTCGATA
CTCACCGCCG CGGCGCTGGT GATTGCGGGC GCGGTGTGGA TGCTGGCGAT CGCGCTGTTC
AACATCGGCG TGCAGCTGTC GGCGCCGCGC TGGGTCGCCG GCCGCTCGCT GGCGGCGTTC
CAGGCCTCGA TCGCCGGCGG CATCGCGATC GGGAGCTGGG GCTGGGGCCA CGTCGCCGAT
CTCTCGGGCG TCGCGCCGTC GATGCTGATC TCGGCGGGCG CCATGGTCGT CTCGCCGCTG
CTCGGGCTGT GGCTGCGGAT GCCGGCGGTC GGTACCCAGA TCGAGGACGC CGACCTCCTG
GCCGACCCGG AAGTGCGGCT GGCGCTGACG CCGCGCAGCG GACCGCTGGT GGTCGAGATC
GAGTACCGCG TCGATCCCGA CGACGCCCGC GCGTTTCACG GGGTGATGCA GCAGGTGCAG
CTCAGCCGCC AGCGCAACGG CGCCTATGGC TGGTCGATCG CCCGCGACAT CGCCGATCCG
GAATTGTGGA CCGAGCGCTA TCACTGCCCG ACCTGGCTCG ACTACTTGCG GCAGCGCAAC
CGCTCGACCC AGGACGATCG CAATCTGCAT CAGCGCGCGA TGGCGTTCCA CCGCGGCGCA
GACCCGATCC GGGTCCGCCG GATGCTGGAG CGGCCGTTCG GATCGGTGCG CTGGAAGGAC
GATTCGCCCG ATCGCGCCAC CGGCACCGAA GTGCTGCCGG TCGCCGGCGT CAGCGGCGGC
TCGACCTGA
 
Protein sequence
MTGRPKRGAF SAGGVAAPLR YALFRRIWLA SLLSNLGLMI NGVGAAWAMT QMTSSADKVA 
LVQTALMLPI MLVAMPAGAI ADMYDRRLVG LASLTLGLGG ATALAVLAHL GLVTPEILLA
FCFVIGTGMA LFGPAWQASV SEQVPGEALP AAVALNGISY NIARSFGPAV GGIVVASAGA
VAAFAANAVL YLPLLAVLLL WRRVSEPPRL PPERLNRAIV TGVRYIANSP SIRIVLARTL
ITGIAGSSVL ALMPLVARDL LKSGAETYGI LLGAFGVGAV IGALYVGMVR ERMSSEAAIR
SCALIMGVAM AAVAMSRWSI LTAAALVIAG AVWMLAIALF NIGVQLSAPR WVAGRSLAAF
QASIAGGIAI GSWGWGHVAD LSGVAPSMLI SAGAMVVSPL LGLWLRMPAV GTQIEDADLL
ADPEVRLALT PRSGPLVVEI EYRVDPDDAR AFHGVMQQVQ LSRQRNGAYG WSIARDIADP
ELWTERYHCP TWLDYLRQRN RSTQDDRNLH QRAMAFHRGA DPIRVRRMLE RPFGSVRWKD
DSPDRATGTE VLPVAGVSGG ST