Gene RPC_1045 details

Gene Information

Locus tag: RPC_1045
Symbol:
ID: 3969655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1146532
End bp: 1147968
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 64%
IMG OID: 637924156
Product: major facilitator transporter
Protein accession: YP_530928
Protein GI: 90422558
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
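
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly, but the numbers are consistent with it). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1146532
end_bp = 1147968

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon
# and does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1437 bp
print(protein_length, "aa")   # 478 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.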


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAC TCACGGCCGC AGCGCCGTAC GTCATTCGGC GCCCACAAGA CGTTGTCGAT 
ATCGTCAACG CCCACCCGGC GACCCGTTCC GGTCTCGCCG TCACGCTGAT CGCGCTGGGC
GGCGTCCTGA TCGATGCCTA CCAGGCCGCG ATGATCGGCT TCGGCAATTC GTTCATCGCC
ACGCAGTTCG GCATTTCGCC GGGGCTTGCG GCGACGGTGA ACGCCTCGGT TCTGGTCTCG
GCGCTGATCG GCGGCTTGTT GTCGAACCGG ATCATCAATC GCTTCGGCCA GCGCGGCGGC
TTCCTGATCG GCATGGGGCT TTGCACGGTC GGAGCCTTCG CGATCGCTTT CGCTCCAAAT
ATCTGGGCTG TGCTGGTGAG TCGGTTGGTG ATGGGATTAG GTCTCGGCAT CGATTTCCCG
CTGGCAACCG GCGCGGTCGC CGAATTGCGC GGCTCGTCCT CGAAGAAGTC CGGCACGTCC
GTCAATCTTT GGCAGATGGG ATGGTACCTT TCGACCACGG TGGTCTATCT CATCTTGCTG
TCGCTCTCGG CGGCGGCGGT CGAGCAGCCG ATGCTGTGGC GTTACGGCAT TTTCATCGGC
TCCGCCTTCG CCGTGGTCGT CATGGTGCTG CGCTATATCT ACATCGGGGA GTCGGCGATG
TGGGCGGCGC GCACCTATCG CTACGACGAG TCCTGCAAAA TCCTCAGCGA TCGTTATGAC
GTCCGGGCCG AAGTGGCCGG GGACGCCACC CACGAGAAGG AGGCAGGGGC CAAGCTGCAC
GGTGCCTACT CTGTGCTGTT CCGTGCGCCC TATCGTAGAC GGACCATTCT CGGCTGTGTG
GTCGCCACGA TGCAGGCCTG GCAATACAAT GCGGTCGGCG TCTATCTCCC CTTGACGCTT
GCGGGCATTC TCTCCGGCGG GCTGTCGAAC GCGCTGTGGG GCTCGGCCGC CGTCAATGCG
CTATGCGGCG TCACCGGCGG GGCGATCGGT TCGATCCTGG TGCAGAAGAT CGGGGCGCGT
CGGCAGTCGA TGTTCGGCTT CGGCATGGTG GTGCTGGCGC TGCTGATGCT GGGCTTCATG
GGCAAGGATA GCCCGTGGCT GGCGCTGGTG TTGCTCGGGC TGATCATCTT CTTTCATTCG
GCCGGTCCCG GCGGTCTCGG CATGACCATC GCGACCTTGT CGTACCCGCC GAGCATTCGC
ACGGCGGGCG TCGGTTTTGC GCGCGCGATC ATGCGCGCCG GCGCCCTTTG CGGGCTGATC
TTCTGGCCGA TCCTTTGGCA GAACCTGCGC ACCGACGCTT TCTACTGGCT GGCAATCGTG
CCGCTGGTCG GATTTCTGAC CTGTCTTGCG ATCCGCTGGG AGCCGATCGG CGCCAACGTC
GACGCCGAAG ACGCTTCGGT GCTCTCTATC GTTGCAGTGA AGGAGAATGC GGCATGA
 
Protein sequence
MNTLTAAAPY VIRRPQDVVD IVNAHPATRS GLAVTLIALG GVLIDAYQAA MIGFGNSFIA 
TQFGISPGLA ATVNASVLVS ALIGGLLSNR IINRFGQRGG FLIGMGLCTV GAFAIAFAPN
IWAVLVSRLV MGLGLGIDFP LATGAVAELR GSSSKKSGTS VNLWQMGWYL STTVVYLILL
SLSAAAVEQP MLWRYGIFIG SAFAVVVMVL RYIYIGESAM WAARTYRYDE SCKILSDRYD
VRAEVAGDAT HEKEAGAKLH GAYSVLFRAP YRRRTILGCV VATMQAWQYN AVGVYLPLTL
AGILSGGLSN ALWGSAAVNA LCGVTGGAIG SILVQKIGAR RQSMFGFGMV VLALLMLGFM
GKDSPWLALV LLGLIIFFHS AGPGGLGMTI ATLSYPPSIR TAGVGFARAI MRAGALCGLI
FWPILWQNLR TDAFYWLAIV PLVGFLTCLA IRWEPIGANV DAEDASVLSI VAVKENAA
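
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ only in which codons may act as start codons). The function name `translate` and the two test fragments (the first and last blocks of the gene sequence above) are chosen for illustration.

```python
# Build the codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
head = translate("ATGAATACAC TCACGGCCGC AGCGCCGTAC")
# Final blocks of the gene sequence, ending in the TGA stop codon.
tail = translate("GTTGCAGTGA AGGAGAATGC GGCATGA")

print(head)  # MNTLTAAAPY
print(tail)  # VAVKENAA
```

The two fragments reproduce the first and last residues of the protein sequence listed above.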