Gene RPD_3622 details


Gene Information

Locus tag: RPD_3622
Symbol:
ID: 4024136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4037649
End bp: 4038884
Gene length: 1236 bp
Protein length: 411 aa
Translation table: 11
GC content: 66%
IMG OID: 637963826
Product: major facilitator transporter
Protein accession: YP_570746
Protein GI: 91978087
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
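
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4037649
end_bp = 4038884

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1236
print(protein_length)  # 411
```

Both results match the reported gene length (1236 bp) and protein length (411 aa).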


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.769068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.781308
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAG CTCAGCTCAG CCCGTTTCAG CGCTGGTCGA TCCTGATCGG CGCCTCGGTG 
CTGCTCAGCC TCGCGATGGG GATGCGGCAG AGCTTCGGAC TGTTTCAACC TTCCGTGATC
CGCGACGTCG GCATCACCAG CGCCGATTTC TCGCTGGCGA CGGCTCTGCA GAATATCATC
TGGGGCGTGA CGCAGCCGAT GGTCGGGCTG ATCGCCGACC GCTACGGCTC GCGCTGGGTG
ATGCTCGGCG GCGTGCTGAT CTATGCCGCC GGCCTGGTGC TGATGATGAT CGCCGAATCG
GCGTTGGTAT TTACGCTCGG CTGCGGCGTC TGCGTCGGCA TCGCGTTGTC CTGCACCGCC
TCCAGCATGA CGATGACGGC GACCTCGCGC ACCGTGTCGG CCGCCAAGCG CAGCGTGGCG
ATGGGCGCGG TCTCGGCCGC GGGATCGCTC GGCCTGGTGC TGGCCTCGCC GCTTGCGCAA
ACCTTGATCA CAACCTCGGG CTGGCAGATG GCGCTGATCG GCTTCCTCGG CCTTGCCGCG
GTGATGCTGC CATCCGCCTT TTTCGCGGGG CGGTCCGACG ACATCGAGAT CGACAAGGCC
GACGATCTGG ATCAGTCGGC GGGTCAGGTG GTGCAGACCG CGCTCGGCCA TTCCGGTTTC
ATGGTGATGG CGATCGCGTT CTTCGTGTGC GGGCTGCAGC TCGTCTTCAT CACCACGCAT
CTGCCGAACT ATCTCGCGAT CTGCGGTCTT GACCCCTCGC TCGGCGCCAC CGCGCTGGCG
GTGATCGGGC TGTTCAACGT GATCGGCTCC TACGCCTGCG GCTGGCTCGG CGGTCGCTAT
CCGAAGCAAT ACCTGCTCGG CGGCATCTAT ATCGTGCGCT CGCTGACGAT CGCGGCGTAT
TTCTACTTCC CGGCCTCGGC GACCACGACA CTGGTGTTCG CCGCGGTGAT GGGCGCGCTA
TGGCTCGGCG TGATCCCGCT GGTCAACGGC CTGGTCGCGC AACTGTTCGG GCTGCGCTAC
ATGGCGACGC TGACCGGCAT CGCTTTCTTC AGCCATCAGG TCGGTTCGTT CCTGGGGGCG
TGGGGCGGCG GTATGGTCTA CGATCACCTC GGCAATTACG ATCGCGCCTG GCAGGCCGCG
GTGTTGATCG GGCTGATCGC CGGCACCGCG CAGATGATGA TGAATGTCCG TCCGCCGCGG
CGGCGTGAGG AATTGGCGGT GCCTGCCACC GCCTGA
 
Protein sequence
MKAAQLSPFQ RWSILIGASV LLSLAMGMRQ SFGLFQPSVI RDVGITSADF SLATALQNII 
WGVTQPMVGL IADRYGSRWV MLGGVLIYAA GLVLMMIAES ALVFTLGCGV CVGIALSCTA
SSMTMTATSR TVSAAKRSVA MGAVSAAGSL GLVLASPLAQ TLITTSGWQM ALIGFLGLAA
VMLPSAFFAG RSDDIEIDKA DDLDQSAGQV VQTALGHSGF MVMAIAFFVC GLQLVFITTH
LPNYLAICGL DPSLGATALA VIGLFNVIGS YACGWLGGRY PKQYLLGGIY IVRSLTIAAY
FYFPASATTT LVFAAVMGAL WLGVIPLVNG LVAQLFGLRY MATLTGIAFF SHQVGSFLGA
WGGGMVYDHL GNYDRAWQAA VLIGLIAGTA QMMMNVRPPR RREELAVPAT A
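
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of how the gene sequence might be cleaned and its GC fraction computed for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used here as a short example input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Example input: the first two 10-base blocks of the gene sequence.
# Paste the full listing here to check the whole gene.
sample = clean_sequence("ATGAAGGCAG CTCAGCTCAG")
print(len(sample), gc_fraction(sample))
```

Applied to the complete gene sequence, the cleaned length should equal the reported gene length of 1236 bp, and the GC fraction can be compared against the reported 66%.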