Gene Rpal_5295 details


Gene Information

Locus tag: Rpal_5295
Symbol:
ID: 6412996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5712919
End bp: 5714205
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 66%
IMG OID: 642715184
Product: major facilitator superfamily MFS_1
Protein accession: YP_001994256
Protein GI: 192293651
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
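
The start and end coordinates above agree with the reported gene and protein lengths. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon:

```python
# Values copied from the record above.
start_bp = 5712919
end_bp = 5714205

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1287 0 428
```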


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCGCGC AGCCCACGCC GCAAGGTGCC TGGAGGATCA CCTTCCTGCT GTTTCTATTC 
ATGGTGGTCA ACTTCGCCGA CAAGATTGTC GTAGGCCTGG CCGGCGTGCC GATCATGCAG
GAGCTGAAGC TCTCGCCCGA ACAATTCGGT CTGCTCGGCT CGTCGTTCTT CTTCCTGTTC
TCGATCACCG CGATCGTGGT CGGCTTCATC GTCAATCGGG TTGAGACCAG ATGGGTGCTG
TTGGCGCTGG CGCTGGTGTG GGCGGTGGCG CAGTTTCCGA TGGTCGGCGA GGTCTCCTTC
GCCACCTTCG TGATCTGCCG CATCATTCTC GGCGCCGGCG AAGGGCCAGC GTTCTCGGTC
GCAGCGCATG CGATCTACAA GTGGTTTCCG GATCATCAGC GCACGCTGCC GACCGCGATC
CTGTCGCAGG GCTCGGCGTT CGGTGTGATC CTGGCGGTGC CGGCGCTGAA CTGGATCATC
GTCAATCACT CCTGGCACTA CGCCTTCGCG GCGCTCGGCA TCGTCGGGCT GATGTGGGCG
GTGGCGTGGC TCGCGCTCGG CAAAGAGGGG CCGCTGGTGC CGAGCCCCGC GGCGGCCGCC
GCCGAGGTGC GGATTCCTTA CGTGCGGCTG CTGACCTCGC GCACTTTCAT TGGCTGCGTG
CTGGCAACAT TCGGTGCCTA TTGGGCGCTG TCGCTCGGGC TGACCTGGTT TACCACCTTC
ATCGTGCAGG GGCTGGGCTT CAGCCAGCAC CAGGCCGGGC TGGTCTCGAT CACGCCGTGG
GTGTTCGGCG CCTGCGTGGT GCTGTTCACC GGCTGGCTGT CGCAGCGGCT GATGCAGCGC
GGCGTCTCCA GCCGGATGGC GCGCGGCGTG CTCGGTGCGG CGCCACTCTT GGTCGGCGGC
GCCATCATCC TGATGCTGCC CTATATCGAC AGTCCCACCG CACGGATCGT CGCCCTGGTG
GTCGGCTCCG GCCTGTGCGG CTCGATCTAC GTGGTGTGTC CGCCGATGAT CGCCGAGTTC
GCTCCGGTGT CGCAGCGCGG CGCCGCGATC GCGATCTACG GCGCGCTGTA TACGCTGGCG
GGGATCATCG CACCGTGGGT GATGGGCAGC GTACTGCAGC ACTCAGCCTC GCTGCTTCAC
GGTTACATCG TCGGCTACGC CATCAACGGC GCCGTGATGA TCGTTTCGGG TGTCGCCGGC
CTGTTGCTGC TGTGGCCGAA CACCGAGCGG GCACGGCTGT TGCGGAGCGC GGAGCCGGTG
TCACTGGGAC TGCGCAAGCC GGCCTAA
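
The gene sequence is printed in blocks of ten bases, so the grouping has to be removed before computing statistics such as length or GC content. A minimal sketch, using hypothetical helper names `clean_sequence` and `gc_fraction` and only the first row of the sequence as sample input:

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGATCGCGC AGCCCACGCC GCAAGGTGCC TGGAGGATCA CCTTCCTGCT GTTTCTATTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence through the same two functions gives a value that can be compared with the GC content field in the Gene Information block.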
 
Protein sequence
MIAQPTPQGA WRITFLLFLF MVVNFADKIV VGLAGVPIMQ ELKLSPEQFG LLGSSFFFLF 
SITAIVVGFI VNRVETRWVL LALALVWAVA QFPMVGEVSF ATFVICRIIL GAGEGPAFSV
AAHAIYKWFP DHQRTLPTAI LSQGSAFGVI LAVPALNWII VNHSWHYAFA ALGIVGLMWA
VAWLALGKEG PLVPSPAAAA AEVRIPYVRL LTSRTFIGCV LATFGAYWAL SLGLTWFTTF
IVQGLGFSQH QAGLVSITPW VFGACVVLFT GWLSQRLMQR GVSSRMARGV LGAAPLLVGG
AIILMLPYID SPTARIVALV VGSGLCGSIY VVCPPMIAEF APVSQRGAAI AIYGALYTLA
GIIAPWVMGS VLQHSASLLH GYIVGYAING AVMIVSGVAG LLLLWPNTER ARLLRSAEPV
SLGLRKPA
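
The gene sequence begins with GTG while the protein begins with M: under translation table 11, GTG can serve as an alternative start codon and is read as methionine at initiation. The sketch below illustrates this with a hypothetical helper `translate_cds`. It assumes the input is a complete CDS in frame, and it simplifies by writing M for the first codon whatever that codon is. The codon-to-amino-acid assignments are those of the standard code, which table 11 shares for elongation.

```python
from itertools import product

# Codon table built in TCAG order; product() varies the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as initiator methionine."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Simplifying assumption: the first codon is a valid start codon.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGATCGCGC AGCCCACGCC GCAAGGTGCC"))  # prints: MIAQPTPQGA
```

The printed peptide matches the first ten residues of the protein sequence listed above.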