Gene Rpal_3944 details


Gene Information

Locus tag: Rpal_3944
Symbol:
ID: 6411625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4231001
End bp: 4232287
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 70%
IMG OID: 642713825
Product: major facilitator superfamily MFS_1
Protein accession: YP_001992915
Protein GI: 192292310
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
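
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4231001, 4232287)
print(length_bp, protein_length(length_bp))  # 1287 428
```

Under these assumptions, 1287 bp corresponds to 429 codons, one of which is the stop codon, giving the 428 aa reported for the protein.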


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTCCG ACGCCCTCGC CGCCGCGCTG GCCCGCCGCA ACATCCACTA CGGCTGGGTA 
ATGGTCGGCG TCACCTTCCT CACCGCGCTG GTCAATGCCG GCGCGGTCGG CGCCCCCGGC
GTGTTCATCG TGCCGCTGCA GCAGGAATTC GGCTGGAGCA CCGCGGATAT TTCGTCCGCA
CTGTCGATCC GCTTCGTGCT GTTCGGGCTG ATGGCGCCGT TCGCCGCGGC GCTGATGAAC
CGGTTCGGTC TGCGGCAGGT GACGCTGACT GCGCTCGTGG TGGTCGCGAT CGGGCTGATC
GCGTCGCTGG CAATGACGGC GCTGTGGCAG CTGATGCTGC TGTGGGGCGT GGTGGTCGGC
GTCGGCACCG GCCTCACCGC GCTGGTGCTC GGCGCCACGG TGGCGACGCG CTGGTTCGTC
GCCCGCCGCG GCCTGGTGAT CGGCATGATG ACCGCGAGCG TCGCCACCGG CCAGCTCGTG
TTCCTGCCGA TCCTCGCCTC GCTCACCGAA CGCTACGGTT GGCGGATCGC GATGGCTTAC
GTCTGCGCGC TGATCGGCGT CGCCGCAATC GCGGTGCTGA TCGCGATGCG CAACCGGCCA
AGCGACGTCG GGCTGCGGCC CTATGGCGAC ACCAGCACCG AGCCGGTGCC GCTGTCGGCC
CAGGCTGTGC CGGGCTCGAT CGCAGCGGCC GCACTCGGCG CGCTCCGCGA CGCGGCCAAG
ACGCGGGTGT TCTGGGTGCT GTTCGGCACC TTCTTCATCT GCGGCGCGTC GACCAACGGC
CTGGTGCAGG TTCACCTGAT CCCGCTCTGC GCCGACTTCA ACATCCCGCA GGTGCAGGCC
GCAGGCCTGC TCGCAGCGAT GGGCGTGTTC GATTTCATCG GCACCATCCT GTCGGGCTGG
CTGGCCGACC GCTACGACAA TCGCTGGCTG CTGTTCTGGT ACTACGGCCT GCGCGGCCTC
AGCCTGCTGG CGCTGCCGTT CACCGACTTT TCGTTCTACG GCCTGTCGCT GTTCGCGGTG
TTCTACGGAC TCGATTGGGT CGCCACCGTG CCGCCGACGG TGCGGCTGAC GGCGCAGAAG
TTCGGCCCCG AGCGCGCCAA TCTGGTGTTC GGCTGGATCT TCGCCGGCCA TCAGCTCGGC
GCCGCCACCG CCGCATTCGG CGCCGGCCTG TCGCGCGACC TGCTGGCGAG CTACCTGCCG
GCCTTCTTCA TCGCCGGCGC CCTGTGCGTG ATCGCCGCCG CCGCGGCGCT GACGATCAGC
AAGGCTGCCA AGCCGGCGGC GGCCTGA
 
Protein sequence
MISDALAAAL ARRNIHYGWV MVGVTFLTAL VNAGAVGAPG VFIVPLQQEF GWSTADISSA 
LSIRFVLFGL MAPFAAALMN RFGLRQVTLT ALVVVAIGLI ASLAMTALWQ LMLLWGVVVG
VGTGLTALVL GATVATRWFV ARRGLVIGMM TASVATGQLV FLPILASLTE RYGWRIAMAY
VCALIGVAAI AVLIAMRNRP SDVGLRPYGD TSTEPVPLSA QAVPGSIAAA ALGALRDAAK
TRVFWVLFGT FFICGASTNG LVQVHLIPLC ADFNIPQVQA AGLLAAMGVF DFIGTILSGW
LADRYDNRWL LFWYYGLRGL SLLALPFTDF SFYGLSLFAV FYGLDWVATV PPTVRLTAQK
FGPERANLVF GWIFAGHQLG AATAAFGAGL SRDLLASYLP AFFIAGALCV IAAAAALTIS
KAAKPAAA
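
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It builds a codon table by hand rather than relying on any library, on the basis that translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code (it differs in start codons only).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


first_block = "ATGATCTCCG ACGCCCTCGC CGCCGCGCTG"  # start of the gene sequence
last_block = "AAGGCTGCCA AGCCGGCGGC GGCCTGA"      # end of the gene sequence
print(translate(first_block))  # MISDALAAAL
print(translate(last_block))   # KAAKPAAA
```

The two outputs match the first ten and last eight residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a figure to compare against the GC content listed in the record.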