Gene Paes_1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1578 
Symbol 
ID6460123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1718727 
End bp1720043 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content51% 
IMG OID642725566 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002016243 
Protein GI194334383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0877587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA ATGCCTCCCC AAAGAGAATC GGGCCCATCG AGCTGGCACC GACCATACAT 
CGCAGCCATG CCTGGACGTT TTTCTATGCC GCATTTTTTT CCATCGGCTT CATCACCTTT
CTCTCGATAG GCCAGACCTA CATTCTCAAT GTCCATCTCA ACATCCCGGT GTCTGAGCAG
GGAGCAATCA GCGGCGACCT GGTCTTTCTG ACAGAACTGA TTACGCTGGT ATTTTTTATT
CCTGCCGGCA TTCTCATGGA CCGTATAGGA AGAAAGCCTG TCTATGTGGC AGGGTTCCTG
CTCATCGCCG CAACCTATAT CCTCTACCCT TTCGCTTCGT CAGTTAACGA TATGATGCTC
TATCGCATCA TCTATGCTCT TGCCGTTGTT GCCATAGCCG GATCACTCTC GACAGTGCTT
GTCGACTACC CCGCAGACCG GTCACGAGGC AAAATGGTAG CGATCGTCGG TCTCCTTAAC
GGGCTTGGAA TTGTCATCAC CAACCAGTTT TTCGGATCCC TTCCCGAAAT GCTCACCATA
AAAGGAGTCG ACGCAATCCA GGCAGGCTTT ATAACTCACT TCTCGATTGC GGCTCTGGCA
GTCCTTGCGG CAGTCATCTG CGCCATCGGC CTGAAAAAGG GAACCCCTGT CACCGAAGAA
GAACGCCCTG CGCTCAAAAC GCTTTTGCAA AGCGGACTCG TCGCAGCCAA AAATCCGCGA
ATTATGCTCT CCTATACAGC TGCGTTTATT GCACGAGGCG ATCAGTCCAT CAACGGAACG
TTCATCAGTC TCTGGGGAAT TACCGCTGGC CTGGCTATGG GTATGGAGTC CGGCGAAGCA
TTCAGAAAAG GGACAACTAT TTTCATCATC ACTCAGGTAG CAGCACTGCT CTGGGCGCCT
CTGATCGGCC CGGTCATTGA CCGTTTCAAC CGGGTCAGCG CACTCGGATT CTGCATGTTT
CTGGCCATGA TCGGCAATCT GTCGGTTCTT GTGCTTGATC ACCCGTTTCA AAATATCGGC
TATCTGGTCT TTATTCTCAT GGGAATCGGA CAGATCAGCG TTTTCCTCGG CGCCCAGTCA
CTGATCGGTC AGGAAGCCCC TAAAGCGACA CGAGGTTCGG TAATCGGCGC ATTCAATATC
AGTGGAGCTA TTGGGATTCT GCTTATCGCT TCGGTCGGCG GACGAATGTT TGACGGCATA
AGCCCAAAAA CACCTTTTGT CATTGTAGGG ATTATCAATG CCTTACTGGT AGTCTACAGC
ATCTATGTGC GCATCAAAGC TCCGCACAAG CTTGAAACAA GCCCTAAAAG CGCATGA
 
Protein sequence
MNNNASPKRI GPIELAPTIH RSHAWTFFYA AFFSIGFITF LSIGQTYILN VHLNIPVSEQ 
GAISGDLVFL TELITLVFFI PAGILMDRIG RKPVYVAGFL LIAATYILYP FASSVNDMML
YRIIYALAVV AIAGSLSTVL VDYPADRSRG KMVAIVGLLN GLGIVITNQF FGSLPEMLTI
KGVDAIQAGF ITHFSIAALA VLAAVICAIG LKKGTPVTEE ERPALKTLLQ SGLVAAKNPR
IMLSYTAAFI ARGDQSINGT FISLWGITAG LAMGMESGEA FRKGTTIFII TQVAALLWAP
LIGPVIDRFN RVSALGFCMF LAMIGNLSVL VLDHPFQNIG YLVFILMGIG QISVFLGAQS
LIGQEAPKAT RGSVIGAFNI SGAIGILLIA SVGGRMFDGI SPKTPFVIVG IINALLVVYS
IYVRIKAPHK LETSPKSA