Gene Pars_1543 details


Gene Information

Locus tag: Pars_1543
Symbol:
ID: 5054172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1398677
End bp: 1399825
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 58%
IMG OID: 640469084
Product: major facilitator transporter
Protein accession: YP_001153749
Protein GI: 145591747
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
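
The coordinate and length fields in this record can be checked against each other with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes one stop codon; both are assumptions about the database's conventions, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
START_BP, END_BP = 1398677, 1399825
length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

With the values in this record the sketch gives 1149 bp and 382 aa, matching the Gene Length and Protein Length fields.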


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0259552
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGGG GGGAGTTTAA AATTGTAGTG CTGAGTGGCC TCGGCTGGAT GTTCGACGCC 
ATGGACGTCT TGATACTCTC ATACTTACTC GTGGCGATGA GAGAAGAGCT GGCGCTGGAT
CGCGCGGCAT CGACGTGGAT CGTTCTGGCA AATAACTTAG GCATGTTCCT CGGCGCCTTC
CTCTTCGGGA AGCTCGCCGA CGTCGTGGGG AGGAAAAAGG TGTTTATGGC CACTATGTTG
CTTTACAGCA TTGCCACCGC GGCGTCCGCC GCCGCTAGGA CGTGGCAGGA GTTCGCCGCA
ATTAGGTTCT TCGTGGGAGT TGGGCTAGGC GGCGAGTTGC CCGTGGTGGC CACGTACGTC
TCGGAGAACT CCCCACCTGA GAGGAGGGGG AGAAATGTGG TTCTCCTAGA GAGCTTCTGG
TCGATAGGCG CTCTCCTCGC CGCCGCCGTG TCGCTCTTTA TCTTCACCAC ATTAGGGTGG
AGGACGGCGC TTGTGTTGAT GGGGGCCACA GCCTTCTACG TCTTCGTAAT ACGCTCCGCC
CTCCCGGAGT CGCAGAGGTG GCTGGAGAGG ATCAAAGAGG GAGCCTCGGC GGAGCTTAAG
CCTTACGCCG CGAGACTCGC CATAGCTTCA GCCATTTGGT TCCTCCTAGC CTTTGGCTAC
TACGGCGCGT TTATCTGGTT GCCCACAATG CTCAGGACAG AGAGAGGCTT CACACAGGTG
GCCACCTACG AGTTCATGTT TTTGACAACC ATCGCCCAGC TCCCGGGCTA CTTCTCAGCG
GCATACCTCG TGGAGAGAGT GGGCAGGAGG CCAATAGCGG CGGCGTACTT CGTAGCCTCG
GCTCTATCTG CGGTTTTGCT GATATACAGC ACGTCGTACG CCCAGCTCTT CTACGCGGCC
CTCGCACTCA ACTTCTTCAA CCTCGGGGTC TGGGGTGTCG TGTACGCATA CACCCCCGAG
CTTTTCCCCA CTTCTATAAG GGGCCTTGCG ACAGGTCTAG CGGGCTCAGC CGCAAGGATC
GGAATGATTA TTGGACCTAC GCTGTATCCG CTTTGGGCCT CCGTAGCATT CATAGGCGTC
GCAGTTGCGT GGCTAATAGC GTCAGCCCTA GTAGCGCTTT TGCCCGAGAC AAAAGGCCGT
GAGGTGTAG
 
Protein sequence
MTRGEFKIVV LSGLGWMFDA MDVLILSYLL VAMREELALD RAASTWIVLA NNLGMFLGAF 
LFGKLADVVG RKKVFMATML LYSIATAASA AARTWQEFAA IRFFVGVGLG GELPVVATYV
SENSPPERRG RNVVLLESFW SIGALLAAAV SLFIFTTLGW RTALVLMGAT AFYVFVIRSA
LPESQRWLER IKEGASAELK PYAARLAIAS AIWFLLAFGY YGAFIWLPTM LRTERGFTQV
ATYEFMFLTT IAQLPGYFSA AYLVERVGRR PIAAAYFVAS ALSAVLLIYS TSYAQLFYAA
LALNFFNLGV WGVVYAYTPE LFPTSIRGLA TGLAGSAARI GMIIGPTLYP LWASVAFIGV
AVAWLIASAL VALLPETKGR EV
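
As a cross-check of the two sequences in this record, the gene sequence can be translated and its GC fraction computed. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the amino-acid assignments of NCBI translation table 11 (the same assignments as the standard code; the two tables differ only in the permitted start codons). The names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 and last 9 bases of the gene are used here for brevity.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGACGCGGG GGGAGTTTAA AATTGTAGTG"  # start of the gene sequence
last_9 = "GAGGTGTAG"                          # end of the gene sequence
print(translate(first_30))  # MTRGEFKIVV
print(translate(last_9))    # EV
```

The two fragments translate to the first ten and last two residues of the listed protein sequence. Passing the full 1149-base gene sequence to `translate` should reproduce the whole protein, and `gc_fraction` on the full sequence can be compared with the GC content field.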