Gene Pars_0999 details

Gene Information

Locus tag: Pars_0999
Symbol:
ID: 5055645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 891413
End bp: 892705
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 62%
IMG OID: 640468556
Product: major facilitator transporter
Protein accession: YP_001153231
Protein GI: 145591229
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
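
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes inclusive 1-based coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(891413, 892705)
print(length, protein_length(length))  # 1293 430
```

Under these assumptions the result agrees with the listed Gene Length (1293 bp) and Protein Length (430 aa).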


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.77003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0952773
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGAGGG CCTTGTTCGC CGTCTTCTTA TCGTCTTTTA TAACTCCATT TATGACAAGT 
GGGTTAAGCG TAGCTCTGCC AATAATTGCA AGCGACTTCT CAGTGGGGTC TGCGGAGGCC
ATGGCGATCT TCGTATTGCT TAACCTGGCA GTGGCCATGT GGGTGCTTCC TTTTGGGCGC
CTTGCCGACA TGTGGGGCCC CCATGTGGTT TTCAGAGCTG GGCTGGGGGT TGCGGGGGTT
GGCTTTCTCC TAGCCGCTTT ATCCCCAACA ATCGGCTTCT TCTACGCCTC GTTGCTAGTC
GCCGGTGTGG GGCTTGCCGC GGTGTTTGGA AGCAACAACG CCTTGATCTT CCGCCTCGTC
CCGCCGGAGA AGAGGGCTGA GGCCGTGGGG CTCAACTCCA TGTCGGTCTA CGTAGGCCTG
GTGGTCGGCC CCGTGCTGGG GGGCGCCGTA GCTCAGCTAT CATGGAGGCT GATCTTAGCC
ATAGGAGCGG CCCTCACCGC GGTGCCCTAC GTAATGGTTA GGAAGTCACC ACGCCCCGCG
GGCGGCGGGC GTTTCGACGT CCTAGGCTCG GCCCTCCTAG CTACCTCCAC AGCGCTAGTA
GTAATAGGCG TCTACGCAGG CTCCCCTAGC TTGGCAGTCC CCGGCCTTCT ACTCCTCGCC
GCGGCCATCT TTGTGGAGTG GAGGTCGCCC TCGCCCGTAC TGGATGTTAG GCTCTTCCGC
AACTACGTAT TTACCGCCTC GCTGGCGGCA GCGCTCCTCA ACTACTTGGC CACCTCTGCG
CTACAACCAT CGCTTAGCCT CCTCTTCCAA AAGGCCTTTG CCATGCCCCC CCACTACGCG
GGCCTCCTAC TTAGCACCCA AGCGGCGGCC ATGGCCCTAT TCTCGCCGAT AGCCGGCAGA
GCGGGCAACC GCCTTCCCCT AGCCGCCCTC GCCGCCGCGG GGTCAGCCAT CCTAGCGGTC
ACGCTCTTCG CCTACTCCGC CGCGCCCAGC CCCGCCACCG CGCCGTTAGC GCTTTCCCTA
ATAGGCGTGG GGTTTGCGCT GTTCATTGTA CCAAATACCA CCATTATACT CTCCGCGGCG
CCGCCGGAGA GGAGAGGCAC AGCTTCAGCA CTGATAGCTG AGGCAAGAGT GGTGGGGATG
GCGCTAAGCA ACGCGGCGGC TGGGCAAATC ATGAAAAATG TGCCGAACAT AACGGCCGGA
GTCTCGTCGG TATTGGCCTT CTTAGCCTAC GTATCGCTTG CAACGCTGGC TCTCTCTCTT
GTCCGGGCGG GGGGCCGCCC GCGCAAACGA TGA
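
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names are hypothetical, and only the first line of the sequence is used as a demonstration input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "GTGAGGAGGG CCTTGTTCGC CGTCTTCTTA TCGTCTTTTA TAACTCCATT TATGACAAGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should give a length matching the listed Gene Length and a GC percentage close to the listed GC content, provided the record is internally consistent.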
 
Protein sequence
MRRALFAVFL SSFITPFMTS GLSVALPIIA SDFSVGSAEA MAIFVLLNLA VAMWVLPFGR 
LADMWGPHVV FRAGLGVAGV GFLLAALSPT IGFFYASLLV AGVGLAAVFG SNNALIFRLV
PPEKRAEAVG LNSMSVYVGL VVGPVLGGAV AQLSWRLILA IGAALTAVPY VMVRKSPRPA
GGGRFDVLGS ALLATSTALV VIGVYAGSPS LAVPGLLLLA AAIFVEWRSP SPVLDVRLFR
NYVFTASLAA ALLNYLATSA LQPSLSLLFQ KAFAMPPHYA GLLLSTQAAA MALFSPIAGR
AGNRLPLAAL AAAGSAILAV TLFAYSAAPS PATAPLALSL IGVGFALFIV PNTTIILSAA
PPERRGTASA LIAEARVVGM ALSNAAAGQI MKNVPNITAG VSSVLAFLAY VSLATLALSL
VRAGGRPRKR