Gene Pars_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1765 
Symbol 
ID5054401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1585323 
End bp1586738 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content49% 
IMG OID640469310 
Productmajor facilitator transporter 
Protein accessionYP_001153968 
Protein GI145591966 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.719018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000452829 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGATAGCT ACGATATTAG ATACGCCTGG AGAGCAACGC CACTTTTAGG CTCTGTCGCT 
CTTTTGGTAA TGTACACAGA GGCCATGTTG ATGCCGAGCC TCCCCAAGAT ACAGGCCGAG
TTTAACGTCA CTCCCGCAGA CGCCTCTTGG ATTTTGACAA TATACCTAAT TTCTGGCACT
ATAAGCGCCG CTATTTTTGG AAACCTCGGC GACATATACG GGAAGAAGAA GGTGCTTTCT
ATCGTGATGG CAGCCTATGC GGTAGCGGTG ACCTTTACAG GCTATGCCCC GAATTTCGGA
TCTTTGCTAC TCTCGCGTGC CATACAAGGC ATGGGAATGG CGATGTTCCC ACTGGCTTTT
TCGCTTATCC GAGAAGAATT CCCGCCACAC ATGGTGCCCA CGGCCCAGGG AGTTGTAAGC
GCGATGTTCG GTGTAGGTAT TATAATAGCG TTGCCCGTCG GAGCCTATAT AGCTCAGAAC
TACGGGTGGA GAGCCACATA CCACACAGCG ACGCCAATAG CCGTCTTGCT CACCTACTTA
ATAGTCACCT ACATAAGAGA GAGTCGGTAC AGAACGCCCA GGAAAATCGA CTTTGTAGGA
GTCGCCCTCT TCTCCTCTAT GGCGGCGTCA TTTCTGCTCG CCATATCTAA AGGCCCAGAT
TGGGGGTGGT TTTCGCCAAG GATCACCTCG TTGTTTATAC TTTCGGCTGT GTCTGCCGCC
GTTTTTGTAA TCCACGAACT GATCACAGAC AGCCCCTTCA TACCGAGAGA TATCTTTAAC
AGAAACGTAA TAGCGGCGAC GATCGCAATT CTAATAGTGG CATACGCGTT TCAGATGAAT
TCCCAAAATT TGTCGTACCT ATTCCAGATG CCGCCGCCTT ACGGCTATGG GCTAACAATT
CTGCAGACCG GTCTCTACAT GTTGCCTCCA GCTATGGTTC AGATAATTGT CGCCCCGCTT
TCTGGTAGAT TAATGTGGAG GCTTGGGGCA AAGAGAATTG CTTCACTCGG CGTTGTTTTC
GCCGTGGTTG GCTACCAGCT AGCCGCCGCA CACCTCTACA GCGGCGTATG GACGCTAATC
TCATACATGA CTCTGGGCTT TGTAGGATTG ACCTTGTTAA ACGTCTCACT TATAAATCTC
CTCACGTTCT CTGTGCCTAG AGAGAGACTG GGCGCCGCCA CCGGCCTCAA CACTGTTTTT
CGCAATTTTG GCTCAGCTAT CGCTCCCACC GTTGCAGGTA CAGTATTGAC AAACTTTAAT
ACCTATATCT ACTACAATAC ACCAGTGGGA TTGGTCTACT TCTCTGTGCC TTCAAAAGAG
GCGTATATAA TAAACATCGA CATTGCCACC ATTATGTTTA TCATATCGCT TGTACCAATA
TTAATATCAA AAGAAATTCT AAGACTGAAC AAGTAG
 
Protein sequence
MDSYDIRYAW RATPLLGSVA LLVMYTEAML MPSLPKIQAE FNVTPADASW ILTIYLISGT 
ISAAIFGNLG DIYGKKKVLS IVMAAYAVAV TFTGYAPNFG SLLLSRAIQG MGMAMFPLAF
SLIREEFPPH MVPTAQGVVS AMFGVGIIIA LPVGAYIAQN YGWRATYHTA TPIAVLLTYL
IVTYIRESRY RTPRKIDFVG VALFSSMAAS FLLAISKGPD WGWFSPRITS LFILSAVSAA
VFVIHELITD SPFIPRDIFN RNVIAATIAI LIVAYAFQMN SQNLSYLFQM PPPYGYGLTI
LQTGLYMLPP AMVQIIVAPL SGRLMWRLGA KRIASLGVVF AVVGYQLAAA HLYSGVWTLI
SYMTLGFVGL TLLNVSLINL LTFSVPRERL GAATGLNTVF RNFGSAIAPT VAGTVLTNFN
TYIYYNTPVG LVYFSVPSKE AYIINIDIAT IMFIISLVPI LISKEILRLN K