Gene Pars_1575 details

Gene Information

Locus tag: Pars_1575
Symbol:
ID: 5055115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1425092
End bp: 1426228
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 57%
IMG OID: 640469116
Product: major facilitator transporter
Protein accession: YP_001153781
Protein GI: 145591779
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID:
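
The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues: total codons minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1425092, 1426228)
length_aa = protein_length(length_bp)

print(length_bp)  # 1137
print(length_aa)  # 378
```

Both results agree with the Gene Length and Protein Length fields of the record.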


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000133383
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAGCACAT CAAAGCCTAA GCTCCTCGCC TACGCGCTCT TCTCAGCCCC GTACTCATTC 
GTCGTATTCC TGTTACCTTT CTATGTGTTT GAGGTAGGAG GCGACAAAGC CGCGGTCGGC
ATATCCTTTG CCATGTACGC CCTAGCCATT GTCGTGTTCC GTCCCGCCGC CGGCTGGCTT
GCAGACGTAG TGGGTAGACG TGTAACGAAC ATAGCCGGGG GCGTGGCGCT TGCCGTGGCC
ATGGCCATTT TGGGCATATC AACGGAGTTG TCGCACATAT ACACAGCCTT GTTTCTAGCA
GGGGTGGGAT CTAGTCTAAT AAACGTAGCG ATGATAGCCT ACGTGTCAGA CGTAGGGGGC
TTAGAAAATC CCACACTGTA CTCAAAAATG AGGATTGCCG CCGCCTTAGG AGCCGTTGGC
GGAGGGTTCT CGATCCCTGC CGCCTATGTC TTATCTAAGG CTTGGGGATA CGCTGCGGCA
TTCAAAGCGT TGGCCGTAGC GATGTCGCTT ACAACAGTTG TCGCTCTCGC TCTCGTCCCC
GAGGAGACAG CCCGCCTCGC GTTGCGCCAC AAATCAGGAG ATGTGGCGGC GGCGTTCTGC
ATAACGGCGA TGGGGTTTTT CATAGGCGCC GCCACGGGGG TGTACGGCCC CCAGATCCTG
CCCTATATCT ATGCCAAATT CTCCTTATCT CCCTTCGCCG CGGTGTTGGT ATACCTCCCG
GCCGTTGTTG CATGGCTGAT AGGGCCAAAA CTGGCGAGGC CCACGGCGCT TTCCGCAATC
ATAGGCGGTG CGTTAATGTC GGCGGCCCTC GTTGCGATGT ATCACTCGCC TAATCCTGCC
CTATTTTCCG CCGTCTGGCT CGCGGAGAGC CTCGGCATCG CGATAGTGTC GACATCGCTG
GACCAAGCGC TATCGAGGCA CGTAAAGGGA GCATACTGGG GGCGTGGATA CGGCGTATAT
CAATCAGTTT ACAACATGGG CTATGCCTTG GGGGCGGCCG CATCAGGATT TCTGCCGAAT
CCGTTCTACA CCGCCCTCCT ACCACTTGTT GCCTTCTTCG CACTAGCCGC TATATGCCAA
GGTAGCCGAC GAATATCCCC AACGGGATCA CCAGCCCCAC GGCTAGGTTT AGGTTGA
 
Protein sequence
MSTSKPKLLA YALFSAPYSF VVFLLPFYVF EVGGDKAAVG ISFAMYALAI VVFRPAAGWL 
ADVVGRRVTN IAGGVALAVA MAILGISTEL SHIYTALFLA GVGSSLINVA MIAYVSDVGG
LENPTLYSKM RIAAALGAVG GGFSIPAAYV LSKAWGYAAA FKALAVAMSL TTVVALALVP
EETARLALRH KSGDVAAAFC ITAMGFFIGA ATGVYGPQIL PYIYAKFSLS PFAAVLVYLP
AVVAWLIGPK LARPTALSAI IGGALMSAAL VAMYHSPNPA LFSAVWLAES LGIAIVSTSL
DQALSRHVKG AYWGRGYGVY QSVYNMGYAL GAAASGFLPN PFYTALLPLV AFFALAAICQ
GSRRISPTGS PAPRLGLG
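
The GC content field can be recomputed from the gene sequence listed above. The following is a minimal Python sketch, assuming the sequence is pasted as whitespace-separated blocks of ten bases. `clean_sequence` and `gc_percent` are hypothetical helper names. Only the first line of the gene sequence is embedded here for brevity, so the printed percentage describes that line rather than the whole gene. The start-codon check reflects that translation table 11 permits TTG as an initiation codon, read as methionine, which matches the protein sequence beginning with M.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record (illustrative subset only).
FIRST_LINE = "TTGAGCACAT CAAAGCCTAA GCTCCTCGCC TACGCGCTCT TCTCAGCCCC GTACTCATTC"

# Initiation codons allowed by NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

seq = clean_sequence(FIRST_LINE)
print(len(seq))                      # number of bases in this line
print(round(gc_percent(seq), 1))     # GC percentage of this line only
print(seq[:3] in TABLE_11_STARTS)    # whether the first codon is a valid start
```

To check the full 57% figure, paste the entire gene sequence into `FIRST_LINE` in place of the single line used here.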