Gene Pars_0640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0640 
Symbol 
ID5054913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp567655 
End bp568725 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content45% 
IMG OID640468199 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_001152883 
Protein GI145590881 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCA CTACAGTATT AGTAATTGGG ATAATCATAG TCGCCCTAGT AGTTGCAGTA 
GTTGCGTTGT TGTCACAACC TGCTTCTACT CCTACTCCCA CACCTAGTCA AACATCCCAG
CAACAAACTC AACAGCCACC CCCTACACGT TATAGTGTAA TAATAGCTAC AGGAGGGACA
GGAGGCGTCT ACTACTACTA TGGTGGGGTA ATTGCGGGGA TTCTAAAGAA CTATACAAAT
ATAGACGCAA CTTCTATTCA GACGGCAGGC TCTATTGATA ACTTACTCCT AATTAGAGAC
AAAACCGACC CCAAGCGGGG GATTTACTAC TGCGCCACGA CACTACCAGA GTCGGCTTAT
CTAGCTTACA CGGGACAACA TGAGAAATTC AAAGACAAAC CTGCACCTAT TGCTATACTG
TGGGCTATGT ATCCCAACTA CCTACATATT GTGACTAGGA GCGACTCGGG GATTAAGTCT
ATATACGACT TAAAAGGCAA ACGCGTCTCC ACAGGAGCTC CTGGAAGCGG CACCGAAATT
GAGGCTCTCC TTGTATTACA GATATTAGGC ATAGACCCTG CTAAAGACTT CTCAAAATGG
GAGAGGCTAG GCGCTGCTGA GAGCGCCGAC GCTTTAAAAA GCGGCACAAT TGACGCCTAT
TTCTGGAGCG GCGGCCTACC CACGTCCTCA ATTGTAGAGC TTGGAGTATC ATTAAAACAA
CAAGGCGTGT CGCTGGTGCT AATAGAGATA CCAGGTGAAG TTATTAATGC GTTCACCCAG
AAATTCCCAG GAGTCGCTAC CAAAGGCGTG ATACCGAAAA GCGTCTATGG TACTGAAAAA
GACACTCAAA CTTTAACTTT TTGGAATATG TTTGTATGCC ATAAAGATAT GCCGGATGAC
TTGGCGTATC TCATTACAAA AACTGTATTT CAACACCTTG ACATACTACA AGCTTCTGTA
AAAGCCGCAA AAGATACAAA TCTTCAGAAT GCGCTTCTCT ACTACGGCGG GAGTATACCG
TATCACCCAG GCGCCCTTCG CTACTACAAA GAAGTTGGCG TATTAAAGTG A
 
Protein sequence
MKSTTVLVIG IIIVALVVAV VALLSQPAST PTPTPSQTSQ QQTQQPPPTR YSVIIATGGT 
GGVYYYYGGV IAGILKNYTN IDATSIQTAG SIDNLLLIRD KTDPKRGIYY CATTLPESAY
LAYTGQHEKF KDKPAPIAIL WAMYPNYLHI VTRSDSGIKS IYDLKGKRVS TGAPGSGTEI
EALLVLQILG IDPAKDFSKW ERLGAAESAD ALKSGTIDAY FWSGGLPTSS IVELGVSLKQ
QGVSLVLIEI PGEVINAFTQ KFPGVATKGV IPKSVYGTEK DTQTLTFWNM FVCHKDMPDD
LAYLITKTVF QHLDILQASV KAAKDTNLQN ALLYYGGSIP YHPGALRYYK EVGVLK