Gene Pars_2140 details


Gene Information

Locus tag: Pars_2140
Symbol:
ID: 5056178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1916517
End bp: 1917533
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 54%
IMG OID: 640469692
Product: ABC transporter periplasmic-binding protein
Protein accession: YP_001154338
Protein GI: 145592336
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4143] ABC-type thiamine transport system, periplasmic component
TIGRFAM ID: [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
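
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1916517
end_bp = 1917533

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1017 bp) and Protein Length (338 aa).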


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATAA GAAATTTATT GATTGTGCTC ATAGCGGTCG CTGGGGTGAT AGTGGCATTC 
CTAACGATAT CTGCTTTTCA GCAACCGCAG ACAAAGAAGC TCGTTATTGT CGGCCCCGCG
GGCATAAGCG ACTTAGGGCA GGCGTTGGCG AAAAAGTTCA GTGAGAAGTA TGGGGTGAAC
GCCACGTTTA TCCCACTTGG TGGGGCGGTA GAGATGGTGA ACGAGCTGGT GAGGAACAAG
GACAACCCAC CATGGGACGT CGCCATTGGG ATTCCGGAGT TTTACTACAC TGTGTTAGTG
GAAAGGGGTG TTTTACACTG TCCCAAGTTG TCTGTGGAGG GAGTCCCCCC CGAGGAGTTT
TGGGATCCCA ACGGTTGCGT GTACCCGCTG GATAAGTCCT ACATCGGCAT TGTATACAAC
GCCACTGCCC TCGAGAGGCT TGGGCTAAGG CCTCCAGAGA CGCTGGACGA CTTACTTAGG
CCGGAGTACA AGGGGCTAGT AACATATCCC AACCCAGTCC AGTCGGGCAC CGGGCTTGCC
GTCCTCTCGT GGATAATGTC AGTAAAGGGA GAAGAGGCAG GGTGGACATA TTTGAAACAG
CTCAGCGGCC AAATAGCCAA GGTGGGGTAT CCCAGCGGCT TTACGGCTTT GAGAAGCGCC
CTAAAACGTG GCGACGTGGT TATAGCGCTT TCATGGTACA GCCACGTGAT TGATCCTGGA
ACCCCTCACA TGAGGGCTGC CACTTACAGC GCCTTTTTGT ATAGAGAGGG CGTGGCAGTG
CTTAAAAATG CCAAAAACCG CGACTTAGCC GTGGAATTTG TTAAATTCGC CCTAAGCAAA
GAGGGGCAGG ACCTAGTAGA TCCTTACAAC TACATGCTCC CCGTCAGGGC AGACGCCGTG
GTGAAGAACA ACGTGGGCCT TCCCCAGCCG CGGTCGGTGG TGGTCTACAA CCCTGCCCTC
GGCTCCAAGG CAGACGAGTG GAGGCTGAGG TGGCAGAGGG AGGTCGCTTC AGGTTAA
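
The GC content field in the record can be recomputed from the gene sequence. The helper below is a minimal sketch, not the method the database uses: it strips the whitespace that separates the 10-base blocks and returns the fraction of G and C bases. The function name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the percentage listed in the record.
first_line = "ATGTCCATAA GAAATTTATT GATTGTGCTC ATAGCGGTCG CTGGGGTGAT AGTGGCATTC"
print(f"{gc_content(first_line):.1%}")
```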
 
Protein sequence
MSIRNLLIVL IAVAGVIVAF LTISAFQQPQ TKKLVIVGPA GISDLGQALA KKFSEKYGVN 
ATFIPLGGAV EMVNELVRNK DNPPWDVAIG IPEFYYTVLV ERGVLHCPKL SVEGVPPEEF
WDPNGCVYPL DKSYIGIVYN ATALERLGLR PPETLDDLLR PEYKGLVTYP NPVQSGTGLA
VLSWIMSVKG EEAGWTYLKQ LSGQIAKVGY PSGFTALRSA LKRGDVVIAL SWYSHVIDPG
TPHMRAATYS AFLYREGVAV LKNAKNRDLA VEFVKFALSK EGQDLVDPYN YMLPVRADAV
VKNNVGLPQP RSVVVYNPAL GSKADEWRLR WQREVASG
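
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table in pure Python, using the amino acid assignments of NCBI translation table 11 (the table named in the record), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and the start codon is simply looked up in the table rather than forced to methionine, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both start in frame because
# every full line holds 60 bases.
first_line = "ATGTCCATAA GAAATTTATT GATTGTGCTC ATAGCGGTCG CTGGGGTGAT AGTGGCATTC"
last_line = "GGCTCCAAGG CAGACGAGTG GAGGCTGAGG TGGCAGAGGG AGGTCGCTTC AGGTTAA"
print(translate(first_line))  # MSIRNLLIVLIAVAGVIVAF
print(translate(last_line))   # GSKADEWRLRWQREVASG
```

The two printed fragments match the beginning and end of the protein sequence listed above.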