Gene Pars_0961 details

Gene Information

Locus tag: Pars_0961
Symbol:
ID: 5055929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 852429
End bp: 853391
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 52%
IMG OID: 640468517
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001153193
Protein GI: 145591191
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
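
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(852429, 853391)
print(length, protein_length_aa(length))  # prints: 963 320
```

Both values agree with the Gene Length and Protein Length fields of this record.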


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.233776
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.000990289
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGGCCC TAGTAAAGGC CGAGAGACTA AGCAAGTACT TCCCAGTAGG CGGGTTGCTG 
AGACCGCGCG GCTACGTCAA GGCCGTCGAC AACGTTGACT TGGAGATCTA CGAGGGGGAG
ACCCTTGGGC TGGTTGGCGA GACGGGGAGT GGGAAAACCA CTTTAGGGAG ACTCATACTT
AGACTAATAG AGCCCACCTC GGGCAAGATA TACTTCGACG GCGCCGACGT CACGAAACTT
TCAGGTAAGG AGCTGGCCAC GTTTAGGAGA AAGGCTCAGA TAATATTCCA AGATCCCTAC
ATGTCGCTTA ACCCGCGCTT TACTGTTTAC CAAACACTCC TCGAAGTAAT TAAAGTTCAC
AAACTGCCTA TACAAGACCC AGAAGAGCAC ATAGGCAAAA TGCTGGAGCT TGTGGGGCTC
GAGAGGAGCC ACCTCCACCG CTACCCGCAC GAGTTCAGCG GAGGCCAGAG ACAGAGAATT
GCGATACTCA GGGCACTGAT ACTTGAGCCC AAGTTCCTAG TACTCGACGA GCCGACCTCG
GCGCTCGACG TGTCGGTACA GGCCCAGATT TTAAACATGT TGAAGGACCT TCAGAGGCGC
CTCGGCCTTA CATACTTATT CATTAGCCAC GATATAGGAG CCGTCCGATA CATGAGCAAT
AGAATTGCAG TAATGTATAT GGGCAAAATC GTGGAAATAG GCCCCGTAGA CGCCGTAATT
AAGGAGCCTT TACACCCATA CACCCAAGCC CTCATTTCGG CGCTTCCAGT GCCAGACCCC
AAGATAGCTA GGAGCAAAAA GACGGTGCTC TTGCAAGGAG AACCGCCAAG CCCAATAAAT
CCGCCTGCCG GTTGCCGCTT CCACCCCCGC TGCCCCTACT TCATAAAAGG AAAATGCGAC
GTGGAAGAGC CCCAGTTGAA AGAGGTAAAA AGTGGCCACT ACGTCGCGTG TCACCTATAC
TAA
 
Protein sequence
MKALVKAERL SKYFPVGGLL RPRGYVKAVD NVDLEIYEGE TLGLVGETGS GKTTLGRLIL 
RLIEPTSGKI YFDGADVTKL SGKELATFRR KAQIIFQDPY MSLNPRFTVY QTLLEVIKVH
KLPIQDPEEH IGKMLELVGL ERSHLHRYPH EFSGGQRQRI AILRALILEP KFLVLDEPTS
ALDVSVQAQI LNMLKDLQRR LGLTYLFISH DIGAVRYMSN RIAVMYMGKI VEIGPVDAVI
KEPLHPYTQA LISALPVPDP KIARSKKTVL LQGEPPSPIN PPAGCRFHPR CPYFIKGKCD
VEEPQLKEVK SGHYVACHLY
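
The relationship between the two sequences above can be checked by translating the gene sequence and comparing the result with the protein sequence. What follows is a minimal sketch, not the database's own method. It assumes the sequence is pasted as space-separated blocks of ten bases, as displayed here, and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in the permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First thirty bases of the gene sequence in this record.
print(translate("ATGAAGGCCC TAGTAAAGGC CGAGAGACTA"))  # prints: MKALVKAERL
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.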