Gene Pars_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0161 
Symbol 
ID5056356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp147036 
End bp148256 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content60% 
IMG OID640467740 
Productadenylosuccinate lyase 
Protein accessionYP_001152428 
Protein GI145590426 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.576792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATGT ACACCTCGCC TTTTGATTGG CGCTATGGGT CGGAGGAGAT GCGCCGCCTC 
TTCACGCCGC AGGCGTTTAT CGACACGTAT CTGGAAGTGG AGAGGGCGCT TGTCTGCGCC
TTGGAGGAGC TGGGGATAGC TGAGAGGGGG TGTTGCGAAG CCGTAAGCAA GGCGCGGGTA
GGCGCTGAGG AGGTATACGC CTTGGAGAAG GAGACGGGCC ACGACATCCT CAGCCTAGTA
CTGTTGCTGG AGCAGAGGAG CAACTGCCGC TTCGTGCACT TCGGTGCCAC CTCAAACGAC
GTTATAGACA CGGCGTGGGC CCTCCTGATA AGGCAAGCGA TCTCTCTGGT CAAGGAAAAG
GCCAAGGCTG TGGGGGAGGA GCTGACCCGC CTGGCGAGGA GGTACAAGGA GCTTGAGATG
GTGGGGAGGA CCCATGGCCA GTGGGCAGAG CCAATCACTC TAGGCTTCAA GTTCGCAAAC
TACTACTACG AGCTGTACAT CGCGTGTAGG CAGCTGGCGC TGGCCGAGGA GTTTGCAAGG
GCTAAGATCG GCGGCGCAGT GGGCACCATG GCCTCTTGGG GGGAGCTGGG CCCCGAGGTG
AGGAGGCGGG TCGCCCAGCG GCTGGGTCTG CCGTACCACC CCATTACGAC GCAAGTGGCG
CCGCGGGAGG CCTTCGCCGT CCTCGCCTCG GCGCTGGCGC TGATGGCCGC GGTGTTTGAG
CGCCTAGCCG TGGAGATAAG GGAGCTTTCT AGACCGGAGA TCGGGGAGGT GGTGGAGCGG
GGCGGCGGCT CTTCGGCCAT GCCCCACAAG GCAAACCCCA CGGCGTCTGA GCGCATCGTG
AGCTTGGCGA GACACATCAG GGCGCTACTC CACGTCGCAT ATGAGAACAT AGCGCTTTGG
CACGAGCGCG ACTTGACAAA CTCGGCAAAC GAGCGGGTTT GGATCCCCGA GGCCTTCCTC
GCCGTCGACG AGATCTTAGC CACGGCATTG AGGGTGTTGC GCAATGTGTA CATAGACGAG
GCAAGGATTC AAGAAAACTT GCAGAAGGCC CTACCCTACA TCTTGACGGA GTTCCACATG
CTAAGGATGA TAAGAGAGGG GGTAAGCAGG TCTGAGGCTT ATAAGAAGGC CAGGGAGATA
AGGGCTGTTG TGTACGACTA CCAGCGCTGG CCTGTGGATA AGCTAATTGA GGACGCCCTT
TCCCTAAAGC TTTGCGAATA G
 
Protein sequence
MGMYTSPFDW RYGSEEMRRL FTPQAFIDTY LEVERALVCA LEELGIAERG CCEAVSKARV 
GAEEVYALEK ETGHDILSLV LLLEQRSNCR FVHFGATSND VIDTAWALLI RQAISLVKEK
AKAVGEELTR LARRYKELEM VGRTHGQWAE PITLGFKFAN YYYELYIACR QLALAEEFAR
AKIGGAVGTM ASWGELGPEV RRRVAQRLGL PYHPITTQVA PREAFAVLAS ALALMAAVFE
RLAVEIRELS RPEIGEVVER GGGSSAMPHK ANPTASERIV SLARHIRALL HVAYENIALW
HERDLTNSAN ERVWIPEAFL AVDEILATAL RVLRNVYIDE ARIQENLQKA LPYILTEFHM
LRMIREGVSR SEAYKKAREI RAVVYDYQRW PVDKLIEDAL SLKLCE