Gene Pars_1198 details

Gene Information

Locus tag: Pars_1198
Symbol:
ID: 5055983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1085469
End bp: 1086944
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 63%
IMG OID: 640468746
Product: transglutaminase domain-containing protein
Protein accession: YP_001153419
Protein GI: 145591417
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
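
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1085469, 1086944)
print(length)                  # 1476
print(protein_length(length))  # 491
```

Both values agree with the Gene Length and Protein Length fields of this record.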


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.135202
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.356317
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGGATTT TGCTCGCGGC GGCGGGGCTC GTGCTGGTAT ATGCGGGTAT TGTCTCTTTG 
GTAAGTCCAG GCGAGCCCGC CGCCTCTCCA CAACCTTGGG GATTCAGCAG AGGCATAGCC
GTGAGGCCCC CGGAGGTGAG CTTGTCGGGT GGGCTGTACT TCAAGACCTA TCTCTGGGCC
CGGGGCGGGG GCGGGGTCTT GTACCTACGT TGCTACGTGT ATAGTAGGTA CGAGGGGGGG
ACGTGGTTGC CAACTCCGGA GGAGTACCCA GTCCTCGGCG TCTACTCCGT GACGGTGGGG
AGAGGTCCTT TTACAGAGGG CGGAGCTTTG AACTTAACTC TGCCGCTTAT GGGCGGCTGC
GTCCCGGTGG CTACACCTTC TGTAGACGGG CTTCAGCTGG GGTCGATAAA GGTGTCCGCT
CCCGGCGCCA ACTTAGCCGC CAGCCGCACA GGCCTGTACG TAGCATACGC CGGGGGGAGA
TTGGGCGAGG TGACCTCTTA CTACGGCCCA GGTGATTTGC CGCCTTCGCC GGATGACTTG
TCGCTCCCTT CCGGGCAGGC CGGGGTTCTG CTGGCTCTTG CCCGAAACAT AACCGCCGGT
TGTGGAGATG TGTCGTGTAA GGTTGAGAGG ATTAAAGAAT TTCTAAAGGG GTTTACCTAC
GACGGGACAA TGGATGCGCC ATGGCCCCAT ATCCCCCCCG GCGTGGACCC CTTGATGTGG
TTTCTCCAAA ACAAGAGGGG GGTCTGCGTC CACTTCGCCA CAGCCTTTGT AATGTTGGCG
AGGGCTTCCG GGGTGTATGT CAGGCTCGTG GTTGGCTACA TGAGCGATGG CCCCGTGCCG
ACGCAGTGGT CGCTGACGGC CTTCAGCCCC CACGCCTGGG CTGAGTACTA CCAGCCGGGC
GTGGGCTGGA TCGGGGTAGA GGCGACTCCT CCCATGGGCG CCCCATCGCC GGCGGCGCCG
CCGCCTCGCG AGACGCCTAC CCCAGCCGCG ACGACGCCGG AGGTACCGCC CGCCTCGCCG
GGTCCCTACC AGTGGCCATC TATTGGCCTC GGCGTGTTTC TGCCTCTGGC TGGGATGGCC
GCAGCGGCGC TGGTGGGTGG GGCCCTCTTC AAGAAGAGGG TGGTAATCAC TGTGGGGGAG
GCGCTGAGGG TGGGCGCACC CAGGGGGTTT TGGGTGTATG TTAACAGAAG GCGAGTGGGA
CGCGCGCCTG TGGAGATCGT CTTCGACAAG CCTGGCCTCT ACCTCGTAGC GGTGGGGCCC
TTTGTGCGGG TGGTGAGGGT TGTGGACTAC AGGTCTATGG CTGGGAGGGC TTTTGAGAAG
CTCCTCAAGA AGCTTAAACT CCCCCCTTCA GCTACGCCGC GGGAGGTCGC TGAGAGGTTT
CCGCAGTACC GCGAAGTGGC GCTTCTCGTC GAGAAGCTTA GATTCGGCCC ACGGGCCGAC
AATGAAGACT ACAGGCGGCT AAGGGAGATG TTATGA
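
A GC fraction like the one reported for this gene can be computed directly from a sequence in the space-grouped layout used here. This is a minimal sketch, not the database's own method: `gc_fraction` is an illustrative name, and the example runs on the first line of the gene sequence only, so its result is not expected to equal the gene-wide figure.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 bases).
fragment = "ATGCGGATTT TGCTCGCGGC GGCGGGGCTC GTGCTGGTAT ATGCGGGTAT TGTCTCTTTG"
print(f"{gc_fraction(fragment):.1%}")
```

Passing the full 1476 bp sequence instead of the fragment gives the gene-wide value.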
 
Protein sequence
MRILLAAAGL VLVYAGIVSL VSPGEPAASP QPWGFSRGIA VRPPEVSLSG GLYFKTYLWA 
RGGGGVLYLR CYVYSRYEGG TWLPTPEEYP VLGVYSVTVG RGPFTEGGAL NLTLPLMGGC
VPVATPSVDG LQLGSIKVSA PGANLAASRT GLYVAYAGGR LGEVTSYYGP GDLPPSPDDL
SLPSGQAGVL LALARNITAG CGDVSCKVER IKEFLKGFTY DGTMDAPWPH IPPGVDPLMW
FLQNKRGVCV HFATAFVMLA RASGVYVRLV VGYMSDGPVP TQWSLTAFSP HAWAEYYQPG
VGWIGVEATP PMGAPSPAAP PPRETPTPAA TTPEVPPASP GPYQWPSIGL GVFLPLAGMA
AAALVGGALF KKRVVITVGE ALRVGAPRGF WVYVNRRRVG RAPVEIVFDK PGLYLVAVGP
FVRVVRVVDY RSMAGRAFEK LLKKLKLPPS ATPREVAERF PQYREVALLV EKLRFGPRAD
NEDYRRLREM L
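
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand, with no external library. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and `translate` is an illustrative name. The two fragments are the first and last lines of the gene sequence above, both of which begin on a codon boundary.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_line = "ATGCGGATTT TGCTCGCGGC GGCGGGGCTC GTGCTGGTAT ATGCGGGTAT TGTCTCTTTG"
last_line = "AATGAAGACT ACAGGCGGCT AAGGGAGATG TTATGA"
print(translate(first_line))  # MRILLAAAGLVLVYAGIVSL
print(translate(last_line))   # NEDYRRLREML*
```

The trailing `*` is the TGA stop codon, which is why the 1476 bp gene yields 491 residues rather than 492.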