Gene Pars_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0220 
Symbol 
ID5056416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp198698 
End bp199750 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content61% 
IMG OID640467799 
Productradical SAM domain-containing protein 
Protein accessionYP_001152487 
Protein GI145590485 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGA GAGGCTTAAA CCGCTCAGAT GCCCTTTACC TCATGCGCGA AGCGGACGTC 
TTCACCTTGG CCAAAGCCGC TGAGGAGCTG ACGCGGAAGT ACTACGGCGG CGTTGTGACC
TTCGTCAACA ACGTGGTTAT CAACTACTCG AACGTGTGCG TTGCCAAATG CCCCATCTGC
GCCTTCTATA GGCTCCCCGG CCACGGGGAG GGCTACGTGA GGAAGCCCGG GGAGGTGGCG
GCGATGGTGG AGCGCTTTGC GAAAGAGCTC GGCGTCACCG AGCTCCACAT AAACGGCGGC
TTCAACCCTT TCCTGACGCC GGAGTACTTC GATGAGCTGT TCCGAGAGGT GAAGAGGAGG
GTTCCCCGCG TGGCGATAAA GGGTCCCACC ATGGCTGAGG TGGACTACTA CGCCAAGCTG
TGGCGCGTCT CGCGGCAGGA GGTCCTATCG CGCTGGAAAG AGGCGGGGCT AGACGCCATT
TCGGGCGGCG GCGCCGAGAT ATTCGCAGAG GAGGTCAGGA AGGTGGTTGC CCCCCACAAG
ATATCTGGCG AAGAGTGGCT CGAAATTGCG GAGCTGGCCC ACAAGATGGG CATACCCAGC
AACGCCACCA TGCTCTACGG ACACGTGGAG AGAGAGGAGC ACGTGGTAGA CCACATATTC
CGCGTCAAGG ACCTCCAGGA GAGGACTGGG GGCCTCCTCC TCTTCATCCC CGTTAAGTTC
AACCCAGATA ACACGGAGCT CAAGGCGAGG GGGGTCGTCG CGAGGCCGGC CCCCTCCACC
TACGACGTGA AGGTGGTGGC CATAGCGAGG CTGATCCTAG GGGACAGGCT AAAGGTGGCT
GCCTACTGGC TCTCCGTGGG CAAGAAGCTG GCCTCCACCC TCCTACTTGC CGGCGCCAAC
GACCTAGTGG GGACGATGTA CAACGAGGCT GTGCTCACCT CGGCCGGGGC GAGGCACAGC
GCGTCGGTGG AGGAGTTGGC AGAAATTGCA AGAGAGGTGG GCAAAACACC TGCACTGAGG
GACACATTCC ACAGAGTACT GGCCTACTTG TAG
 
Protein sequence
MAERGLNRSD ALYLMREADV FTLAKAAEEL TRKYYGGVVT FVNNVVINYS NVCVAKCPIC 
AFYRLPGHGE GYVRKPGEVA AMVERFAKEL GVTELHINGG FNPFLTPEYF DELFREVKRR
VPRVAIKGPT MAEVDYYAKL WRVSRQEVLS RWKEAGLDAI SGGGAEIFAE EVRKVVAPHK
ISGEEWLEIA ELAHKMGIPS NATMLYGHVE REEHVVDHIF RVKDLQERTG GLLLFIPVKF
NPDNTELKAR GVVARPAPST YDVKVVAIAR LILGDRLKVA AYWLSVGKKL ASTLLLAGAN
DLVGTMYNEA VLTSAGARHS ASVEELAEIA REVGKTPALR DTFHRVLAYL