Gene Pars_2219 details


Gene Information

Locus tag: Pars_2219
Symbol:
ID: 5054155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1988910
End bp: 1990154
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 56%
IMG OID: 640469772
Product: Fmu (Sun) domain-containing protein
Protein accession: YP_001154417
Protein GI: 145592415
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID:
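
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1988910
END_BP = 1990154


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1245 414
```

Both results match the Gene Length and Protein Length fields of the record.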


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.437229
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTGGA CTCCTGGAGA GCTGATATCC TTCACTGCCA AGGTTCTCTA CGAAATAAGC 
AAAGGTCTTA CTCTTGACTA CGCTTTTCAA AAGGTAAAGA GGGGGTGGCG TGAGTTAGAT
AGCTTCAAGG TATTTTACGA CGTGGTCTAC GACGCTGTGC GCCATTACTA TTTTCTCCAA
TTCGCCGCTT CGAAGATGTT CGGCTCTTCT GGCGCAAAAG CCATAGCTAA GGCGTGGTTT
ATTTTTAGGG CAGACTCTCT CCTCTACAAC AAAGACATGG TTTACAGCGT GCGGAAACGG
CTGTTAAAAA GGGCTCTGAC AAAGCCGGAC CACGTAATGG CGGCGTTGGA GGAGTTAAGG
GAGGATCGCG CCAGATACTT CTCGGTGAAG TACAGCTATC ACCCCAACAT AGTGTCGACA
CTGCTGTCGC ATTTCCCGCC GGAGGAAGTG GAGAGGTTGC TAGAAGCGGG GAATCACACC
TGGATTTGGC TGAGGATAAA CACGCTGAAG GCGGACGTGG ACAAGGCGTT GAGGCTGTTG
GAGGCCGAGG CCGAGGTGGA GCCCCATCCT AAAATTCCCT TCGCCGTGTT GCTTAAATCA
GCTAAGAGGC CTGTCCAGTA CCTAGAGGCC GTGAGGCGGT TTGTGGCCGT TCCCCAAGAC
CTGGCCTCAA TATACGCCGT GCTTTCGCTT AGGCCAGAGC CTGGCGACAG GATAATCGAC
CTCGCCGCGG CGCCGGGGAT GAAGACCAGC CTAATAGCCC AGCTAGCGGA GGGAAGAGCC
AAAATCGTTG CCGTGGACCT CTCGGCGAAG CGCGTTGCGA GGATGAGGCA CCTCCTGAAA
AACCTAGGAG CAGGGGACTT TGTAGAGGTC GTCAGGGCAG ACTCTCGGGT CTTAAAGACA
AGGAAGTTCG ACAAGGCGCT TCTAGACGCG CCTTGCACCT CCAGCGGGGC GTTCACCAAG
GAGCCCGCCG TAAAGATATA CCCCCGGGTC GAGGAGGCGC CTAAGTACTC CGCCGTGCAG
AAGGCCCTCA TCAAAAACGC ATTGGCGCTG GCAGAGGAGG TGGTGTACGC CGTCTGTAGC
ATCCTTCCAC AAGAAGGCGA AGAGGTGGCG GCGTCTGCCG GCGCAGAGGC GGAAAAGCCC
CATCCTGACC TCGCCCCGTC GTACACGCCC GGCGTCGGCG GGAGAACCTT CCCCCACATC
CACAGAAGCG AGGCCTTCTT CATATCGCGC TTGAGGAAAA GATAG
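
The GC content reported in the record is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the 10-base blocks shown above; `gc_content` is a hypothetical helper, and the example input is only the first line of the sequence, so its result will differ from the whole-gene figure.

```python
def gc_content(seq):
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only; paste all lines to check the full gene.
first_line = "GTGAAGTGGA CTCCTGGAGA GCTGATATCC TTCACTGCCA AGGTTCTCTA CGAAATAAGC"
print(round(gc_content(first_line), 1))
```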
 
Protein sequence
MKWTPGELIS FTAKVLYEIS KGLTLDYAFQ KVKRGWRELD SFKVFYDVVY DAVRHYYFLQ 
FAASKMFGSS GAKAIAKAWF IFRADSLLYN KDMVYSVRKR LLKRALTKPD HVMAALEELR
EDRARYFSVK YSYHPNIVST LLSHFPPEEV ERLLEAGNHT WIWLRINTLK ADVDKALRLL
EAEAEVEPHP KIPFAVLLKS AKRPVQYLEA VRRFVAVPQD LASIYAVLSL RPEPGDRIID
LAAAPGMKTS LIAQLAEGRA KIVAVDLSAK RVARMRHLLK NLGAGDFVEV VRADSRVLKT
RKFDKALLDA PCTSSGAFTK EPAVKIYPRV EEAPKYSAVQ KALIKNALAL AEEVVYAVCS
ILPQEGEEVA ASAGAEAEKP HPDLAPSYTP GVGGRTFPHI HRSEAFFISR LRKR
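
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11, GTG is one of several alternative start codons, and an initiator codon is read as methionine. The sketch below illustrates this with a hand-built codon table; the amino acid assignments are those of the standard code, which table 11 shares, and the start-codon set is taken from the NCBI definition of table 11. The `translate` function is illustrative and not how the database produced its protein sequence.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. Table 11 uses the same amino acid assignments as the
# standard code and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq):
    """Translate a CDS, reading a recognised first codon as methionine."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAAGTGGA CTCCTGGAGA GCTGATATCC"))  # prints: MKWTPGELIS
```

The output matches the first ten residues of the protein sequence. The final codon of the gene sequence, TAG, is a stop codon, which is why the 1245 bp gene yields 414 residues rather than 415.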