Gene Pars_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0356 
Symbol 
ID5055782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp305868 
End bp307055 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content39% 
IMG OID640467927 
Productglycosyl transferase, group 1 
Protein accessionYP_001152614 
Protein GI145590612 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TTCACATAGC CCCCTTCTTT ACAGGGGGCG TTGGGATAGT AGCGAAGAAC 
CTTACGTATG CACTTTCTAG ACTAGGTAAT GATGTTGTTA TCGCATCCCC TGCTAATCCG
CCAGAGGAAA TACGCGAAAA CATTTCTACT TTTTACAAGC TTAGGGAGCT GGTACTTAAA
GACCCCTTAT ACGCGCCAAT GTTTGCATTA ATAAATAAAA ATATTGTTGA AAAGATAATT
AGAGACGTGA AACCCGATAT CATTTTAACT CATGGTCCGC TTACACTTCT AGCATCACTT
ATAAAAACTA AGATACCTTG GTTCTCAGTT GTTCATGGCA CGTATTTCAA TGAACTAAAG
TATATGTGGC AACACCCGAT CCGAGGTATT GAAAAACTGA AGTATTGGTT ATCAATAGGT
ACAACATACC ATGAAGATAT GGAAATTTAT AGATATGTAA CGAAAAAAAG AAATATTTAC
CTTGTCGCAG TTTCTAAAAG AACGAGGCAA GAGCTTATAG ATGCCGGTGT CAATCCGCAA
AGAGTTTTTT CGGTTTTAAA TGGAGTTGAT AAAAACATAT TCAAACCGAT GGATAAAGAC
AAGAACCTAT CGATTCTCGA AAAGAAGTAT GGTGTTGAGG TTGATAATGA AAGGCTACTT
CTCCATGTTA ATCCAGGCGC GATAAAGGGT ACTCATATAC TCATTAAGTC TATTGCCATC
CTCAAGAAAA TATTGAAAGA TAGAGTGATG TTACTCGTCG TGGGTAACAT AGGGCCCAGT
ACATATAGAA GCTACATAGA GAGGCTAGTC AAAGAAATGA AGCTAGAAGA CACGGTCAAG
TTTATCGGGC GAGTACCTCA TGAAGAACTC CCATACTTCT ACAATATAGC TGAACTCACA
ATCGTCCCGT CATATTCGGA GGGTGCACCA TTAGTAATAC CTGAGTCGCT TGCATGCGGA
ACGCCCGTAG TCGCGACAGA AGTCGGTGGC AATTCTGAAT ATCTAAGGCT GGCACTACCC
AAACCTGACA AATACCTCGT AGAGATCAAA GAATACGATT TTTCCAAAAC ATTGGCAAAG
AAAATAGGTA TGGCTCTTAG CTATAGAGCT ATCCCTAATA TAGAATCTAT CCCCTCATGG
TTTGATATTG CTAAGATCTA TCTTAAATTA TTTAGAGAAA AATCGTAA
 
Protein sequence
MKILHIAPFF TGGVGIVAKN LTYALSRLGN DVVIASPANP PEEIRENIST FYKLRELVLK 
DPLYAPMFAL INKNIVEKII RDVKPDIILT HGPLTLLASL IKTKIPWFSV VHGTYFNELK
YMWQHPIRGI EKLKYWLSIG TTYHEDMEIY RYVTKKRNIY LVAVSKRTRQ ELIDAGVNPQ
RVFSVLNGVD KNIFKPMDKD KNLSILEKKY GVEVDNERLL LHVNPGAIKG THILIKSIAI
LKKILKDRVM LLVVGNIGPS TYRSYIERLV KEMKLEDTVK FIGRVPHEEL PYFYNIAELT
IVPSYSEGAP LVIPESLACG TPVVATEVGG NSEYLRLALP KPDKYLVEIK EYDFSKTLAK
KIGMALSYRA IPNIESIPSW FDIAKIYLKL FREKS