Gene Pars_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0594 
Symbol 
ID5054853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp529856 
End bp531061 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content59% 
IMG OID640468153 
Productglycosyl transferase, group 1 
Protein accessionYP_001152838 
Protein GI145590836 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAGA AATACGCCCA ATTCGTGGGG GAGGATGAAA TAGACGCAAT AGTCAAGCTA 
GCGGAGCGGC TCCAAGACCT CTCCATCCTA CACGTGAACT CCACCGCCGC CGGGGGCGGC
GTTGCGGAGA TCCTAAACAG GATGGTGCCC CTAATGCGGG AACTAGGGCT AAGGGTTGAC
TGGAAGGTGA TACGGGGCGA CGCGGAGTTC TTCACCGCAA CTAAGACTTT CCACAACGCC
CTCCAGGGCA CGGTGCGGGA TGTGCCGAGC CATCTGTATT CCGTATACGA GAAGTGGCAG
GAGATAAACG CCAATGAGCT GGATCTCGAC TACGACGTGG TTTTCATACA CGACCCCCAG
CCAGCCGGCC TTATCAAGTA CAGGAAGAGG GGAAAGTGGA TTTGGAGGTG CCACATAGAC
CTCTCTACGC CGCATCCCGA GGTCTGGGCG TTCCTCAAGC GGTACGTCTC CATGTACGAC
CTCGCCATAT TCCACATACC CGAATTTGCC AGAGACGACC TGGAGATACC CCAGCTCCTC
ATACCCCCCT CGATAGACCC CCTCAGCCCC AAAAACAAGG AGCTGCCGCC AACCACGGTG
GAGCGCATAG TGGCGAAATT CGACGTCGAC ACGGAGAGGC CAATCTTGTT GCAAGTCTCC
CGCTTTGACT GGGCAAAGGA TCCCCTGGGC GTCGTGGAGG CGTATAGGCT GGCCAAGCGC
CACGTCCCCG ACCTCCAGCT GGTCTACCTG GGTAGCCCCG CGCACGACGA CCCCGAGGGG
GAGGCCGTCT ATAAGAAAAC TGTGGAGGCC GCCGGCGGCG ACTCAGACAT ACACCTCCTC
ATGCTACCAC CTGACAGCCA CGTGGAGGTC AACGCCTTCC AACGCGCCGC CACTGTGGTG
ATGCAGAAGT CTATAAGAGA GGGCTTCGGG CTCACGGTCT CGGAGGCGTT GTGGAAAAGC
AAGCCGGTGG TAGGCGGAAG GGCGGGGGGC ATTAAAATCC AGGTAATACA CGGAGTCACC
GGCTTCCTCG CCACCTCCCC CCGCGTCGCC GCCCACTACG TCACCTTCCT CCTCAGGGAG
AAGGAGATAA GGGAGAAGAT GGGCGCCGCA GGCAGAGAAC ACGTCCGGAG AAACTTCCTA
ATAACCCACC ATCTGAGGCG CTACCTCATG GCAATAGCCT ACGCAACCGG GAGACACAGA
AGATAG
 
Protein sequence
MIEKYAQFVG EDEIDAIVKL AERLQDLSIL HVNSTAAGGG VAEILNRMVP LMRELGLRVD 
WKVIRGDAEF FTATKTFHNA LQGTVRDVPS HLYSVYEKWQ EINANELDLD YDVVFIHDPQ
PAGLIKYRKR GKWIWRCHID LSTPHPEVWA FLKRYVSMYD LAIFHIPEFA RDDLEIPQLL
IPPSIDPLSP KNKELPPTTV ERIVAKFDVD TERPILLQVS RFDWAKDPLG VVEAYRLAKR
HVPDLQLVYL GSPAHDDPEG EAVYKKTVEA AGGDSDIHLL MLPPDSHVEV NAFQRAATVV
MQKSIREGFG LTVSEALWKS KPVVGGRAGG IKIQVIHGVT GFLATSPRVA AHYVTFLLRE
KEIREKMGAA GREHVRRNFL ITHHLRRYLM AIAYATGRHR R