Gene Pars_0753 details

Gene Information

Locus tag: Pars_0753
Symbol:
ID: 5054495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 670771
End bp: 671826
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 61%
IMG OID: 640468312
Product: myo-inositol-1-phosphate synthase
Protein accession: YP_001152991
Protein GI: 145590989
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
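
The start, end, gene length, and protein length fields above are arithmetically related. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions are consistent with the values in this record, but are not stated by it). The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(670771, 671826)
print(length, protein_length_aa(length))  # 1056 351
```

Both results agree with the Gene Length and Protein Length fields of the record.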


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.861409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.170909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCATCA GGGTAGGCAT AGTGGGCGTA GGCAACTGCG CCTCTGCCCT CGTGCAGGGG 
ATTGAGATGT ATAAGCGCAA CCCGGAGATA GAGCCAGCCG TCGCCTTCAA GGACATAGGA
GGCTACACCC CCCGCGACAT AGTCTTCGCG TCTGCCTTCG AGATCGACGC GAGGAAAGTC
GGGCTCGACC TCGCCGAGGC CATCTTCCAG CCTCCCAACA ACGCCACTGT GGTGTACAAG
CCCCCCAAGC TGGGGGTCGT GGTTAGGCCG GGGCCCGCGC TGGACGGCGT CCCCGCCGGC
GGCCTCGTGC CGAAGGTGGT GGAGGGTACT GTGGAGGACG TGCTCAAAGA GCTCAACTCC
ACCAACACCC ACGTCCTCGT CAACTACCTC CCCACGGGGG CGCAGAAGGC GGCTGAGGCC
TACGCCGAGG CGGCTCTGAG AGCCGGCGTC GCCTTCGTCA ACGCCATGCC CGCCTCGATA
GCCACCAGCG GGTATTGGCA GAGGAGGTTC CAGGAGAGGG GCGTGCCGCT ACTCGGCGAC
GACACCCAGA ACCAGATCGG GGCCACCGTC TTCCACAAGA CGATAATTAG ACTACTGGCG
CTGAGGGGGG TGAAGATAAG GGACACATAC CAGATAAACG TGGGCGGCAC CCCCGACTTC
GTAAACCTAA TGTACAGGAA GGGGGACAAG GAGAAGACCA AGACAGCCGC CGTCAAGATG
ATGGCGGAGG GCCAAGAATT CGACGCCTAC ATCTCCCCGG TGGCGTATAT ACACTTCCTC
GGGGATAGGA AAATTGCCCA CACGCTAGTC GAGGCGGAGA TCTTCGGCGG CCTCACAATT
AGGATTGAGG CAACGCTCGA CGTCCACGAC GCCTGGAACA GCGCCGCAGT CGTCACAGAC
TCTGTGAGGC TAGCCAAACT GGCCATGGAC CGGAACATCG GCGGACCGCT GATAAGCGCA
TCGGCGTGGG GCTTCAAAAA CCCGCCTGTC CACATGAGCC CCGAAGACGC GTACAAGGCA
GTTCTCGAGT TCATAGAGGG AAAACGCGAT AGATGA
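
The gene sequence is displayed in space-separated blocks of ten bases. As a rough sketch of how such a listing could be cleaned up and how a GC content figure like the one in the record could be recomputed, the helpers below strip whitespace and count G and C bases. The function names are hypothetical, and the exact rounding used by the database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first line of the listing is used here; pass the full
# sequence text to compare against the GC content field.
first_line = "ATGCTCATCA GGGTAGGCAT AGTGGGCGTA GGCAACTGCG CCTCTGCCCT CGTGCAGGGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```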
 
Protein sequence
MLIRVGIVGV GNCASALVQG IEMYKRNPEI EPAVAFKDIG GYTPRDIVFA SAFEIDARKV 
GLDLAEAIFQ PPNNATVVYK PPKLGVVVRP GPALDGVPAG GLVPKVVEGT VEDVLKELNS
TNTHVLVNYL PTGAQKAAEA YAEAALRAGV AFVNAMPASI ATSGYWQRRF QERGVPLLGD
DTQNQIGATV FHKTIIRLLA LRGVKIRDTY QINVGGTPDF VNLMYRKGDK EKTKTAAVKM
MAEGQEFDAY ISPVAYIHFL GDRKIAHTLV EAEIFGGLTI RIEATLDVHD AWNSAAVVTD
SVRLAKLAMD RNIGGPLISA SAWGFKNPPV HMSPEDAYKA VLEFIEGKRD R
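
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table in the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, which this sketch does not model. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCTCATCA GGGTAGGCAT AGTGGGCGTA"))  # MLIRVGIVGV
```

The output matches the first ten residues of the protein sequence listed above.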