Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0753 |
Symbol | |
ID | 5054495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 670771 |
End bp | 671826 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468312 |
Product | myo-inositol-1-phosphate synthase |
Protein accession | YP_001152991 |
Protein GI | 145590989 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1260] Myo-inositol-1-phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.861409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.170909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCATCA GGGTAGGCAT AGTGGGCGTA GGCAACTGCG CCTCTGCCCT CGTGCAGGGG ATTGAGATGT ATAAGCGCAA CCCGGAGATA GAGCCAGCCG TCGCCTTCAA GGACATAGGA GGCTACACCC CCCGCGACAT AGTCTTCGCG TCTGCCTTCG AGATCGACGC GAGGAAAGTC GGGCTCGACC TCGCCGAGGC CATCTTCCAG CCTCCCAACA ACGCCACTGT GGTGTACAAG CCCCCCAAGC TGGGGGTCGT GGTTAGGCCG GGGCCCGCGC TGGACGGCGT CCCCGCCGGC GGCCTCGTGC CGAAGGTGGT GGAGGGTACT GTGGAGGACG TGCTCAAAGA GCTCAACTCC ACCAACACCC ACGTCCTCGT CAACTACCTC CCCACGGGGG CGCAGAAGGC GGCTGAGGCC TACGCCGAGG CGGCTCTGAG AGCCGGCGTC GCCTTCGTCA ACGCCATGCC CGCCTCGATA GCCACCAGCG GGTATTGGCA GAGGAGGTTC CAGGAGAGGG GCGTGCCGCT ACTCGGCGAC GACACCCAGA ACCAGATCGG GGCCACCGTC TTCCACAAGA CGATAATTAG ACTACTGGCG CTGAGGGGGG TGAAGATAAG GGACACATAC CAGATAAACG TGGGCGGCAC CCCCGACTTC GTAAACCTAA TGTACAGGAA GGGGGACAAG GAGAAGACCA AGACAGCCGC CGTCAAGATG ATGGCGGAGG GCCAAGAATT CGACGCCTAC ATCTCCCCGG TGGCGTATAT ACACTTCCTC GGGGATAGGA AAATTGCCCA CACGCTAGTC GAGGCGGAGA TCTTCGGCGG CCTCACAATT AGGATTGAGG CAACGCTCGA CGTCCACGAC GCCTGGAACA GCGCCGCAGT CGTCACAGAC TCTGTGAGGC TAGCCAAACT GGCCATGGAC CGGAACATCG GCGGACCGCT GATAAGCGCA TCGGCGTGGG GCTTCAAAAA CCCGCCTGTC CACATGAGCC CCGAAGACGC GTACAAGGCA GTTCTCGAGT TCATAGAGGG AAAACGCGAT AGATGA
|
Protein sequence | MLIRVGIVGV GNCASALVQG IEMYKRNPEI EPAVAFKDIG GYTPRDIVFA SAFEIDARKV GLDLAEAIFQ PPNNATVVYK PPKLGVVVRP GPALDGVPAG GLVPKVVEGT VEDVLKELNS TNTHVLVNYL PTGAQKAAEA YAEAALRAGV AFVNAMPASI ATSGYWQRRF QERGVPLLGD DTQNQIGATV FHKTIIRLLA LRGVKIRDTY QINVGGTPDF VNLMYRKGDK EKTKTAAVKM MAEGQEFDAY ISPVAYIHFL GDRKIAHTLV EAEIFGGLTI RIEATLDVHD AWNSAAVVTD SVRLAKLAMD RNIGGPLISA SAWGFKNPPV HMSPEDAYKA VLEFIEGKRD R
|
| |