Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1777 |
Symbol | |
ID | 5055522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1597678 |
End bp | 1598769 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469322 |
Product | nucleotidyl transferase |
Protein accession | YP_001153980 |
Protein GI | 145591978 |
COG category | [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.13003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0200486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG CCCCTGTCGT ACTAGCGGGC GGCACGGCCG GCATATTTGA AAAATTGACA GGTCAACTCC CCAAGACTTA CATAAAAATC GGCGGGAAAA GACTATACCA ATACACCGCC GACGCGTTAC TGGCTATATT CGGAAAAGTC TACGTAGCGG CGCCCCGCCC CGAAGATAAC CCCTACATCT ATGTAGAAGA AAGAGGGACT GGAGTCGAGA GGGCAATCTC CGCGGCAGAG GCCTACTTAG GCGCTGAGAC CCATATGCTG ATAGCATATG GCGACGTTTA TGTAGAGAGC TATGCGTACA GGTCTCTAAT AGAGGCGACA GCTACCACAG GCGCAGACGG CGCTGTGCTC GCCGTGCCGA GAAAAGCCAC AAAGGGATAC GGCGTACTGG AGACGAAAGC CGGGACCCTC CTAGCCAAAA TAGGGGGAGA AGGCCAGTGG ATATTCGGAG GATTAGCCCT CCTCCCACGT GCCGCACTAA GGATAATAGA ACAAGCCGGG CTTTACGAAG GCTTAAACCA AATAGCACAG CGATCAAAAA TAGCGGTTGT ACCGTGGAGC GGGACGTGGC ATGATGTAAA TCACCCAGAA GACTTAATGC AACTGCTAGA GTACACAGCG CCGAGGAATA CCATTATTGC TAAAACCGCC AAGGTAAGCC CCACTGCCGT GTTGGAGGGA CCCGTCGTAA TTGAGGATGG CGTAGAGATA GACCACTACG CCGTGATAAA AGGTCCTGCC TACATAGGGA AGGGGGCTTT TATAGGAGCC CACGCACTTA TACGTAACTA CACCGATATA GAGGAGGGCG CCGTCATCGG AAGTAGCACA GAAGTAAGCC ACAGCCTCAT CTGCGAGAGG GCCACTGTGG GGAGGGGGTC CTTTGTCTCC TACAGCGTAG TAGGCGAAGA GGCAGTTCTA GAACCCAACA TTGTGACTAT GTCGGTACTC AGAGAGGGGC GCGATAGACT AGAACCAATA CAAGTAAGAG GCCAGGTATA CTACAAACTA GGCGCCTTAA TACCGCGGAA AGCCCGAGTA TCCGCAGGCA CAACACTACC TCCAGGAGCT GGCTGGGACT AA
|
Protein sequence | MKIAPVVLAG GTAGIFEKLT GQLPKTYIKI GGKRLYQYTA DALLAIFGKV YVAAPRPEDN PYIYVEERGT GVERAISAAE AYLGAETHML IAYGDVYVES YAYRSLIEAT ATTGADGAVL AVPRKATKGY GVLETKAGTL LAKIGGEGQW IFGGLALLPR AALRIIEQAG LYEGLNQIAQ RSKIAVVPWS GTWHDVNHPE DLMQLLEYTA PRNTIIAKTA KVSPTAVLEG PVVIEDGVEI DHYAVIKGPA YIGKGAFIGA HALIRNYTDI EEGAVIGSST EVSHSLICER ATVGRGSFVS YSVVGEEAVL EPNIVTMSVL REGRDRLEPI QVRGQVYYKL GALIPRKARV SAGTTLPPGA GWD
|
| |