Gene Information |
Locus tag | Pars_0594 |
Symbol | |
ID | 5054853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 529856 |
End bp | 531061 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468153 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001152838 |
Protein GI | 145590836 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
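The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Pars_0594).
length = gene_length(529856, 531061)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Both results agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields in the record.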
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAGA AATACGCCCA ATTCGTGGGG GAGGATGAAA TAGACGCAAT AGTCAAGCTA GCGGAGCGGC TCCAAGACCT CTCCATCCTA CACGTGAACT CCACCGCCGC CGGGGGCGGC GTTGCGGAGA TCCTAAACAG GATGGTGCCC CTAATGCGGG AACTAGGGCT AAGGGTTGAC TGGAAGGTGA TACGGGGCGA CGCGGAGTTC TTCACCGCAA CTAAGACTTT CCACAACGCC CTCCAGGGCA CGGTGCGGGA TGTGCCGAGC CATCTGTATT CCGTATACGA GAAGTGGCAG GAGATAAACG CCAATGAGCT GGATCTCGAC TACGACGTGG TTTTCATACA CGACCCCCAG CCAGCCGGCC TTATCAAGTA CAGGAAGAGG GGAAAGTGGA TTTGGAGGTG CCACATAGAC CTCTCTACGC CGCATCCCGA GGTCTGGGCG TTCCTCAAGC GGTACGTCTC CATGTACGAC CTCGCCATAT TCCACATACC CGAATTTGCC AGAGACGACC TGGAGATACC CCAGCTCCTC ATACCCCCCT CGATAGACCC CCTCAGCCCC AAAAACAAGG AGCTGCCGCC AACCACGGTG GAGCGCATAG TGGCGAAATT CGACGTCGAC ACGGAGAGGC CAATCTTGTT GCAAGTCTCC CGCTTTGACT GGGCAAAGGA TCCCCTGGGC GTCGTGGAGG CGTATAGGCT GGCCAAGCGC CACGTCCCCG ACCTCCAGCT GGTCTACCTG GGTAGCCCCG CGCACGACGA CCCCGAGGGG GAGGCCGTCT ATAAGAAAAC TGTGGAGGCC GCCGGCGGCG ACTCAGACAT ACACCTCCTC ATGCTACCAC CTGACAGCCA CGTGGAGGTC AACGCCTTCC AACGCGCCGC CACTGTGGTG ATGCAGAAGT CTATAAGAGA GGGCTTCGGG CTCACGGTCT CGGAGGCGTT GTGGAAAAGC AAGCCGGTGG TAGGCGGAAG GGCGGGGGGC ATTAAAATCC AGGTAATACA CGGAGTCACC GGCTTCCTCG CCACCTCCCC CCGCGTCGCC GCCCACTACG TCACCTTCCT CCTCAGGGAG AAGGAGATAA GGGAGAAGAT GGGCGCCGCA GGCAGAGAAC ACGTCCGGAG AAACTTCCTA ATAACCCACC ATCTGAGGCG CTACCTCATG GCAATAGCCT ACGCAACCGG GAGACACAGA AGATAG
|
Protein sequence | MIEKYAQFVG EDEIDAIVKL AERLQDLSIL HVNSTAAGGG VAEILNRMVP LMRELGLRVD WKVIRGDAEF FTATKTFHNA LQGTVRDVPS HLYSVYEKWQ EINANELDLD YDVVFIHDPQ PAGLIKYRKR GKWIWRCHID LSTPHPEVWA FLKRYVSMYD LAIFHIPEFA RDDLEIPQLL IPPSIDPLSP KNKELPPTTV ERIVAKFDVD TERPILLQVS RFDWAKDPLG VVEAYRLAKR HVPDLQLVYL GSPAHDDPEG EAVYKKTVEA AGGDSDIHLL MLPPDSHVEV NAFQRAATVV MQKSIREGFG LTVSEALWKS KPVVGGRAGG IKIQVIHGVT GFLATSPRVA AHYVTFLLRE KEIREKMGAA GREHVRRNFL ITHHLRRYLM AIAYATGRHR R
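The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before any downstream use. The following is a minimal sketch of how the GC content and the terminal stop codon could be checked from the gene sequence above. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# Usage on the final two blocks of the gene sequence shown above.
tail = "GAGACACAGA AGATAG"
print(ends_with_stop(tail))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field of the record.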