Gene Information |
Locus tag | Pars_1032 |
Symbol | |
ID | 5056147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 919146 |
End bp | 920285 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640468588 |
Product | hypothetical protein |
Protein accession | YP_001153262 |
Protein GI | 145591260 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2407] L-fucose isomerase and related proteins |
TIGRFAM ID | |
|
|
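The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 919146
END_BP = 920285
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate precedes start coordinate")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS ending in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not enter this check, because the span length is the same on either strand.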
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.775746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATTG AGGTGGCCGC CGCACCCAAC GTAGACGCCG AGACGCGGGA GGAGTACAGA TCCCTCTACC GAAAAACCCT AGGGGGCCTT GGGGAGGGGG CGAATTTCAT AGTGGTCTTG ACGGGGGGCT CAGAGCCCGA GATCCTAGCC GCCGCCGGGG ACTACAACAT CATCCTCGCG TGGCCCCACT ACAACTCCCT CCCAGCCGCG CTTGAGGCCG CCGCGGCGCT TAAGGAGGCG GGGAGGTTTG CCCACGTAGT CCAGCTGGAG GGGCCAGGTG CGGAGCCGCC TAGAGACAAG CTGGAGAGGC TCCTCCGGGT GGTGGACCTC TTGAGGAGAC CGCCCAGGCT CGGGCTTGTG GGGTCGCCTA ATAGGTGGCT TGTGGCCTCA TGGCTGAGGG GCAAGCCAGA CGTAGTTATC GACGAGGGGG AGGTCTACGC CCGCAGCGTG GAGAGGGACG GGGCGGACGT CGCCGAGAGG CTTGTGAAAG GCGCAGAGCG TAGCGACTTC TCGGCCCGGG ATCTAGCTCC CATAGCCGCG TATGCCAAGA CCCTGGCAGA GCGCGCCTCG GGGCTTGACG GAATCACCCT GGGCTGCTGG TGCTTCGACT TCGAAGAGGT TAGGAAGAGG GGGTGGACGC CTTGCATTTC CCTCGCCCTC CTCAACGACT GGGGGGTAAT GGCCACGTGC GAGGGGGATG TGAGGGCTCT CTACTCCGCG GTGGTGTTAA GGCGCCTCTC CGGGAGGCCG AGTTGGATTA GCAACGTGAA CAAGATATAC GACTGGGGCC TCTTGTTGAC GCACGACGGG GCGCCGCCAA GCTTTGGCAA ATACGCAGTG GTCCCCCGCA TGGCTACCAA AGCCGCCGCG GCGCTTAGGG TAACGGTGGA GCCGGGGAGG CCGGCCACCT TGCTGAGGGT TTCAGGAGAC TTGAAGAGGG CGCTTCTGCT GAAGGGGGTC ACGGCGGAGG GGGAGAGGGT AGAGGCGTGT AGCACCCAGA TCGCTGTTAG GCTCACCGTG GGGTCTGGAA GAGACGTCTT GAGGGCTGGG CTCGGCAACC ACTTAGCCTT CGTCCTCGAC GACGTTTACG AAGAGACTAG GCTCTACTTG GAGCATCTAG GGGCTCAGGT CATCCCCTAG
|
Protein sequence | MPIEVAAAPN VDAETREEYR SLYRKTLGGL GEGANFIVVL TGGSEPEILA AAGDYNIILA WPHYNSLPAA LEAAAALKEA GRFAHVVQLE GPGAEPPRDK LERLLRVVDL LRRPPRLGLV GSPNRWLVAS WLRGKPDVVI DEGEVYARSV ERDGADVAER LVKGAERSDF SARDLAPIAA YAKTLAERAS GLDGITLGCW CFDFEEVRKR GWTPCISLAL LNDWGVMATC EGDVRALYSA VVLRRLSGRP SWISNVNKIY DWGLLLTHDG APPSFGKYAV VPRMATKAAA ALRVTVEPGR PATLLRVSGD LKRALLLKGV TAEGERVEAC STQIAVRLTV GSGRDVLRAG LGNHLAFVLD DVYEETRLYL EHLGAQVIP
|
| |
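The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup with two simple checks. The helper names are hypothetical, and the CDS check is a simplification: it only accepts ATG as a start codon, although translation table 11 also allows alternative starts. Only the first three blocks of the gene sequence are used as sample input; the full sequence can be pasted in the same way.

```python
# Stop codons under translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_blocks(text: str) -> str:
    """Join space-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    excerpt = "ATGCCAATTG AGGTGGCCGC CGCACCCAAC"
    seq = clean_blocks(excerpt)
    print(len(seq), round(gc_fraction(seq), 3))
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field, and `looks_like_cds` can be applied to confirm that the sequence is listed in coding orientation even though the gene lies on the minus strand.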