Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1872 |
Symbol | |
ID | 5055761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1676234 |
End bp | 1677124 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469418 |
Product | ROK family protein |
Protein accession | YP_001154075 |
Protein GI | 145592073 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTGT ATCTAGGCGT TGATGTGGGA GCGACGTGGA CGAGGGCCGT ATTGGTGGAT GACCATGCAA ATGTGGTAAA CAGGTTGAAA ATTAGAACCA GTAAAAACCC CCTTGCCGAC GTCGCCGAGG CAGTAGCGGG GTGGAAGTTT GACAGTATAG GCGTGGGATC CATCGGGCCT ATGGACTTGA GAAGCGGATG GGTTACTAAC TCGCCTAATT CTCCAGCTAG GCAGTTCCCA CTAGTCGAGC CGTTGAAAAA ACTGAGCAAA CCAGTAGTTG TTGCAAACGA CTGCGTCGCA GCAGTGTGGG GAGAGTATGT CTTTAAACAC GGCGTCGACA ACATGGTGTA CTTAACTCTG TCCACTGGCG TTGGGGTTGG CGCAGTGGTT AACGGCACTC TACTGCTGGG CAAAGACGGC AACGCTCACG AACTGGGACA CGCCGTGATA GATTTCCGTG CGGGTCGCCA GTGCGGATGC GGCGGCTTTG GCCACTTCGA GGCATATATT GGTGGGGCGA ATGTGCCCAA GTGGTTCCGC GAAGTCTCCG GCGAGGCGGT AGCAGACGCC GCCGAGGTGT TCAGTAGATA TAGAGCTGGG GATTCAAAGG CGGTGGAGTT TGTCAACCTC TGGCTCGACG CCTTGGCCGC GGGCATAGCC ACAGTGGTGG CGGCCTACGA CCCCGAGCTG TTGATAGTGG GGGGATCAGT CGCCTTGAAC AACTGGGATA TCATAATCCC CAAGCTTTCC CCGCGTTTGG CGAAATACTT AGGCGTGCGG CCTCCCAAAA TTCTCCAGGC ATCTTTTGGA GACGACGAAG TGGCGATAGG CGCCGCCGCC CTCGCTTATA AAACCCCAGA TAGCTTGAAA AAGTTCGGAT ATCCTAGATA G
|
Protein sequence | MPLYLGVDVG ATWTRAVLVD DHANVVNRLK IRTSKNPLAD VAEAVAGWKF DSIGVGSIGP MDLRSGWVTN SPNSPARQFP LVEPLKKLSK PVVVANDCVA AVWGEYVFKH GVDNMVYLTL STGVGVGAVV NGTLLLGKDG NAHELGHAVI DFRAGRQCGC GGFGHFEAYI GGANVPKWFR EVSGEAVADA AEVFSRYRAG DSKAVEFVNL WLDALAAGIA TVVAAYDPEL LIVGGSVALN NWDIIIPKLS PRLAKYLGVR PPKILQASFG DDEVAIGAAA LAYKTPDSLK KFGYPR
|
| |