Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0043 |
Symbol | |
ID | 7401396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 45462 |
End bp | 46442 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707102 |
Product | ROK family protein |
Protein accession | YP_002564719 |
Protein GI | 222478482 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0272588 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTACG TGGGCGTCGA CCTCGGAGCG ACAAACGTCC GGGCGGTCGT CGGCGACGAA ACCGCAACCG TCCTCGGATC CGACTCGCGA GGGACCCCGA GCGGGCCAAA CGGGATCGCG GTCACCGAGG CCGTTCTCGG CGTGGTCCGC GGCGCGTGCG AGGACGCCGG AATCGACCCG ACCGCGGTAG TCGCGGCTGG GATCGGCTCG ATCGGTCCCC TCGATCTGGC TGCCGGGATC GTACAGGGAC CGGCGAATCT CCCGGACACC GTCGAACGAA TTCCCCTCAT CGGACCGGTT TCACAGCTGT TAGACACCGA CGAGGTCCAT CTCCACAACG ACACCATCGC GGGCGTCATC GGCGAGCGGT TCCATTCCGA GCGCAACCCC GACGACATGG TGTATCTCAC CATCTCCTCC GGTATCGGTG CCGGCGTCGC CGTCGATGGC AACGTGCTCT CGGGGTGGGA CGGCAACGCC GGCGAGGTCG GCCACATGAC GGTCGACCCG CACGGCTTCA TGACCTGTGG ATGCGGGCTC GACGGCCACT GGGAGGGGTA CTGCTCGGGC AACAACATCC CGAAGTACGC CCGCGAGCTC CACGAGGAGG ACCCGATCGA GACCTCCCTG CCGATCGAGG ACCCCGACTT CTCCGCGGTC GACGTGTTCG AGGCGGCCGG CGAGGACACC TTCGCCGACC ACGTGATCGC TCAAGTCGCC CACTGGAACG CGATGGGCGT CGCCAACGTC ATCCACGCGT ACGCCCCACT AGTCGTGAGC GTCGGCGGCG CGGTCGCGCT CAACAACCCC GAGTTGGTGC TCGACCCGAT CCGCGAGAAG CTCGCGGACA TGGTGTTCAT CAACGTCCCC GAGGTTCGCC TCACCGAACT TGGCGACGAC GTGGTGGTGA AGGGCGCACT CGCGAGCGCG CTCACGGGCG GCACGGGCGA CCGATCGCGG GTCGACCCGC CACCGAGGTG A
|
Protein sequence | MYYVGVDLGA TNVRAVVGDE TATVLGSDSR GTPSGPNGIA VTEAVLGVVR GACEDAGIDP TAVVAAGIGS IGPLDLAAGI VQGPANLPDT VERIPLIGPV SQLLDTDEVH LHNDTIAGVI GERFHSERNP DDMVYLTISS GIGAGVAVDG NVLSGWDGNA GEVGHMTVDP HGFMTCGCGL DGHWEGYCSG NNIPKYAREL HEEDPIETSL PIEDPDFSAV DVFEAAGEDT FADHVIAQVA HWNAMGVANV IHAYAPLVVS VGGAVALNNP ELVLDPIREK LADMVFINVP EVRLTELGDD VVVKGALASA LTGGTGDRSR VDPPPR
|
| |