Gene Hlac_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0043 
Symbol 
ID7401396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp45462 
End bp46442 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content68% 
IMG OID643707102 
ProductROK family protein 
Protein accessionYP_002564719 
Protein GI222478482 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0272588 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACG TGGGCGTCGA CCTCGGAGCG ACAAACGTCC GGGCGGTCGT CGGCGACGAA 
ACCGCAACCG TCCTCGGATC CGACTCGCGA GGGACCCCGA GCGGGCCAAA CGGGATCGCG
GTCACCGAGG CCGTTCTCGG CGTGGTCCGC GGCGCGTGCG AGGACGCCGG AATCGACCCG
ACCGCGGTAG TCGCGGCTGG GATCGGCTCG ATCGGTCCCC TCGATCTGGC TGCCGGGATC
GTACAGGGAC CGGCGAATCT CCCGGACACC GTCGAACGAA TTCCCCTCAT CGGACCGGTT
TCACAGCTGT TAGACACCGA CGAGGTCCAT CTCCACAACG ACACCATCGC GGGCGTCATC
GGCGAGCGGT TCCATTCCGA GCGCAACCCC GACGACATGG TGTATCTCAC CATCTCCTCC
GGTATCGGTG CCGGCGTCGC CGTCGATGGC AACGTGCTCT CGGGGTGGGA CGGCAACGCC
GGCGAGGTCG GCCACATGAC GGTCGACCCG CACGGCTTCA TGACCTGTGG ATGCGGGCTC
GACGGCCACT GGGAGGGGTA CTGCTCGGGC AACAACATCC CGAAGTACGC CCGCGAGCTC
CACGAGGAGG ACCCGATCGA GACCTCCCTG CCGATCGAGG ACCCCGACTT CTCCGCGGTC
GACGTGTTCG AGGCGGCCGG CGAGGACACC TTCGCCGACC ACGTGATCGC TCAAGTCGCC
CACTGGAACG CGATGGGCGT CGCCAACGTC ATCCACGCGT ACGCCCCACT AGTCGTGAGC
GTCGGCGGCG CGGTCGCGCT CAACAACCCC GAGTTGGTGC TCGACCCGAT CCGCGAGAAG
CTCGCGGACA TGGTGTTCAT CAACGTCCCC GAGGTTCGCC TCACCGAACT TGGCGACGAC
GTGGTGGTGA AGGGCGCACT CGCGAGCGCG CTCACGGGCG GCACGGGCGA CCGATCGCGG
GTCGACCCGC CACCGAGGTG A
 
Protein sequence
MYYVGVDLGA TNVRAVVGDE TATVLGSDSR GTPSGPNGIA VTEAVLGVVR GACEDAGIDP 
TAVVAAGIGS IGPLDLAAGI VQGPANLPDT VERIPLIGPV SQLLDTDEVH LHNDTIAGVI
GERFHSERNP DDMVYLTISS GIGAGVAVDG NVLSGWDGNA GEVGHMTVDP HGFMTCGCGL
DGHWEGYCSG NNIPKYAREL HEEDPIETSL PIEDPDFSAV DVFEAAGEDT FADHVIAQVA
HWNAMGVANV IHAYAPLVVS VGGAVALNNP ELVLDPIREK LADMVFINVP EVRLTELGDD
VVVKGALASA LTGGTGDRSR VDPPPR