Gene Hlac_2493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2493 
Symbol 
ID7401545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2470184 
End bp2472547 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content71% 
IMG OID643709565 
ProductGlycosyl hydrolase family 32 domain protein 
Protein accessionYP_002567136 
Protein GI222480899 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.126978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGT CGCCGTCCCG TGTCGGGATT CTCACCGACG GGGACTTCTC CGAGGAGCAG 
CGCGCCGCGG CCGACTGGCT GGCCGCGCGC GACGGGATCG CGATCGAGAG AGTCGCGTTC
GACGATCTCG CCGCACCGAT CTCGGATGCG TCCTCGGAAT CATCCACGGA AGTGTCCGCA
GAGTCGTCCC ACCCGTCCCC GGACGAGTCG TATCCTGGCC TAGGTGTCGG TGACCGCAGA
TCCCGAACCG ACCCGCTTCG CCGGTTCGAC GCGCTCTGGT GGCATCGCGC TGCCCCGATC
GCCGAGAGCG ATCCCCTCGC CGACGCGGCC GAAGGGATCG ATCGATACCT CGCTGCCGGC
GGTGGTCTCC TCCTGACGCT CCGCGCGATG GCGTCGGTCG ACCGCCTCTC CGTCGAGTCG
GTCCCGCCCG ACCGGGTCGG CGAGGACTCG CTCGGGGAGC CGACCGGCGT GCTGTGGCGG
TCGCTGTACG CCGACCACCC GGCGATGGCG TCGCTGGAGG GACTCCGACA CCACGTCCGG
GAGCGCGGCA CGGTCCCGAC GGTCCGGTAC GAGCGCGTGC TCCCCGAGCG CGGTGAACCG
CTCGCCTCCA CGCTCCGCGG CGACACCGCG ATCCCGGACG AGGTGACGGC GGTCTCGTGG
CGAGTCGGGG AGGGCGACGA GCGCGGCGGG CCGGGGGGTG CAGTCGTCGG ACTCGGCGCG
CCCGTCTTGT TCGCCGATCC GCCGGGTATC GACGATCACG CCCACGATCT GGAGGTCCAC
GATCCCGACG CCCCGGGGCT TGCCGGGACG CGCGACTGCC TCGTTGCCGG CTGCCTCCGA
AGCCTCGCCG TCGACGACGG GACGCCCGCT CGCCCGACCG ACGCCGACGA CATGCGCCGG
CTCCGCGATC GGATCGATGA CGCCGGCGAG GACGGCCCCG GCGGCCGTCC CAAGTACCAC
CTCACGCCAC CCGCTAACTG GCTGAACGAC CCGAACGGGC TGATCCGATG GGACGGGCGG
TACCACGTCT TCTACCAGTA CAACCCCGGC GGCCCGTTCC ACAACACGAT CCACTGGGGC
CACGCCGTCA GCGACGACCT CGTGACGTGG CGCGACGAGC CGGTCGCGCT CTCACCCTCC
CCGGACGGCC CCGACCGCGA CGGCTGCTGG TCGGGCTGTG CGGTCGACGA CGACGGTACC
CCAACGCTTC TGTACACCGG CGGAAACGGC CGTGATCAGC TCCCCTGTCT CGCGACGACT
GACGACCCCG ACCTCCGGTC GTGGGAGAAG TACGAGGGGA ACCCAGTCAT CGAGTCGCCG
CCGGCCGACC TCGACGTGCT CGAAACGGAG CACTGGCGCG CCGAGTTCCG CGACCACAAC
GTCTGGCGCG AGGACGGGCG CTGGCACCAC CTCGTCGGTA CCGGGCTCGT CGACGGCGGC
GGCGCGGCCC TGCTGTACAC CGGAGAGACG CTCACCGAGT GGACCTACGA GGGCCCCTTG
CTCGCCGGTG GGCCGGACGC CGGCGCGGTG TGGGAGTGCC CCGAACTGCT CGATCTGGGC
GACCGCCGAC TCCTCCACGT CTCAGACTAC GAGAACGTCG TCTACTTCCT CGGGACCGTC
GAGGACGGCG AGTTCGTGGT CGACTCCGAA GGGGTGCTCG ACCACGGCGA CTTCTACGCG
CCGCAGTCCC TCTCGGACTC GAACCGAGGT GCCGAAGATG AGACTGATAC GGAGCGCTCG
CTCACGTGGG GGTGGCTCCC CGAGGCCCGC GACGTGGACG CACAGTGGGA CGCCGGCTGG
TCGGGCGCGC TCTCGCTCCC CCGGGTGATC GAGACCGCCC CCGACGGCGA CCTCCGCCAG
CGTCCGGCCG ACGAGGCGAC CGACCTCCGG ACCGAGCGGC TCGCCGACGG CGAAACGGTC
GCGCTCGCGC CCGACGACCA GCGCCGCCTC GACGTCTCGG GTGCGGCGAT CGAGATAGAG
ATAGAGATCG CACTTGACGA TGCCGAGGCG GTCGAGATAT CCGTGTTCGA GACGCCCGAC
CGCGCCGAAC ACACCCCGAT CCGCTACGCG CGCGACGGGA CTCTGTCGAT CGATCGCACC
CCGTCGAGCC GAGACCCGCG CGCGTTTGCC GACGCGCAAT CGATGGCGGT TCCTCCCTAC
GACGAGCCCC TCTCGCTGCG CGTCTTCCTC GACCGCTCCG TGATCGAGAT CTACGCCAAC
GGTCGCCACT GTCTCACGAG CCGGGTGTAC CCGACCCGCG ACGACGCGGT CGGCGTCTCC
GCGCGGGCCG AGGGCGGGCG CGCGGAGATC GCGTCGCTGT CGGCGTGGGA ACTCGGCGAG
GCGATGCCGA CGGACGGCGA CTGA
 
Protein sequence
MTASPSRVGI LTDGDFSEEQ RAAADWLAAR DGIAIERVAF DDLAAPISDA SSESSTEVSA 
ESSHPSPDES YPGLGVGDRR SRTDPLRRFD ALWWHRAAPI AESDPLADAA EGIDRYLAAG
GGLLLTLRAM ASVDRLSVES VPPDRVGEDS LGEPTGVLWR SLYADHPAMA SLEGLRHHVR
ERGTVPTVRY ERVLPERGEP LASTLRGDTA IPDEVTAVSW RVGEGDERGG PGGAVVGLGA
PVLFADPPGI DDHAHDLEVH DPDAPGLAGT RDCLVAGCLR SLAVDDGTPA RPTDADDMRR
LRDRIDDAGE DGPGGRPKYH LTPPANWLND PNGLIRWDGR YHVFYQYNPG GPFHNTIHWG
HAVSDDLVTW RDEPVALSPS PDGPDRDGCW SGCAVDDDGT PTLLYTGGNG RDQLPCLATT
DDPDLRSWEK YEGNPVIESP PADLDVLETE HWRAEFRDHN VWREDGRWHH LVGTGLVDGG
GAALLYTGET LTEWTYEGPL LAGGPDAGAV WECPELLDLG DRRLLHVSDY ENVVYFLGTV
EDGEFVVDSE GVLDHGDFYA PQSLSDSNRG AEDETDTERS LTWGWLPEAR DVDAQWDAGW
SGALSLPRVI ETAPDGDLRQ RPADEATDLR TERLADGETV ALAPDDQRRL DVSGAAIEIE
IEIALDDAEA VEISVFETPD RAEHTPIRYA RDGTLSIDRT PSSRDPRAFA DAQSMAVPPY
DEPLSLRVFL DRSVIEIYAN GRHCLTSRVY PTRDDAVGVS ARAEGGRAEI ASLSAWELGE
AMPTDGD