Gene Hlac_2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2451 
Symbol 
ID7400569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2429276 
End bp2430496 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content70% 
IMG OID643709524 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_002567096 
Protein GI222480859 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC ACGTCACACA TCCCGCCGCC GATAGACGCA CCGTGCTCCG AGCCACCGGT 
AGCCTCACGG CGCTCTCGCT TGCCGGCTGT CTCGGCAGCG GCGGAGCGAT CGACGACGAA
GACGTCGGGT ATTCCGTGGA GCGTGTTGCG GACGGGTTCG AGAACCCATG GGGACTCGCG
TTCCTCCCGG ACGACGGGCG GCTGCTGGTC ACCGAACGCC CCGGTCGGCT CTCGCTCGTC
GACCCCGAGG ACGGGACGCG CTCGACCGTC CAGGGGATTC CCGAGGTCCA CGCCGCCGGA
CAGGGCGGGC TCCTCGACGT CGCGGTCCAC CCCGAGTTCC CGGGCGGCGC GCGCGTGTAC
CTCACGTACG CCGCGACGAA CGACGCGGGC GAGTCGGCGA CGCACGTTGG AAGCGGGCAA
CTCTCTCTCG CCGCCGACGA CTCTCCCGCG CTCGACGGGT TTGAGGCGCT CCACGTCGCG
GAGCCGTTCG TCGACTCTGA TCTCCACTTC GGCTCGCGGG CGACGTTCGG CCCGGACGGC
GCGCTGTTCG TCACAGTCGG GGATCGGCGC GACACCAATT TCAGCCCGGA CCACGTCTCG
CAGGACCGAT CTGTGGAGCT CGGCTCGACT CTTCGGCTCA CTTCCGATGG AGATGCCCAC
CCCGATAATC TGTTCGTCGA CGACGCGGAG GCGGCGGACG CGATCTACAG CTACGGGCAC
CGGAACCCGC AGGCGATGGC GGTGCGCCCT GAGACGGGTG CGATCTGGCA GTGCGAGCAC
GGTGAGGAGG ACGGCGACGA GATCAACGTG ATCGAGCGCG GCGGCAACTA TGGCTGGCCG
GTCGCCAGTG AGGCGTGTCG GTACGGCACC GACGAGCGCG TCGCCCCGAG CCACCGCGAG
CGCAGCGATG TCGTCGCCCC CGTTCACTAC TGGCCATGCG GCTCCGGCGG CTTCCCCCCG
AGCGGGGCCG TCTTCTACGA TGGCGACGCC TTCCCCGACT GGCGCGGTGA CCTGTTCGCG
GGGACGCTCG CCGGGCGGTA TCTCGGGCGG TTCACAGTCG AGGGTGCAGG TGCTACCGAC
ACCGCGGTGA CCGAACGCGA CTCCCTCCTC GCCGACCGCG ACTGGCGGAT CCGGTCACTC
GCGGTCGAAC CGGCCACGGG CCACCTCTAC GTCGCCGTCG ACGACGCGGA TGCGCCGATC
GCACGGATCG TTCCGGTCTG A
 
Protein sequence
MARHVTHPAA DRRTVLRATG SLTALSLAGC LGSGGAIDDE DVGYSVERVA DGFENPWGLA 
FLPDDGRLLV TERPGRLSLV DPEDGTRSTV QGIPEVHAAG QGGLLDVAVH PEFPGGARVY
LTYAATNDAG ESATHVGSGQ LSLAADDSPA LDGFEALHVA EPFVDSDLHF GSRATFGPDG
ALFVTVGDRR DTNFSPDHVS QDRSVELGST LRLTSDGDAH PDNLFVDDAE AADAIYSYGH
RNPQAMAVRP ETGAIWQCEH GEEDGDEINV IERGGNYGWP VASEACRYGT DERVAPSHRE
RSDVVAPVHY WPCGSGGFPP SGAVFYDGDA FPDWRGDLFA GTLAGRYLGR FTVEGAGATD
TAVTERDSLL ADRDWRIRSL AVEPATGHLY VAVDDADAPI ARIVPV