Gene Hlac_2869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2869 
Symbol 
ID7399105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp129829 
End bp131868 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content67% 
IMG OID643706689 
ProductAlpha-galactosidase-like protein 
Protein accessionYP_002564315 
Protein GI222475794 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAC ACAGGGTGGG CGCTACTGCA GTCACCCACG ATCCGGATGG CCATGCGGTC 
ACGGTCGGAG ACGACGCCGG TGACCTACTT GTCGGGACGG TCCACGCGGC GACGGCCGAC
GGAGACGACG GGCGGCTACA GGTCGCTGAC GTGTCGCTGT CGACCGAGGC GAGAGGTGTC
GTCGCGACCT ACACCGTGGA AAACGTCGGC GACGAGCCGG TCCGGGCAGG CGACGTGACA
CTCGCGTTCG AGACCGCGTT CGGCGCCGAT GCCCGCGTCT ACCGGCACGG GTATCAATCG
TGGTCGTCGA CGGGAACGCT CCCCGTCGGA GAGCGATTCT CACCCGAGAA CTCGGATAAC
GCTCCGATGA TGAATGACCT CGCGGCGTCG ACGGACGACC GGGTCAGCAG CTATCTGACC
GGGCTTGTCG AGGGCGACCG ACGCGTCACT GCGGGCTTTC TCGAACATGA CCGCTACTGT
TCTCGATTCG AGATCGATGA CGACGATGGC GGCGTTGCGA CGCTGCACGC AGTGTGTCCA
CTGGAAGGGG CACGGCTGAC ACCCGGTGAA CGGCTGACTC TGCCACCACT GTGGGTCGAC
GCTGACCGCG AACTTCGAGA GGGGCTGACC GCGCTGGCCG ATTGCATCGG TGAGCGGATG
GATGCTCGCG TGCCTGAAAT CTCACCGACC GGATGGTGCT CGTGGTATCA CTATTTCACC
GATGTCACCG AGGCGGACGT GCGGGAGAAC CTCTCGGAGC TTCGAGAGTG GGGAATTCCG
GTCGACGTCG TTCAGATCGA CGACGGATAC ATGCAGGCGT TCGGCGACTG GCGGTCGATC
GCCAACGGGT TCGAGGACAT GAGCGCCGTC GCGGACGACA TCGCGGCTTC GGGGTACCGA
CCCGGGCTGT GGCTCGCGCC GTTCTACGTC GAGGCGGGTG CCGACCTATA CGCTGACCAC
CCGGAGTGGT TCATCACGGA GCCGACGGAC GCAGACACCG ACGGCCCGGG AACACCCGTC
GACGGTGGCT TCCGAGCCGG GTCAGAACTA TACGGACTCG ACACGACGCA CCCGGCGGTG
CTTGAGTGGC TCCGGGAGAC CGTGTCGACG GTTGTCGACG ACTGGGGGTT CACGTATCTG
AAGCTCGACT TCCTCTTTGC GGCGGCGCTG CCCGGCGAGC GGTACGACGC CGAGGCAACC
CGGATCGAGG CGTATCGCCG GGGGGTCGAG GCGATAGCGG AGGCGGCCGG CGACGATGTG
TTCCTTCTCG GCTGCGGTGC GCCGATGGCC CCGAGTGTCG GGCTCTTCGA CGCGATGCGG
GTCGGCCCAG ACACCGATCC GGTCTGGGAG ACTCCGGGAG AGTCGGGGAG CCAGCCGGGA
TTGAAAAACG CCGTTCGCAA CACGCTCACA CGGAACTACC TGCATCGACG GTGGTGGCTC
AACGATCCCG ACTGCCAACT TGTTCGAGAC ACGAGTGACC TCACTGCGGC CGAGCGCGAA
GCCTTCGCGA CGCTCGTCGC CGCGACGGGT GGCGTGAACA TATTTTCGGA CCGGCTCGCG
GAGATCGGTT CGGCCGGCCG GCGGCTGCTT GAGCGGTCCA TCCCTCCTGC AAGCGGCGGC
GAGGTCTCCG GGCTCAGTGT GGAGCGGTTC CCCTCGCACG TCGTCTGCGA CCGGCCGGGC
GACGGGGCCA CGACCGTCGC GCTGTTTAAT TGGGCGGACG AGCCGTCGAC GGTCCGATTC
GACGCGCGCG AGCACGTGGA GGGCGACTCA GATGCCGATC ACGTCATCTG GGACGGGCTG
TCCGGTGCGG TCGTCGACGG GCCGGTCGTC GAGCGAGAGC TACCCTCTCA CGGTGCGGCG
GTGTTTGCTG TCGTTCCCGC CACCACCGGC AACCTCCTGG GCGACGCCGC GACGCTCACG
GGTGGGTCCG ACCGCGCGTC GACGGCGACG CTTCGGGACG GAGCACTCGA AGCAGCCGTC
GACGGGGAGG CGGTGACGTT CGCGCTCGAG CGGGACGGTC CCACGTCTAC TCAGGAGTAG
 
Protein sequence
MNEHRVGATA VTHDPDGHAV TVGDDAGDLL VGTVHAATAD GDDGRLQVAD VSLSTEARGV 
VATYTVENVG DEPVRAGDVT LAFETAFGAD ARVYRHGYQS WSSTGTLPVG ERFSPENSDN
APMMNDLAAS TDDRVSSYLT GLVEGDRRVT AGFLEHDRYC SRFEIDDDDG GVATLHAVCP
LEGARLTPGE RLTLPPLWVD ADRELREGLT ALADCIGERM DARVPEISPT GWCSWYHYFT
DVTEADVREN LSELREWGIP VDVVQIDDGY MQAFGDWRSI ANGFEDMSAV ADDIAASGYR
PGLWLAPFYV EAGADLYADH PEWFITEPTD ADTDGPGTPV DGGFRAGSEL YGLDTTHPAV
LEWLRETVST VVDDWGFTYL KLDFLFAAAL PGERYDAEAT RIEAYRRGVE AIAEAAGDDV
FLLGCGAPMA PSVGLFDAMR VGPDTDPVWE TPGESGSQPG LKNAVRNTLT RNYLHRRWWL
NDPDCQLVRD TSDLTAAERE AFATLVAATG GVNIFSDRLA EIGSAGRRLL ERSIPPASGG
EVSGLSVERF PSHVVCDRPG DGATTVALFN WADEPSTVRF DAREHVEGDS DADHVIWDGL
SGAVVDGPVV ERELPSHGAA VFAVVPATTG NLLGDAATLT GGSDRASTAT LRDGALEAAV
DGEAVTFALE RDGPTSTQE