Gene Hlac_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2004 
Symbol 
ID7402023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1997922 
End bp1999013 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID643709075 
Producthypothetical protein 
Protein accessionYP_002566652 
Protein GI222480415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.069614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.292066 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA GAACGAGCGG GACGGCGTCC GAAGAGGCGA ACGAGGGCGA GAAAGCGGAG 
GGAGGCGAAG ATACCGACGA CGGCGACACG GCCGACACGA TGCGCGTTCG CGCCGGTGAT
AGCCGGGTGA AGCTCTGGCT GCTGTTGCGT GCGAACCGGC TCGTCGTCGC GGGAGTCTTA
ACACTCGTCG TCTTCGTCGC GTTCGTCACC GTGGCGGCCG CGTTCTCCCC GTCTCTCGCT
GAGAAGATCG GGTCGGGCGA TCCGATCGAT ACGCTGTTCT CGACGATGAT CGCGGCGATC
GTCACCGGAA CGACGCTCGT CGTCACAATC GGTCAGGTCG TGCTCACGCA GGAGAACGGC
CCGCTCGGAG ACCAGCAGGA GCGCATGAAC GACACGCTCA CCGTTCGAGA CTCTATCGCG
GAACTGACCG GCTCCCCGGT GCCCACGGAC CCCGCCGCGT TCCTCGATGC GATCCTCGTC
GCCGCGTCGG AGCGCAGCCG GGCCCTCCGC GAGTCGGTTC GGGAGCGCGA CGGCGACCGG
TCGGACCGGA TCGCCATCCG AGAGGACGTC GACGACCTCG CCGCGAATAT CATCGAAAAC
GCCGACGGCG TGCGCGACAG TCTGGACGGT GCGGAGTTCG GCTCCTTCGA CGTGGTGTTC
GCGGCCATCG ACTTCGACTA CAGCCCGAAG ATCGGCCAGA TCGAGCGCGT CGACGACGAC
CACGACGACG CGTTCACCGA CGACGAGCGC GCTCTGCTCA AGGAGCTGAA GGAGTCGCTG
TCGCTGTTCG GTCCCGCCCG CGAACACATC AAGACGCTGT ACTTCCAGTG GACGCTGATC
GACCTCTCGC GGCAGATCCT CTACGCCGCG GTGCCCGCGT TGGTCGTCGC GGGGCTCATG
CTCGCGGTCG TCGACGCCGG GACGTTCCCC GGGAGCACCC GCGGGGTCGA CCACGTGACG
CTCGTCGTCG GGGCTGCGTT CGCGGTCACG CTCCTCCCTT TCTTGCTTTT CGTCTCCTAC
GTGCTCCGCG TACTCACCCT CGCGAAGCGC ACGCTCGCCA TCGGGCCGTT GGTGCTGCGG
GACTCGAAAT GA
 
Protein sequence
MTERTSGTAS EEANEGEKAE GGEDTDDGDT ADTMRVRAGD SRVKLWLLLR ANRLVVAGVL 
TLVVFVAFVT VAAAFSPSLA EKIGSGDPID TLFSTMIAAI VTGTTLVVTI GQVVLTQENG
PLGDQQERMN DTLTVRDSIA ELTGSPVPTD PAAFLDAILV AASERSRALR ESVRERDGDR
SDRIAIREDV DDLAANIIEN ADGVRDSLDG AEFGSFDVVF AAIDFDYSPK IGQIERVDDD
HDDAFTDDER ALLKELKESL SLFGPAREHI KTLYFQWTLI DLSRQILYAA VPALVVAGLM
LAVVDAGTFP GSTRGVDHVT LVVGAAFAVT LLPFLLFVSY VLRVLTLAKR TLAIGPLVLR
DSK