Gene Hlac_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2472 
Symbol 
ID7401524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2450530 
End bp2452317 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content69% 
IMG OID643709544 
Producthypothetical protein 
Protein accessionYP_002567115 
Protein GI222480878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGGG TTCTCACCCT CTCGGTTATC GTCGCCGTGA TTGCCCTTCT CTCGATCGGT 
GCCGGCGGAG CCGTTGCCGC CGGGCCAGCG GTCGCCGGGT CGACGGCCGC CATTCCGGGG
GACGCTCCGG CCAGCGCGAC TGCCACAGAG TGCGACGCCG GCGACGGAAC CGATCTCGTC
GGCTGCTGGA ACGGGACCCA CTACGAGGAG GAGCTCGCCT TCAACCAGAC GGACGGGCTG
ACCGAGGCGG AGCTGGAGGA GCTGACTCAC CTGACGATGG CCCGCGTCGA ACACGTTCGG
GAGCGCCCCT TCCGCGAGGA CGTGCCGGTC GAGACCGTCA CCCGCTCGGC GTTCATGAAT
GACTCTGCGA GCGCGGGCGC GGGCGGCTCG GACCCCGAGT TCCACCGCTG GAACGATCAG
GTGTGGAAGG CCCTGTTCGT CGTCGGCGAG GACGAGAACG CCTCCGACGC GATTGACAGC
GTCTACGGCG GTGCAGTCTC CGGGTTCTAC TCGCCGGCCG ACGACCGGAT CGTCCTCGTC
GTCCCGGAGG GAGAGGACCC GCAGATCAAC CCGTCGACGC TGGCACACGA GCTGGTCCAC
GCGATGCAGG ACCAGTACCA CGACCTCACC CGGCCCCGCT ACGTCGGCAC TACGCAGGAC
GCCGACCTCG CGGTCGACGG GATCGTCGAG GGCGAGGCGG TCCACATCGA GGAGGTGTAC
GACGCGCGCT GTGCCGGCAA CTGGAGCTGT CTCGCCGCGC CCGACTCCGG TGGCGGCGGC
GGGTCGGCGG CGGACTACAA CTTCGGCATC CTCCAGACCG TGCTTCAGCC GTACGCCGAT
GGCGCGCTCT ACGCCGAGAC GCTCGTCGAC GAGGAGGGGT GGAGCGCCGT CAACGAGACC
ATGAACCGGC CGCCGAACGC GACCTCGGAG GTGATCCACC GCAACCCCGA TTACGAGACG
ACCGAGGTAA CGTTCGAGGA CACGGCCACC GGCGGGTGGG AGACGTATCC GAATCAGGGG
GTCAACGGCT CGGAAACCGC CGGCGAGGCG TCGATGTTCG TGATGTTCTG GTACCAGAGC
TACGAGTACC GCCACGCGGT GTTGGACCCG GACGCGACGA TCCGGGATAA TATCCAAATT
CACACGCAGC CGGACGAGCG GCTTCGAACT CGTGCGAACT ACAACTACGC CCACGAGGCG
ACCGACGGTT GGGCGGGCGA CGAGCTGTAC CCCTACCGGA ACGACGGGAA CGCGGACGGG
GACGACGCGA GCGCGACCGA CGGGGAGGAC GGCTACGTCT GGGTGACCGA GTGGCAGACG
CCCGCGGACG CGACCGAGTT CCGCGAGGCG TACCTGCGCA TGCTGACCGC CCACGGCGGC
GACGACCACG CCGCGGGCGA GGTGTACGAG ATCGCGGACG GCGACTTCCG CGGGGCCTAC
GGCGTCGAGC GAAACGGGAC CACGGTGACG ATCGCGCACG CCCCCGAGCC AGCCGACGTG
CTCGATCTCC GGCCGGAGGC CGACCTCGAA CTCTCCTCGA CCGACGACGG CGACGACGCG
AACAGGACCG ACGGGGATGA CGGAACCGAC GGAGACGACG CGGACGGGAC CAACGACGGA
ACCGATTCTG ACGATGGAGA CGACATCGAC CCGGACGGCG ATGACGCCGA CGGCTCCGCC
GGTTCCGACG CTGCCACCGG CGACGACGTG CCCGGGTTCG GTCCCCTCGT CGCGCTTGTC
GGCATACTCG CGACGGTAGC GCTCTTTGTG CGCCGCGTAC GGCCCTGA
 
Protein sequence
MRRVLTLSVI VAVIALLSIG AGGAVAAGPA VAGSTAAIPG DAPASATATE CDAGDGTDLV 
GCWNGTHYEE ELAFNQTDGL TEAELEELTH LTMARVEHVR ERPFREDVPV ETVTRSAFMN
DSASAGAGGS DPEFHRWNDQ VWKALFVVGE DENASDAIDS VYGGAVSGFY SPADDRIVLV
VPEGEDPQIN PSTLAHELVH AMQDQYHDLT RPRYVGTTQD ADLAVDGIVE GEAVHIEEVY
DARCAGNWSC LAAPDSGGGG GSAADYNFGI LQTVLQPYAD GALYAETLVD EEGWSAVNET
MNRPPNATSE VIHRNPDYET TEVTFEDTAT GGWETYPNQG VNGSETAGEA SMFVMFWYQS
YEYRHAVLDP DATIRDNIQI HTQPDERLRT RANYNYAHEA TDGWAGDELY PYRNDGNADG
DDASATDGED GYVWVTEWQT PADATEFREA YLRMLTAHGG DDHAAGEVYE IADGDFRGAY
GVERNGTTVT IAHAPEPADV LDLRPEADLE LSSTDDGDDA NRTDGDDGTD GDDADGTNDG
TDSDDGDDID PDGDDADGSA GSDAATGDDV PGFGPLVALV GILATVALFV RRVRP