Gene Hlac_1511 details


Gene Information

Locus tag: Hlac_1511
Symbol:
ID: 7401437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1523726
End bp: 1525756
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 68%
IMG OID: 643708573
Product: hypothetical protein
Protein accession: YP_002566169
Protein GI: 222479932
COG category:
COG ID:
TIGRFAM ID:
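
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only; it assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. Neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 1523726
end_bp = 1525756

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2031
print(protein_length)  # 676
```

Under those assumptions the results match the reported Gene Length (2031 bp) and Protein Length (676 aa).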


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0798411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACA GCGCATTCAC ATTGGCGATA GCGGTCGTAG TGGTCGTGTC GGCGGTCGCC 
GGCGCGGCTG GTGCGGTCGC GGCACAGAGC GGTACCAACG AGGCGAGCGT GAGCTTCTCC
GACGGAACGT CCGGCGGGAG CACGGTCGTC GTCGACGAAG TCCACGTTCC CGACGGCGGG
TTCGTGACGA TTCACGACGG ATCGCTGACC GACGGGAACA CCCTCGGGAG CGTCGTCGGC
ACGAGTGCGT ACCTCGAACC GGGGACGCAC GCGAACGTGA CCGTCGAACT CACCGACGCG
GTCGAAACGG GCACGTTCCA CGCGATGGCG CACCGCGACA CCGACGGTGA CCGCGCGTAC
GCGTTCGTCT CCTCAAACGG CGGGGCAGAC GGGCCGTACA CGGTCGACGG CGACATCGCG
ATGGACGACG CGAACGTGAC GGTCTCCGCG AGCGTGACCG GGACCGACCA GCCCACCGAG
GGCGAGTACG TGATCGTCGA CCGCGTCGAG CTGAGCGACG GCGGGTTCGT GACGGTTCAC
GACTCCTCGC TCGCTGACGG CGCGGTGTTC GACTCGATCC GCGGCACGAG CGCGTACCTA
GAGGCGGGCG TCCACGAAGA TGTCCGTGTC CAGCTCGATG ATCCCCTGCA GAACGACGAT
ACGGTGTTCG CGATGGCGCA CCGAGACACG AACGGCAACG AGGCGTACGA CTTCCCCAGC
AGTGACGGCA GCGAGGACGG TCCGTACCTC GACGCGAGTG ACGAGATCGT GATGGCCGGT
ATCGACGCCG AACTCGACGA CGAGGCGCGG TCGAGCTTCG ACGCCCAGAC GTCGGGCGGT
AACGCGGTCG TCGTCGACGA GATCTACCTG CCGGAGGGCG GGTTCGTGAC GATGCACGAC
TCCTCGCTCG CGGACGGCGC GGTGTTCGAC TCCATCAGCG GCACGAGCGC GTACCTCGAA
CCGGGGATCC ACCGCGATGT CGTCGTCCGG CTCGACGACC CGCTAACCGA AGATGACGCG
CTCTTCGCGA TGGCGCACCA GGACACGAAC GGCAACGAGG CGTACGACTT CCCCAGCAGT
GACGGCGCCG AGGACGGTCC GTACACGACC GATTCGTCCG ACATCGTGAT GGACGACGGG
AACGTCACCG TCTCCGCGAG CGTGGCGTTC GAGACGGACG GCTCGGACGG GACGGCGGTC
ACGGTCGACC GAGTGGACCT GAGCCAAGGC GGATTCGTGA CGATTCACGA CGCCTCGATC
GGCGGAGGCG CCGTGTTCGA CTCCGTTCGC GGTACCAGCG CGTACCTGGA GGCTGGCCTC
CACGAGGACG TGACGATCGA ACTCGACGAG CCGCTGACGG ACACGGAGCA GCTGGTCGCG
ATGCCGCACC GTGACACCAA CGACAACGAG GCGTACGACT TCGTCGACAG CGAGGGAGGG
GCCGACGGTC CCTTCCTGAC CGGTGAGGAC GCTCCCGTGA CGGCCGGAGC GACCGCGCAG
GTGACTGCGT CCGTCGGCGC GATCGCGCAG GACACCGACG GCGAGACCGT CGTCGTCGAC
TCGGTGACGC TCCACAACGG CGGATTCGTG ACGGTTCACG ACTCCTCGCT CGCGGACGGC
GCGGTGTTCG ACTCCATCCG CGGCACCAGC ACGTACCTCG GGCCGGGAAC GCACACCGAC
GTCGAGATCG CGCTCGACGA CCCGCTGAGC GAAGATGACA CGGTGTTCGC GATGGCGCAC
CGCGACACCA ACGCCAACCA GGCGTACGAC TTCCCCGCCA CAGATGGCGA CGAAGACGGT
CCGTACACCG CCGCGGGCGC GCCGGTGATG TCCGCCGCCG ATCTGACCGT CGAAGCGGGT
GGGGACGCCG GCGATAGCAT GTCGGACGGC GACGGAGCCG ACAGCGGGGA GATGAGCGAC
GACGGTGCAA GCGACGAGGA AGGAAGCGGC GACGAGGCGC CCGGGTTCGG CGCCGTCCTC
ACGCTCGTCG CGCTGATCGC GGTCGCGCTC GTCGCGCGCC GTCACACCTG A
 
Protein sequence
MRNSAFTLAI AVVVVVSAVA GAAGAVAAQS GTNEASVSFS DGTSGGSTVV VDEVHVPDGG 
FVTIHDGSLT DGNTLGSVVG TSAYLEPGTH ANVTVELTDA VETGTFHAMA HRDTDGDRAY
AFVSSNGGAD GPYTVDGDIA MDDANVTVSA SVTGTDQPTE GEYVIVDRVE LSDGGFVTVH
DSSLADGAVF DSIRGTSAYL EAGVHEDVRV QLDDPLQNDD TVFAMAHRDT NGNEAYDFPS
SDGSEDGPYL DASDEIVMAG IDAELDDEAR SSFDAQTSGG NAVVVDEIYL PEGGFVTMHD
SSLADGAVFD SISGTSAYLE PGIHRDVVVR LDDPLTEDDA LFAMAHQDTN GNEAYDFPSS
DGAEDGPYTT DSSDIVMDDG NVTVSASVAF ETDGSDGTAV TVDRVDLSQG GFVTIHDASI
GGGAVFDSVR GTSAYLEAGL HEDVTIELDE PLTDTEQLVA MPHRDTNDNE AYDFVDSEGG
ADGPFLTGED APVTAGATAQ VTASVGAIAQ DTDGETVVVD SVTLHNGGFV TVHDSSLADG
AVFDSIRGTS TYLGPGTHTD VEIALDDPLS EDDTVFAMAH RDTNANQAYD FPATDGDEDG
PYTAAGAPVM SAADLTVEAG GDAGDSMSDG DGADSGEMSD DGASDEEGSG DEAPGFGAVL
TLVALIAVAL VARRHT
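
The sequences above are printed in blocks of ten residues, sixty per line, so the spaces and line breaks have to be removed before any downstream use. A minimal sketch of that cleanup follows, together with a simple GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the demo input is only the first line of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, used only as a short demo input.
demo = """
ATGCGCAACA GCGCATTCAC ATTGGCGATA GCGGTCGTAG TGGTCGTGTC GGCGGTCGCC
"""

seq = clean_sequence(demo)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value to compare against the reported GC content field.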