Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0190 |
Symbol | |
ID | 7402119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 204282 |
End bp | 207578 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707253 |
Product | hypothetical protein |
Protein accession | YP_002564865 |
Protein GI | 222478628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.363536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAAAGC TATCGCCGGG AACGAACGCG TCGTGTTCCG GAACGCGGTG TCGGTCCTTC ACCGTCGACG AGCGTGCACG AGTCCCGTTC GCGCTGATCG GTGTTCTCCT CCTCGTGACG AGTTCGACGT ACGCCGCAGG GATCGCAGAT CAGGGACTCA TCGGCGAGGA CCGGAGTGTG GAGCGCGCCG TCGAGCGCGT CGACGCCGAT TCTACCGCCG CGCTCAGGGC CGCAGCGCGT CAGGCCGCGC ACGACGCGGC CGCAGAGCCG GTGACGCGAG CGCCGGACGG ATCAGCCGCC GGCTCGGAAG ACGGACACGG ATCAACCGCA GTCAGGAACG GATCGGCCTT CGAAGACGCG TTCCGGATCC GGCTCGCGAT CGCCGGTTCC GAGGCGCTGG CAGCCGTGGA CGCCGAAGTC GGTGACGTGA CGGCGACGGC GTCGCTACCG GCGGTAGACG GTCCGAGTGA GCTGGCGGCG GCGCGCGATC GAATCCGCGT CGAACCCGCC GCGAACGGAA CAGCGACACG GGTGACGTTC GAAGGGGTAG AAACAACCGC GACGCGGAGC GGACGCACCG TCGTGGAGCG AACGGAAGAC CGGACAGTCG TGGTGGCCGT GCCGACGCTC GCGGCACATG AGCGGACGGA GCGGTTCGAG GAGCGGCTTA ACCGGGGTCC GGTCGAGGGG CCGGGGTTGG GGCGCCAGAT CACCGCAAGT CTCTACGCGA TGACATGGGC GCGCGGGTAC GGGCAGTACG CGGGCGCGCC GGTCGAGAAC GTGCTCGCGA ACCGCCACGT GGAGCTGTCG ACGAACGCGG GGATCGTCCG GACGCAGCGT GACGTGTTCG GCACGAGCGA CCCGAACGCA CGCGGGGGAG TGGCGCGCGC GACGGCGCGG ACCGGAGTCA CCGACCTGCT CAAGCCGACC GGCGTCGACG AGGGGTCGTG GACCGAGGCC GTGCTCAGCG CGCCGACACC GGGGGTACGC GATGGTCCGG GCGAGAGAGA CGGTTCGGGC GAGAACGACA CCCCGGGCGA TCAGCCGAGC GAGGACGCTG CGTTCGAGTT CGGAGAAGAG CGGACGGACG AACGGACTTC GGTTCCGGTC GGCCGTGCAG CCGACGAGGG GGCGACGCGC GTCTACGACG ATCTTGACGC GATCATCGAG AGCGCGTATC GGGTCGAAGC ACGCGTCGAG ACGTCGACGA CGCAGGTTGT CGACGGCGGG CGTCCGACGC CGCCGACACC CTCTCCCTCG GAATTCACGA CGACCGGCGA CTGGGAGCGC GTCGACCTGA CACGGACAGA ACGGTCGCCG ATCATCTCCG GATCCGGCGT ACCGGAGGGT GTCCCGTCGG GAATCGTCGA CCCCGGCGAG CGTGTCTCGT TCGGCCTCGA AACCCGCGAG GCGACCGTCA AGCGCGTTGC GGTCGCGGAG TGGGAGCGAG TGACGGTCGA GCGCGGCCCG AACGGGAGCG TCGTCGACGA GCGGGTCCAT CGGGCGACCA CACGCGACGC GGTGACCGAC CACTATCGGG TAAGGGTGTC GGTATCGGGG GAACACGCCC CGACCGACGG AGCGCCCGAC CGGGCCACCG CGACGTTCGG CGCGGGCGAC GCGAGCGACG GTCCGGACCT GCGCGACACG CCCCCGATCG CACGGGTCGA CCTCGATGTC GACACCGAGA ACGGCGTCGA TCGGATCACC GAAGACGCGA TCAGCGGTGG GGAGGTGACG CGATCAACGA CCGTTGTCGG CTCCCGATCG GAGGCCGATC GGAACGCCGT TGCGGCCGAC GTGGCCGCAC TCGCGAGCGA CGTGCGCGAC ATCGAGACCG AGGCGTCGAT GGAGGATGCA GCGGTCGGCG AGGCGGAACC GTACGCCGAT CTCGCCGACG CAATCCGCGA CCGACGCGCG GAACTTGTCG ACGCGCCGGT GACGTACGAC GGAGCCGCTG ATCGGGCCGG AACCGCTGCC CGCACCGCGT ATCTCGACGC CGTAATCGAC GAGCTCGAGT CCGCCGCAAG CGACCGCGAG CGCGCGACTG ACGGGTTCCT CGACCGCGTG AACAGCGCCT TCGACGGCCC TGACGTTGGC GACGCTCTCG CCAGCCGAGA GGCCGCCCGC GATTCCGGGA CGTACGCGAT CGGCGAGGAC GGCCCCGGCG GAGCGGTGAC GTTCGAACCG AACGGCTCTC CGGGATACCT CCCGCGGACG ACCGTCGACG GGGAAGCCGT CGACGGCGTG GATGGGACGA CGACGCGACC GCTCGCGGTC CGCAACGTGA ACTACGTCAC CGTGCCGTAC GGGGAGGTGT CGAGCGGCGT CGTCGATCGG ATCCTCGGTA CCGAGGACAC GGTACGGGTC GGAACTGCGG GCCGGTCGCT CCTCCTAGCG AACGATGCGC TGGCAGCGGA CGACGATCCC GACCTGCGAG CAGATCGGGA TGCACTTGCC CGGCAGATAG ACGGGTCGCT GGACGAGGTT GACGGGGCGC TCGGATCGAC ACTCCGCGTC CGGACCTCGC TGTCGCGAGA CGAGCGACGC AGAGCGCTCG ACGAGGCCGC CGCGAGCTAC GGTTCCCCCG GCGAGCGTGC GGTTGCCGTC GGAGACGGAT CCTATCCGGA TCGCGTCGCC GCCGAGGCCG CAAGCGTCGG GTCGCTGTCG CGGGCGGCCG AAGAGGCGCT CGCCGCGAAC TTACGCGTCG CGACGCGGAC CGCGGCCGGG AGGGACGCGG TTCGGGTCCC GACCCGGTTC GTCGACGAGA CGACGAGCGG AGCGCGGGTG TTGCTCAGAG ACCGGATGGA AAAGGCGGTT GAGAACCAAG CGGCGCGCGC GGGCAAAGCA GCTACCGGAA AGGTGAGCGA GAAGGCTGTA AAAGAACTGG GCGAAAAGTG GTCGATGAAG CCCGCTCGAA CCGTCGGCGC AGGGCTCCCA GTCGCGCCCG TTCCGGGGTA CTGGGTGACG ACGGTGAACG CCTGGCGCGT GCAGATCCGC GGGGAGTACC CGCGATTCGC GCTGCGCGCC AATGTGGGAA CACCCGATAA ACGCTTTACG TACGTCCGGA GCGAGGGCGA TGTGACCGTC GACGTGGGCG GAGAGACGGT GCAACTGGGC AAGACGGAAC CGATCGCGTT CGAGGCCGAG ACGGTAGTCG CCGTCGCGGT CCCAGCCGGC CCGCCCGGTG TCGGCGATGT CGACGGAACT CGCGATGAGC GGTCAGGGGG ATGGCCATGC CCCGGCGCGA TGGGGGAATC GCCGTCGGGC GGCGAGGGGA AGTGTTCGAA GCCGTGA
|
Protein sequence | MGKLSPGTNA SCSGTRCRSF TVDERARVPF ALIGVLLLVT SSTYAAGIAD QGLIGEDRSV ERAVERVDAD STAALRAAAR QAAHDAAAEP VTRAPDGSAA GSEDGHGSTA VRNGSAFEDA FRIRLAIAGS EALAAVDAEV GDVTATASLP AVDGPSELAA ARDRIRVEPA ANGTATRVTF EGVETTATRS GRTVVERTED RTVVVAVPTL AAHERTERFE ERLNRGPVEG PGLGRQITAS LYAMTWARGY GQYAGAPVEN VLANRHVELS TNAGIVRTQR DVFGTSDPNA RGGVARATAR TGVTDLLKPT GVDEGSWTEA VLSAPTPGVR DGPGERDGSG ENDTPGDQPS EDAAFEFGEE RTDERTSVPV GRAADEGATR VYDDLDAIIE SAYRVEARVE TSTTQVVDGG RPTPPTPSPS EFTTTGDWER VDLTRTERSP IISGSGVPEG VPSGIVDPGE RVSFGLETRE ATVKRVAVAE WERVTVERGP NGSVVDERVH RATTRDAVTD HYRVRVSVSG EHAPTDGAPD RATATFGAGD ASDGPDLRDT PPIARVDLDV DTENGVDRIT EDAISGGEVT RSTTVVGSRS EADRNAVAAD VAALASDVRD IETEASMEDA AVGEAEPYAD LADAIRDRRA ELVDAPVTYD GAADRAGTAA RTAYLDAVID ELESAASDRE RATDGFLDRV NSAFDGPDVG DALASREAAR DSGTYAIGED GPGGAVTFEP NGSPGYLPRT TVDGEAVDGV DGTTTRPLAV RNVNYVTVPY GEVSSGVVDR ILGTEDTVRV GTAGRSLLLA NDALAADDDP DLRADRDALA RQIDGSLDEV DGALGSTLRV RTSLSRDERR RALDEAAASY GSPGERAVAV GDGSYPDRVA AEAASVGSLS RAAEEALAAN LRVATRTAAG RDAVRVPTRF VDETTSGARV LLRDRMEKAV ENQAARAGKA ATGKVSEKAV KELGEKWSMK PARTVGAGLP VAPVPGYWVT TVNAWRVQIR GEYPRFALRA NVGTPDKRFT YVRSEGDVTV DVGGETVQLG KTEPIAFEAE TVVAVAVPAG PPGVGDVDGT RDERSGGWPC PGAMGESPSG GEGKCSKP
|
| |