Gene Information |
Locus tag | Hmuk_0083 |
Symbol | |
ID | 8409580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 79202 |
End bp | 81013 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645018410 |
Product | Heparinase II/III family protein |
Protein accession | YP_003175930 |
Protein GI | 257386157 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
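The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 79202
end_bp = 81013
listed_gene_length_bp = 1812
listed_protein_length_aa = 603

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last codon is the stop
# and contributes no residue to the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
expected_protein_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("leftover bases:", leftover_bases)
print("expected protein length (aa):", expected_protein_aa)
print("matches record:",
      gene_length_bp == listed_gene_length_bp
      and expected_protein_aa == listed_protein_length_aa)
```

The strand field does not enter the calculation: a gene on the minus strand spans the same number of bases as one on the plus strand.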
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.137335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAACGATT TCTATTCCGA GCGTTCTCAA AGCGTGGGTG AGCGACTTGC CCTTGGAGTA TTCACAGCTG GACGACTGCA GCGGGAGCAG GTTCTGGGCA TTCTTGAGCG AAAAGTCCGT CACGCGCTCC TTCCAAGGCT CCCTATTGAC TTTGACGCAC GCTATGACGC ACAGGTACCC AATAAACTCA AACTGACGGC TGGTCCCATT AATGAGAACA CAAGACGTCT CAGAGACAGT CTAGAACCGG AGACCAAACA ACGTTATCAA GAGCGGTTAA ACCATTTTGA AAATGGACAA CTCTCGTTTA TTGGTCATGA AGTTTCCCTT GACGCTGACC GGACGATCGA TTGGGACGAT GATCAACTGG AGGGAATACC CTTGCTCTGG TGGCTCAAAT ACCAATCGCT CGAACCGCTC AAATGGTTTC TCTTCAGCCC CGAGGAGTCT ATAGACCGGG AACGAGTTGT CCGGGAGGTT CTTGACCCAT GGGTCCGAAC GATGGCGTCA ACGGCACAGA TTGGCACCCC TGAATATCTT CGGCGCGACT GGATTCCACA TGCAGTCTCG CTTCGAGTCA TGACTCTTTC TCGGTACTCT GCCTGGCTCG AAAGTGAAGG AATGATTGCT TCTAGGCAAT TAGTATTAGA ATACTTGTAC AAGAATGCAC TATTTCTGCA GAATCATATA GAAACCGATG TCGGTGGGAA CCATCTAATT GAGAACGCTC TGGCCTTGCT TATGGCTGGA TTGATCTTTC AGCAGGCCGA TGAATGGATC CGAAGCGGAA CGGAATTATT TGAGCGAACG GCTATAAACC AGTTTCTCGG GGATGGCAGT CATTTTGAAA AGAGCCCGAT GTATCATATT ATGTCTCTTA GGAGGATGTT AACTTCCATT GGACTTTTAT CCAAGTATGG TTACTCCATC CCACAGACCA TTCAACATAG TGCTGAGAAG GCGACAGAGT TTTTAACATA CCTAACTCCA CCAGATGGAC AGATTCCCCT ATTAAATGAT TCTCAGTTTT ATCAAGTACT TTCGATTTCA GAGATTTTAC TGTATGCTGA CAAAATAGAC ATTACGGCGG ACAACAATCC AGACCAGTTT GGAGCTTCAG GGTACTATTG GCTAGGAAGC GGTGCTGATC GTATGCTTGT TGATGGTGGA TGTGTTGGCC CTCCCCATTT GCCTGGCCAT TCTCACATTG ATCTCCTCAG CATTCTGCTG TGGCTTGATG GTAACCGTGT ACTCACCGAT ACGGGGGTCT ACCAGTATGC AGATGATGTT CACCGTCAGT ATGCGAGGAG TATTAAAGCA CATAATTCAG TTCAAGTCGG TGATTGCAAT CCGATCAAAA TCGGTGGACA ATATCTTATG GGTCGCCGGA CGGCACCTGA AACTACCATG TCTCGCAACG AATCATTTAA GCAATTTGAG GGGAAGTATG AGGTAGAAGG ATTTCTCAAA CCGTTATACA CACATAAAAG ACGAATTCAG ACAAATGGGA AGTGGTGGCT GGTCACTGAT GAGGTCTCCA GTCACGGGGA TCGCCCCATT ATCTCACGAC TTCATTTTCA TCCCTCAATA GATTTGAAAA AAGTAGATGA TGCAAAATTC AATATAATTT ATGATTCTGA ATATCTGCTC GGGTATCTCC AAGCATTGCA CTGTGAGAGT ATCGTACTTG ACACAAGCCC CTACTTTCCC GAATTTGGGA CCAAGATTAG CAGAAAGGTG CTTGAACTGA GATACGAATC GACTCCCGCA GGATATGTTA TTTCAAAAAA TGAAGGTTCT 
GAAGTGAAGT GA
|
Protein sequence | MNDFYSERSQ SVGERLALGV FTAGRLQREQ VLGILERKVR HALLPRLPID FDARYDAQVP NKLKLTAGPI NENTRRLRDS LEPETKQRYQ ERLNHFENGQ LSFIGHEVSL DADRTIDWDD DQLEGIPLLW WLKYQSLEPL KWFLFSPEES IDRERVVREV LDPWVRTMAS TAQIGTPEYL RRDWIPHAVS LRVMTLSRYS AWLESEGMIA SRQLVLEYLY KNALFLQNHI ETDVGGNHLI ENALALLMAG LIFQQADEWI RSGTELFERT AINQFLGDGS HFEKSPMYHI MSLRRMLTSI GLLSKYGYSI PQTIQHSAEK ATEFLTYLTP PDGQIPLLND SQFYQVLSIS EILLYADKID ITADNNPDQF GASGYYWLGS GADRMLVDGG CVGPPHLPGH SHIDLLSILL WLDGNRVLTD TGVYQYADDV HRQYARSIKA HNSVQVGDCN PIKIGGQYLM GRRTAPETTM SRNESFKQFE GKYEVEGFLK PLYTHKRRIQ TNGKWWLVTD EVSSHGDRPI ISRLHFHPSI DLKKVDDAKF NIIYDSEYLL GYLQALHCES IVLDTSPYFP EFGTKISRKV LELRYESTPA GYVISKNEGS EVK
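The sequences above are printed in 10-character blocks separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch of how that might be done, along with a GC-fraction helper of the kind that could be used to check the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only short excerpts of the gene sequence are used here, so the printed fraction is for the excerpt and not for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the gene sequence: its first two blocks and its last two.
head = clean_sequence("GTGAACGATT TCTATTCCGA")
tail = clean_sequence("GAAGTGAAGT GA")

print("excerpt length:", len(head))
print("excerpt GC fraction:", gc_fraction(head))
print("first codon:", head[:3])
print("last codon:", tail[-3:])
```

The first codon, GTG, is one of the alternative initiation codons in translation table 11 and is read as methionine at the start of a CDS, which agrees with the protein sequence beginning with M. The last codon, TGA, is a stop codon in the same table.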