Gene Information |
Locus tag | Huta_1911 |
Symbol | |
ID | 8384202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1924016 |
End bp | 1927858 |
Gene Length | 3843 bp |
Protein Length | 1280 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644972979 |
Product | hypothetical protein |
Protein accession | YP_003130813 |
Protein GI | 257052980 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
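The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Huta_1911
start_bp, end_bp = 1924016, 1927858

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3843, matches "Gene Length"
print(protein_length(length_bp))  # 1280, matches "Protein Length"
```

Under these assumptions both computed values agree with the reported Gene Length (3843 bp) and Protein Length (1280 aa).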
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.281193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC ACCGAATCCA CGATATCTTC CAGCAGTCAC CGACCCGCGA ACTCGAGGAA GTCCAGAAGG TCAATGCCAG AGCGCAGGCC GAAAACGACG TGCGCGAGTT CTACGAGACC GACAGCGCAC GGAGCGTCCT CACTACGCTC GGCAACCTCG TCGACAAGTA CCCACAGGAG GAGCCGCGCT TCCTCTACAT CTCCGCGACG TTCGGGTCGG GGAAGACCCA CCTCCTGAAG CTCGCCGGAT TCGCCGCCGA CACCGAGTCG AAGTTCGCTG ACCTCGGTGA GGAACTGGCG AGTCGGTGGC CGGGGTTCCA GTCGTTCAGA CAGAGCATCG CCGACTCCCA CGTCGACCGC CTGAAGCCGG TCTTCCTCAA CCTCCTCAAC AGAGACGCGT CACAGGAGCC GCCACTCCCC TACCTCATCT ACGAGGTCAT CGGCCGCGAA CTCGGGTATC CGAACGATCC GAACTGGCTG CTCGAATGGG CGTGGCAGCT GGACATGAAC CACGGCGACT GCTGGGAGCT ACTGCAGGAG ATCGAATACG ACGGGCGAAC GTTCGACGAG GTGTACGACG AGCGGGCCTC GCTACGAAGC TGGCTCTACG GGGCTGTTCC GACACTCGAT GACACGCCGT ATCGCTCCAG TAGCGAAGTA AAACAGTCGA TCGATACTGC GATCGAGGAG ATCGACCCGG ACGAATTCGA TCCGGACGAA CTCGTCGACC GGGTCGAAGC GGCACAAGAG GCCCTGAGTA CGCCCGAGAC GGAGACGGAG CTGTTGATCG GCCTCGACGA GGTGGCACTG TTCATCGGCG ACGGCCGCCA CCGCTACCGG GAGTTCCAAG AAACGATGGA GGCCCTGACC AACCCCGCGA CGGGGCCGAA CCCACCCATC GTCGGGACGG GCCAGTATCC CTTCGATCGC ATCCACGGGG AGTTCGAGGA CTCAGACGTC ACCGACGAAC CGTGGTGGGG CAAGCAGGAG CCGCTGGAGG GGGCCGACAC CGAGATCATC GTCCGCAAAC GATGGCTCCA GAAGGACGGC GACGGCGAGC AGGCGGTCGA CGCGGCGATT CATGAGCTTC CCGACCTCAC GCTTGATACG TACGCGGACA TCACGGGCGC GGATCCCGAT GCAATCGAGT CGTACCCGTT CCGCGAGTAC GACCTCGGAC TGTTGCGCAC GGTGATCCAG CAGCTGATGC CGCGCGGTCG CGTGACCGAG GAGGAGTACG TGCAGGGGCG AGCCCTGCTC ATCCTCGTTC GGTCGCTGTT CACGCGGTTC GAGTGGGGCG AGAAATCCGT CGGTGCGCTC GTCACGTGGG ACGAGTTGTA CGACCTGCTG GTCGAGGAGA CGACGTACGT CCCGCTGTGG GTGCAGGAGA TGGTCGAGAA CAAGCTCGTC CCGTCCGCAG GTGGAGATGA AGATGCCTTC TCGGTGCGAG TAGCGAAGGC GCTGTACCTC CTGAATCAGG TGCGCTCGGA GGTACCGTCC ACGCCGGCGA ACCTCGCCCG ACTGATGGTG GAATCGGCCG ACGCATCGCT CGAAGAAGTC CAAGACGATG TCGAGTCGGC GCTGGCCGAC CTCGTTGAGG ACAGAAAGGT GCTGACCGAG ACGAGCGACC GAGGTGACGA GGAGTACCTG CTCGTCTCCG AGGAGCAGGA GGACATCCTG ACGCGGGCGA AGACACGTGC TCAACAGATT TCGTCGCATC GCCTGTCGGC CAAGCTGGAG ACGTACCTGC AGGAGGGGAG TGACCACCTC CTCTCGGCTG GCAGTCGACA CGAGGTCGAT 
CTCGACGGCG AGCGCCGGGT GCCGCTCCGG TTCGATTACT CCGTCCTCGA TCCGATCGAG CGGGCTCCCA CGCCGGAGTT CGATGCCATT CGCGTCCGGC TTGTTGCCGA CCGCCCCGAC ACGGTGGCCG ATCAGGTGGA CGCGTGGCAG TCGACCAACG AGGGACAGGA CGGGGGCGAA CACGTCCTCG TAGCCGTGGA GATCTCGGAG TCCACGGTTG AGCGTCTGCG GGACGTGATG GGAATGCAGG AAGTCCTCTC GGAGGAGACG GAGACCTATC CGGACTTGGA GTCCGACCAT CGCGACGATC AGCGTGCGCT CGAATCGACG GTTCGCGACC AGCTCAACGA GGCCGACATC TACGTCCGAG CCGGCGACAC GAGGGGTCGT TACGGCGACG TGTTCGAGCA GGTCGTGACG AATCAGGTGC AGTCGGTGTT CGGTCGGACG AGGCACGCAT TGACGAACGG GATCACCGAG GTCGACGATG CAAAGCAGAT GGCGCGGTTC TTCCGCGGTG TTGATGACTG GCCGCTGTCG AGCGAAGACG CTGTCACACT CGGTGTGGAC ACCAACCGGG GGGAACTCGC GGACGGGTGG TGTCAGGAGT TCCTCGACGA CTACGAGAAC ACGCAATCGC TCCGCGGTGA GGACTTGCTC GCTCAGACCG TCCAGCGCGG GGGCAAATAC CGCGGGACGC CACGGGAGTC CATCGCGGCG CTGCTGATCA CGCTCGCCAC GGCCAACGAG ATCGCTCTCC GCCGCGACGA CGAGTACATC ACCGAGCCCG AGGAGATCGG ACGGGCGGTC AGGAACAAGA CGAATCTGAC CGGTCTACAG ATTCGATTCG AGTCGCTGGA CGGCATCGAT CCCGACCAGA TTCGGGAGAC AGTCCGGACG CTCATGAGCG AATCACCGGA GGGTAGCGAC CCGGATGCGT GGCTCTCGGA GCTGGCTGAG TGGGTCGACG AGAATAGCGT TCTCGTGAAA CGCGTCCTTC GGGGCGTCAG CAGAGAGTTT GGCGAAGGTG CATCACTGGA TGAGTTGGAA AACGCCCTCG AACCGGCACT CGGCGGAGAG GCACTCGAGA CCGAGGACTT CGCGACCGAC GAAGTCGAGC GAGAGGCCGA ACGGTTCGCT CGTGCCAGAG GGCTGTTCCG TCCAGATGAG GACGATACGA CGCTGTGGGC GCGGTTCAGC GAGCGCACGG CCGAGATGCA GCGGCTCCAC CCCGGGGCAG ACGTCACCGG CGATATGCAG AGCATCGCCG GCGGTAACGA GGTACCCGAC GCTGAGCGTC TGCGGACGCT GATCGATGAG GCAGACGCCC ACAGGCAGCG CGTCGTCCGA GAACAGTACG AGCGCATCAC CGGTGAGACG CCAGCCGACG AGGAGTCGGA GAGTGTCGTT TCCAGCCTCG CGACGTGGTT GTTCGCTCAC GACGGAAGCA GCAAGGAAAT CGCCGACCGC GTCTCCGTCG AATTCGGGGG TGTGACGATC GACGACCTGT ACGATCTCTT CGAGACGGCG TGGGAAGGTG ACTCATTCTC GGAAGAGGAA CTCGTCGATC CAGCGGTTGT CCAGCAAGCC GAGCGATACT CGAGGGCGCG GGAGCTCCTC GAATCACCCG AAGGCGAGGT CAGTCTTTGG TCGAAGCTCC AAGACGCGTC GAGGCGACTT GAAGAGGAGT ATCCGAACCA CCCAGTCACA AGCGACGTGG AGGAGACGCT ATCTCGGTCA CAGCCGCCGA GCGTGGATGA AGTGGAACGG CTTCTTGATA AGGCAGAGAG CCCATTCGAA GTCGACGAAC 
GGCTGGCAGA GCTCGCCGAC GAACTGCAGA CCGAGTACCC AAATCACGAA GTTACCGCAA CAGTCGTTAA CGCCGTTGAA GGACCCAGCC AGCCGAGCGA CGAGCGCGTG GGCGAATTGA TTGAAGAGGC TGAACAGCTG CTTGAAGGTG TGGATGAGCA GTTGCGGCGG ATTAGAGAAA CGATGAACGA ACTTGACGAT GGGTCCGTTG TGGTGGTCGA GTCTCTCGAT TGA
|
Protein sequence | MSDHRIHDIF QQSPTRELEE VQKVNARAQA ENDVREFYET DSARSVLTTL GNLVDKYPQE EPRFLYISAT FGSGKTHLLK LAGFAADTES KFADLGEELA SRWPGFQSFR QSIADSHVDR LKPVFLNLLN RDASQEPPLP YLIYEVIGRE LGYPNDPNWL LEWAWQLDMN HGDCWELLQE IEYDGRTFDE VYDERASLRS WLYGAVPTLD DTPYRSSSEV KQSIDTAIEE IDPDEFDPDE LVDRVEAAQE ALSTPETETE LLIGLDEVAL FIGDGRHRYR EFQETMEALT NPATGPNPPI VGTGQYPFDR IHGEFEDSDV TDEPWWGKQE PLEGADTEII VRKRWLQKDG DGEQAVDAAI HELPDLTLDT YADITGADPD AIESYPFREY DLGLLRTVIQ QLMPRGRVTE EEYVQGRALL ILVRSLFTRF EWGEKSVGAL VTWDELYDLL VEETTYVPLW VQEMVENKLV PSAGGDEDAF SVRVAKALYL LNQVRSEVPS TPANLARLMV ESADASLEEV QDDVESALAD LVEDRKVLTE TSDRGDEEYL LVSEEQEDIL TRAKTRAQQI SSHRLSAKLE TYLQEGSDHL LSAGSRHEVD LDGERRVPLR FDYSVLDPIE RAPTPEFDAI RVRLVADRPD TVADQVDAWQ STNEGQDGGE HVLVAVEISE STVERLRDVM GMQEVLSEET ETYPDLESDH RDDQRALEST VRDQLNEADI YVRAGDTRGR YGDVFEQVVT NQVQSVFGRT RHALTNGITE VDDAKQMARF FRGVDDWPLS SEDAVTLGVD TNRGELADGW CQEFLDDYEN TQSLRGEDLL AQTVQRGGKY RGTPRESIAA LLITLATANE IALRRDDEYI TEPEEIGRAV RNKTNLTGLQ IRFESLDGID PDQIRETVRT LMSESPEGSD PDAWLSELAE WVDENSVLVK RVLRGVSREF GEGASLDELE NALEPALGGE ALETEDFATD EVEREAERFA RARGLFRPDE DDTTLWARFS ERTAEMQRLH PGADVTGDMQ SIAGGNEVPD AERLRTLIDE ADAHRQRVVR EQYERITGET PADEESESVV SSLATWLFAH DGSSKEIADR VSVEFGGVTI DDLYDLFETA WEGDSFSEEE LVDPAVVQQA ERYSRARELL ESPEGEVSLW SKLQDASRRL EEEYPNHPVT SDVEETLSRS QPPSVDEVER LLDKAESPFE VDERLAELAD ELQTEYPNHE VTATVVNAVE GPSQPSDERV GELIEEAEQL LEGVDEQLRR IRETMNELDD GSVVVVESLD