Gene Information |
Locus tag | Hneap_0958 |
Symbol | |
ID | 8534100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1027089 |
End bp | 1033493 |
Gene Length | 6405 bp |
Protein Length | 2134 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 646383339 |
Product | KR domain protein |
Protein accession | YP_003262843 |
Protein GI | 261855560 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
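The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    return gene_length // 3 - 1


# Values taken from the record for Hneap_0958.
length = gene_length_bp(1027089, 1033493)
print(length)                      # 6405
print(protein_length_aa(length))   # 2134
```

Both results agree with the Gene Length and Protein Length fields of the record.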
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCAAC CGACAATTAA CTCAGAAACT ATGGTGAAGC AGCTTCTGTC TGAAAAACAT GAACCAATAG CAATTGTTGG TCTGAGCTTG CGTCTCTCAG ACGGCATTTC TGATCTGAAT GGTTTTCACC ACCTGCTTCT TGATGGTCGT GATAATATCG TTGATATTCC TGAATCACGC TGGGACAATA AGAAATACTT TTCTTCCGCA AATGATTCCG GCAATGCCAT TTGTACCAAT AGGGGGGGGT ATTTAAACGA TATAGATAAA TTTGACCCGG CTTTTTTTTC CATATCGCCA AAAGAAGCGC GATATATGGA TCCTCAGCAA CGGATCATGT TGGAATTGTG CTGGGAAGCA CTCGAAAACG CAGGGTTGAA TCCCGAAGAA TTGCGTGGCT CTGATGGTGC AGTTTATTTG GGTTCAAGTA GCCTTGATTA TGCTCGCGAT ATGATGGGGT TGGGCGAAAA ACAGCTGGTC AGTCAATTAG GCACAGGTAC AGCAAATAGC GCCATATCCG GGAGAATCTC ATATTTCCTG GGTTGGCGTG GTCCAAGCAT GACTGTTGAC ACAGCTTGTT CTTCCTCGTT AGTAGCGATC CATCTCGCTT CTTCAGCACT TCGAAACAAG GAAACTTCAT TCGCGATAGC TGGTGGAATT AATATTGTCC ATAACCCTGT CAGCCATATC ATTTTCACTC GTGCCAATAT GTTAGCGCCC GATGGGCGTT GTAAGACCTT CGATGAATCA GCGGATGGTT ATGGTAGAAG TGAAGGGGCG GCAGTACTCG TACTTCGTCG ACTTTCATCT GCTATAGCAG ACGGAAACCG TATCCTAGGG GTAATTCGTG GATCGGCAGT ACGCCAGGAC GGTGAGAGTG GCGGTCTAAC TGTTCCTAAT GGTGTGGCGC AAGAGGCTGT TATGCGCGAT GCGCTCAGAC GTTCTTTGGT TAAACCAGCC GATATCAGCT ATGTAGAGGC ACACGGAACC GGAACACCTC TCGGAGATCC CATCGAAATT CACTCCATTA ATTCGTTATT TTCGGAAAGA CCGAATATAG ATGAAAAGGT TTATGTGGCC TCATCAAAGA CGAATCTCGG ACATATGGAA GCTGCAGCTG GTGTTGGCGG CATCTTAAAG TGCTTGGCAC AGCTAAATAG CAAATTAATT TACCAGCATA TTAATTTTAA TAACCCATCA TCAAAAATTG ATTGGGGGGC TTTGTCTGTT ACGGTTCCGA CCGAAGTTCA GCCTTGGGAA CAACCGGTGC GTAGGGCAAT GGTCAATTCC TTTGGTTTTG CGGGTACGAT CGCCAGCTTA GTCCTTGAAG AAGCGCCTGT GATGAATGCC CCCCCCCGTG TCGATAAGAA GGGTACGTCA GGAGCCTCAG ATGATCCTTT TATATTTACG GTTTCTGCCA AATCATCATC TGCTCTCTTG CGTAATATGC GGGCGTATTT AGAGACTCTC ATTAAGAGCA GTGCTGATGA TATGGATAGC ATCATTTGGA CATCAAGTGT GGGGCGGCAA CACCACCCAT ACCGTTGGGC TGCCTCAGCA AGAAACAATG ATGAGTTGAT GCAGCTTATG CGTAGTTCAT TAGACGAATC TGAAGAACTA ATGGCCGATA CGGAAAGAAT GCTGCAATTT TCCGAGGCTC GTGTTGCATT TCTGCTCACT GGGCAGGGAG CACAATACCC TGGTATGGGA GCAGGACTCT ATAAGAAGAA TCGAGTTTTT CGGCGTGCAC TTGACGAAGT CTCGCATCAC TTTGATTCAG TGCTGGATGT GAAATTAAAG 
GAATTAATGT TCGATCAGGG GGCAATAGCC AAAGAGCGGC TTGTGCAGAC GAAGTACACT CAACCTGCCC TCTTTGCATT CAACTATGCC GTGGCAAAGA TGTGGATGGA ATATGGTATC GAACCTGCTG TGATGCTCGG TCATAGTGTG GGTGAAATTG TTTCAGCCTG CTTGTCGGGA TTGTTCACCT TAAAAGATGC CGTACGGATG ACAGCTCGCC GTGCTGAGGT AATGCAATCG GTCAAGTTAG ATGGTGCCAT GTTAGCGGTA AAAGCGACCA AAGAGCAGGT GTCTAACTTC ATAGAGTCAT TTGATGATGT TGGCTTTGCG GGCTTTAACG GTCCGAAACA AACGGTTATT TCCGGCGGTA TTCTTTCGCT AGACATGATT GCCGACCGTT TGTCGGAGGA AGAAATTCCT CATCGTAGGT TGGAGGTTTC ACACGCTTTT CACTCTGCTC ATATGAATGA AGCAGCTGAG ATATTCCGTG ACTACATGAC CACAGTTACT TTCCATCCGC TGCAGAGGGA GTTCATTTCC AATGTGACCG GCAAGGTGGC TACTTATGAT CTGGTCGCAT CGCCCGATTA TTGGGCGCGA CATATATGCC AGCCTGTCAA CTTTGCTGCG GGTATGGAGA CGATAGCGAC CAGAAGTTCC TATCTCTTTC TGGAAACAGG ACCAAGCCCG CATTTGACTG CTATGGGGCG TGGGTGCATT AAGGCATCTG ACCACTATTG GGTTGAAACA GTCAAACCAT CGTTAGTTGA GGAGGATAGC CTTGATGAGG CAATTCTCTC TTTATATAAG GCGGGGCAGA AACTTGACTG GCGTATAGTT CATGCGGGTG TGTCGAACAA TATGTGTGAG TTGCCTCCAT ACTCGTTTGA TCATGAATCC TACTGGTTGC CAGTTGCTAC GAGTGGAGTG GACAGGGGCT CGGATTTCCA CCCACTGCTT GGCCAACTCG CAGCACGTCA ACATTCGCAA TGGACGTTCA CAGCGTTTGT GGCGCCAAGT TCACCAGCTT ATCTTGCTGA TCATGTCGTT ATGGGGCGCA CGATCTTTCC TGGTACTGGC TACGTCGAAG TCGTACTGGC CCTTCAGGAT GCAGTATTTG GCCACACGGG CATGGTAATG GAAGACCTTG AAATTCATGC ACCGCTAATC CTAGCTGACG AAGCGTCAAC TGAGTTATCA ACACAACTGA CATATCAGAA GGATGGTGTA TACAGTGTAG AAATTAGTTC TGTAGCGCAG GGTGAGAATA CAGTTCACTT TACCTGTCGC ATGTTGGAAG ACCCAACACT TGTTTCTATA GTCGATCTCC CGGACGGATC AAGAGATGAC GAGCAATCAA CGTTTGACAA AACCATTCTC TACAACAGAC TAGCTTCTTT AGGCCTTCAA TATGGGGAAC AATTCCAGCG CGTTAGATCA ATTCGTAAGG ACGGCGTTCG TCGTGTTTTT GGTACTCTTT CGACAGAAAA CATCAATAGC TGGGAGTTTG TAAATCCAGG CTTGTTTGAT GGAGTGTTGC ACACCCTTGA GCCTATTCTC GGTGCAGATA GGACTCTTGT TCCGGTTGGT TGGTCCAAGA TACGCGTCTA CCGCAAACCT CGGGGAAATG TCGAGTGCGT TGCAGAATTG CGTGTTGATT CTGATCCATT TAGTAGCGAA GTCCTAGCTG ATTTAACCCT GTACGGTGAT GGATTGCTTG TTCTGCGTGC TGAGGGACTA CGTTTGCGTG AAGTGAAATC CCGTGAACGT TCATCATTGC CATTTTTCCA CCGTATCGTT TGGAAAGAGC 
AGCCGTTGGA TCTTCAGGCT TTTCGTGCAG TGCGCCTAAT CGGTATTAAC TGCCCTGATG CCTTGAAACC ACATTTGTCA TCACATTTCG AGTTAGTTGC TGATATAGAT AAGGCTTTGG ATGCGATACA TACATCACAG GGAAAGCCGA CAAAGTTTGT CTGTTTCTGG AATGGAAAGG AGTTGCCTGC CGATGCTGAT GCCGATGCAA TTATGGCAGC CAGCCAATCT TTTTACGAAC CGTTGTTAGC ATTTATAAAG GCACTTGCAA AACTTGCACT TAAAGAGACT GTAGAACTTG TCTTTGTGAC AAAAGGGGTG CAAACAACGG GCAGGGAAGA TGCGAGAGCT GGAACTGAGG AACCACTGTC ACTTAATACC CAGTTGCAGG CAACAATATC CGGCTTTTGT TCGGTACTAA ACTCTGAGTT TTCGCGCATA CGTGCAAAGG TTGTAGACCT CCCGTGTATG GGTGAAGACC AGGACGATGT TTATTCACTC CTAAACGAGT TGCATTTGGG CAATAGTCGC TCAGATTTTC AAGTCGCTTA TCGGTGCAAT AGCCGTATGG TGAAACACCT GGAAGTGGCA AAAGTTATTG AGTCTGAGGA AAACTATCAG ATCGTGGTAG CCGATGACGG ACTGCTGTCT GGATTGGGGA GAAAGCCATT GCCACGCATC GTTCCGTCGG GGGATGAAAT TGAAGTTTGC GTCGAGGCGG CGGGTTTGAA CTTTAAGGAT GTGCTCAATG CCCTTGGACT TTTGAAGGAG CATTCAAAAA GCGAGGGATT GACTCATAAA GAGTTGCCGC TTGGATTTGA GTGTGCTGGC GTTGTCACCG CAGCAGGAGA TGACGCTGAG TTTATGGTTG GCGAATCGGT AATGATCAGC CACCTCGGGT GTATGCAGCG CTATGTAACG GTGAGCAGTC GTGCTGCAGT ACGTATTCCA GATGGGATCA CTATGGAGCA AGCTGCTGCT ATCCCAACTG CCTTCATCAC GAGTTATTAC GCACTATATT CGTTGGCTAA AGTGAATGCA TCGGATCGTG TTCTGATTCA TGCGGCAGCA GGTGGGGTTG GTCAAGCCGC GCTTCAGTTA TGCAGGCGTG TGGGTGCTGA GGTATTTGCC ACGGCCAGCC ATCGTAAGTG GGATGTATTA AGAAACCAGG GTGTCGACAG AATCATGGAT TCGCGCAACA TCAATTTTGG AGAAGAAATT CTCCGGATGA CGGATGGTGG TGGTGTTTCA GTCGTTCTCA ATAGCCTGAA TAAAGACTTC ATTCCGGTCA GCCTAGCGGC GACGGCGCAC GGCGGGAGAT TTGTTGAACT CGGAAAACTT GGTGTGTGGA GTAAACAACA GGTAACTGAA GTGCGGCCTG ATATTTCATA CTCTCAGTTT GACCTGAGTG AAATCTCAGA AAACGAGCTC TTAGACTTGA ATAAGAGCAT TCTTGAAGAC ATTGCAGCAT ATCTTTCCGC GGGCGAGATT ACCCCCCCTC TTGTTACCAG TTATTCAGTT ACAGACATAG CTGAAGCCTT TGGTGTGCTA TCACGAGGGG AAAATGTAGG GAAAATCGTT CTCACGTTCG GTCGTGAGCA GGACAGTGAC TTGTCGTTGT CTCGTGTAGT CAGTGGGGAA GGTACTTATG TAATCACTGG AGGATATGGC GCTCTAGGTC AACGTGTGGC TCGCTGCTTA ATTCAGGCCG GAGCACGAAA TGTGACATTG CTTGGGCGAA ATTTGCCGAA TCAGGATGCA CTTGCACAGT TGAAAATGAG GCTAGATGGA GTAGAGAACC TTGATCTGTG 
TACAGGTGAT GTTGCAGATT CAGATGTTGT TGATCGGATT TTTGTAGAAG CCGCTGAGAA AGGGAGACCT GTTTGCGGAA TTGTTCATCT TGCTGGACTT ATTTCAGATG CGCCAATTAC CGAGCAAACT TGGATTAGCT TCCGAACTGT GTTTCTTCCC AAGGTAGTTG GCACGTGGAA TCTGTGGAAA GCGGCAGAAC GTCATGGTGG CGTAGAACTA TTTGCCGGGT TCTCATCGAT TGCGTCGGTG GTCGGATCTG TTAGTCAGTC AAACTATGCC TCAGCGAACG CATTTATTGA TGGCTTGATG AATCGCACCC GGGGAGGGGG GCGTATTGGA TTAGCACTTA ATTGGGGGCC GTGGGGCGGA GCGGGAATGG CTGCGGAGTT AACCGATCAG CAGAGGAAGT CTATTGAGCG AAAGGGCTTC TCACTGATTC CGATGCAGTT GGGAATGGAG GCATTCGGTC GCTTGGCCAG GCAAGCTCAA GGACAGGTTG TCATCGGGAA CGTCGATTGG AGTGCTTACA AAGAAAGCCT GCCTGGTGAC GATTCCCTGT ATGACGATGT AAAGGATGCG AAGAGTGACG CACCTTCTGA CGTTTTTGAT TATAACAGTT TGCTTGCTTT GAGTGAGGAC CAGCGTAAGG AGATAGTGCT TGAGAAGCTC ATATTGATTC TTCGTCAGGT GTTGCAGTAT GGGGAAAAAG AGAGGGTTTC TCGTCGTGCC ACATTTTCAG ATTTGGGAAT AGATTCTTTG GTAGCTGTAG AGCTGCGAAA TACTCTGGAA AAATCATTTG GCATCGCACT CCCATCATCA CTGGTATTTG ATTACCCTTC TGTACCAGTA TTAAGCGGAT ATTTGCTCGA ATATTTAAAA AATAACTATG TGTCGACTTC AGGAGATGGA GAAAATTCTG ATGCATCATC GGAAATGCAC AAAACTTCGG AGGAAGAGGA TTCCACTGAA GGAGAGCTTA TATGA
|
Protein sequence | MEQPTINSET MVKQLLSEKH EPIAIVGLSL RLSDGISDLN GFHHLLLDGR DNIVDIPESR WDNKKYFSSA NDSGNAICTN RGGYLNDIDK FDPAFFSISP KEARYMDPQQ RIMLELCWEA LENAGLNPEE LRGSDGAVYL GSSSLDYARD MMGLGEKQLV SQLGTGTANS AISGRISYFL GWRGPSMTVD TACSSSLVAI HLASSALRNK ETSFAIAGGI NIVHNPVSHI IFTRANMLAP DGRCKTFDES ADGYGRSEGA AVLVLRRLSS AIADGNRILG VIRGSAVRQD GESGGLTVPN GVAQEAVMRD ALRRSLVKPA DISYVEAHGT GTPLGDPIEI HSINSLFSER PNIDEKVYVA SSKTNLGHME AAAGVGGILK CLAQLNSKLI YQHINFNNPS SKIDWGALSV TVPTEVQPWE QPVRRAMVNS FGFAGTIASL VLEEAPVMNA PPRVDKKGTS GASDDPFIFT VSAKSSSALL RNMRAYLETL IKSSADDMDS IIWTSSVGRQ HHPYRWAASA RNNDELMQLM RSSLDESEEL MADTERMLQF SEARVAFLLT GQGAQYPGMG AGLYKKNRVF RRALDEVSHH FDSVLDVKLK ELMFDQGAIA KERLVQTKYT QPALFAFNYA VAKMWMEYGI EPAVMLGHSV GEIVSACLSG LFTLKDAVRM TARRAEVMQS VKLDGAMLAV KATKEQVSNF IESFDDVGFA GFNGPKQTVI SGGILSLDMI ADRLSEEEIP HRRLEVSHAF HSAHMNEAAE IFRDYMTTVT FHPLQREFIS NVTGKVATYD LVASPDYWAR HICQPVNFAA GMETIATRSS YLFLETGPSP HLTAMGRGCI KASDHYWVET VKPSLVEEDS LDEAILSLYK AGQKLDWRIV HAGVSNNMCE LPPYSFDHES YWLPVATSGV DRGSDFHPLL GQLAARQHSQ WTFTAFVAPS SPAYLADHVV MGRTIFPGTG YVEVVLALQD AVFGHTGMVM EDLEIHAPLI LADEASTELS TQLTYQKDGV YSVEISSVAQ GENTVHFTCR MLEDPTLVSI VDLPDGSRDD EQSTFDKTIL YNRLASLGLQ YGEQFQRVRS IRKDGVRRVF GTLSTENINS WEFVNPGLFD GVLHTLEPIL GADRTLVPVG WSKIRVYRKP RGNVECVAEL RVDSDPFSSE VLADLTLYGD GLLVLRAEGL RLREVKSRER SSLPFFHRIV WKEQPLDLQA FRAVRLIGIN CPDALKPHLS SHFELVADID KALDAIHTSQ GKPTKFVCFW NGKELPADAD ADAIMAASQS FYEPLLAFIK ALAKLALKET VELVFVTKGV QTTGREDARA GTEEPLSLNT QLQATISGFC SVLNSEFSRI RAKVVDLPCM GEDQDDVYSL LNELHLGNSR SDFQVAYRCN SRMVKHLEVA KVIESEENYQ IVVADDGLLS GLGRKPLPRI VPSGDEIEVC VEAAGLNFKD VLNALGLLKE HSKSEGLTHK ELPLGFECAG VVTAAGDDAE FMVGESVMIS HLGCMQRYVT VSSRAAVRIP DGITMEQAAA IPTAFITSYY ALYSLAKVNA SDRVLIHAAA GGVGQAALQL CRRVGAEVFA TASHRKWDVL RNQGVDRIMD SRNINFGEEI LRMTDGGGVS VVLNSLNKDF IPVSLAATAH GGRFVELGKL GVWSKQQVTE VRPDISYSQF DLSEISENEL LDLNKSILED IAAYLSAGEI TPPLVTSYSV TDIAEAFGVL SRGENVGKIV LTFGREQDSD LSLSRVVSGE GTYVITGGYG ALGQRVARCL IQAGARNVTL LGRNLPNQDA LAQLKMRLDG 
VENLDLCTGD VADSDVVDRI FVEAAEKGRP VCGIVHLAGL ISDAPITEQT WISFRTVFLP KVVGTWNLWK AAERHGGVEL FAGFSSIASV VGSVSQSNYA SANAFIDGLM NRTRGGGRIG LALNWGPWGG AGMAAELTDQ QRKSIERKGF SLIPMQLGME AFGRLARQAQ GQVVIGNVDW SAYKESLPGD DSLYDDVKDA KSDAPSDVFD YNSLLALSED QRKEIVLEKL ILILRQVLQY GEKERVSRRA TFSDLGIDSL VAVELRNTLE KSFGIALPSS LVFDYPSVPV LSGYLLEYLK NNYVSTSGDG ENSDASSEMH KTSEEEDSTE GELI
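The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last codons. This is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid map from the usual TCAG ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAGCAAC CGACAATTAA CTCAGAAACT"))  # MEQPTINSET
print(translate("GGAGAGCTTA TATGA"))                  # GELI
```

The two outputs match the first ten and last four residues of the protein sequence above. The last 15 bases are in frame because the 6405 bp gene length is a multiple of 3.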