Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BT9727_0932 |
Symbol | yhaN |
ID | 2857962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 1027216 |
End bp | 1030140 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637512368 |
Product | hypothetical protein |
Protein accession | YP_035271 |
Protein GI | 49479811 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG AAAAACTCCA TATTTATGGA TACGGAAAAT TAGAAAATGT GGAAATGGAA CTGTCAATGC TGACGGTGTT ATACGGTGAA AATGAAGCAG GGAAATCAAC AATTCGTTCG TTTATGAAAA GTATTTTATT CGGTTTTCCG ACGAGAGGAC AGCGCCGTTA TGAGCCGAAA GAAGGCGGGA AGTATGGCGG AGCAATCACT GTGCAAACAG AGAAGTACGG CCGTTTGAAA ATTGAACGAT TGCCAAAGAC GGCTGCTGGT GAGGTAACTG TTTATTTTGA AGACGGGAAA ACGGGCGGCG AAGAAATTTT AAACGATATA TTAAGCGGGA TGAATGAAAG TTTATTTGAA TCGGTCTTTT CTTTTGATAT GCACGGTCTT CAAAACATTC ATCAGCTTGG CGAAGCGGAT ATCGGCAATT ATTTATTTTC GGCAAGTGCA GTCGGAAGCG ATGCGCTATT ACAGCTCGAT AAAAAGCTAG AGAAAGAAAT GGATCAGCGC TTTAAGCCGA GTGGGCGTAA ACCAGAAATT AACGTGTCAC TGCAAGAGAT GAAAAAGCTG GAAGAGAAGA TGAAAGAGTG GCAAGGGAAA ATTGGCACGT ATGAAAAGCA AGTGGAGCAG TTAAAAGAAA GTGAAGAGAA GCTTGCTTCT GTTCGCGCAG AAAAAGAGAG TGCAGAAAAA CGAAAGCAAG ATTATGAAAT ATTAGCAGCG CTTGAACCGC TCGTTATTGA AAAACGTGCG TATGAGAAAG CGTTAGAAAG CGAGAGCGGG CAATTTCCGG TAAACGGAAT GGCGCGTTAT GAAGCGATTA AGGCGAAAAT GGAGCCGCTT CAATTACAAG TTGATTCGCT TCATAAAAAA ATAGAGACAG TGCAATCGGA AATGGAATCG ATTCAAATAG ATGAAGAATT TTTACAAAAA GAAAGTTATG TAGAAGAACT TCGTATGCAG CATATGTCTT ACGAAAATGC ACGCCAAGAA ATGCGTGATG TGACAGGGAC GATTACGAAT ATAAAAGAAG AACTGGCAGA ACTAGAGCAG CAGATCGGTG CTACATTTGA AGAAGAAACA GTTCTTTCGT TTGATATGAG TTTGGCGACG AAAGAGTTAA TTACGCAAAC GGTGCAAAAG GCGCGCGAAC TAGAAACGCA AAAAGCACAG CTTGATGATC GTTTTAAAGT AGCGCAAGAG CAATTAGAAG AACAAGAAGA AAATATAAGA CAAATTCAGA AGCAAATGTT AGCGGATGAA GAGCGAAGTG CGTTAGTTGA GAAAGAGAAG TCGTTCCAAG ATGCGGCGTT TATCGGTATG GGCGCTGAGA GAATGAAGCG CAAGTATGAG GAAAAAGCAG GAGCGGCTAT GCAAAAGAAA AAGCAGTGGC AAAGAGTTTG TCTTCTGTTA CTTCTTATTA ACACAGGCGT TTTATTCACA AGCTTATTTC TAGATAACCG AGCGCTCTTA TTTATTAGTG TCATTGTTTT TGTAGGGATT GTTCTTGCCC TTGTTTTATA TAAAGATCCG TCAAGCGGAT TACAAGAAGA GCTTCTTACT CTTCAGCAAA GTGCTGGCGG GAGACAAAGT GAAGAAGCGA TGTCAGTGCG CTATCAGTTA GAAAAAGACG AAGAGGTTCG TAAGTTATTT GAGCGGGAGT CATATAAATT GCAGCAAATG GAGCGCGCAT ATGATAAAGT CGTTTCATCG TATGAAGAAT GGGAGAGAGA AACGTTCCGC ATAAGCGAAC AAGTAAATGT GTATAAAAAG CGCTATACGT TCCCTGAATT TTATACGTAT GCGCACATAT TGCCAGCGTT TGAGCGTATG GAAAAAATGC AGCAATTATA TCGTGAATTA GAGAAACAAG GCACGCGAAA ATCTTCATTA TATGAAATGA TTTCGCAATT TGAACATAAA CTAGAAACTG TTATCGGTAG TGCGGAGTAT AGTAAGCTAC ACGAGGCACA AAGCCGTATG CAAAATGAGA AAGAGAAGCG CCAAACTTGT AAGCAGTTAA AAGAAAAACT GGCGGAATGG CAAGAAGAAT ATGAGTTCAT GCAAGAACAA TTAAAGCAAC TACTAGTAGA ACGAGACAGT TTATGGCATA TCGCAGAGTC TACAAATGAA GAGATGTTTT TAGAGGCAGG TACACTAGCG GAAAAACGTG AAGATGCGGA GAAGCAAGTT GGGCGTTTAT TGCCGCAAAT TGATCTGTTA GAACAACGTT TAACGAGTTT ATCATTAACT GAACATTATG AAGCTGACGG TTATGAGGAA AAATTAAAGC AAGAACTGAC AGCCGCGCAA AACTGTCTGG CACAAGAAAA AGAACTAACA GAGCGTATTG CGAAACATCG TATGGAAATT GCGAATTTAG AAGAAGGTAG TACGTACGGT AATTTACTGC ACGAATGGGA AATGAAAAAA GCGCAAGTGC GTGAACAAGT AAAGAAGTGG GCTGCGTATG CGGCTGCAAA GACAGTGTTA ACGAAAACGA AGCAATATTA TCATGAAGTA CATCTTCCTC GTATTTTACA AAAGTCAGAA GAGTATTTCG TCTATTTAAC AGGCGGACGA TATAGTAAAA TCTTTTCACC GTCAGAGGCG GAGCCGTTTA TTGTAGAGCG TAACGATGGT ATGCGTTTTT ATAGTCATGA ACTAAGCCAA GCGACAGCTG AGCAGTTGTA TTTATCGCTG AGATTTGCGT TAGCAAAAAC ATTTGAGCAT GATTATCCAT TTATTATTGA TGACAGTTTC GTGCACTTTG ATGCGGTAAG GACGAATCGG ACGATTGAAC TTATAAAGGA AATTGCAAAG GATAGACAAG TGATCTTCTT TACATGTCAT GCGCATTTAC TCGCGTATTT TACAGAAAAA CAGATTATAA AATTAACGCA TAAGCGTAAA GAAAATGAGT TGTAG
|
Protein sequence | MRMEKLHIYG YGKLENVEME LSMLTVLYGE NEAGKSTIRS FMKSILFGFP TRGQRRYEPK EGGKYGGAIT VQTEKYGRLK IERLPKTAAG EVTVYFEDGK TGGEEILNDI LSGMNESLFE SVFSFDMHGL QNIHQLGEAD IGNYLFSASA VGSDALLQLD KKLEKEMDQR FKPSGRKPEI NVSLQEMKKL EEKMKEWQGK IGTYEKQVEQ LKESEEKLAS VRAEKESAEK RKQDYEILAA LEPLVIEKRA YEKALESESG QFPVNGMARY EAIKAKMEPL QLQVDSLHKK IETVQSEMES IQIDEEFLQK ESYVEELRMQ HMSYENARQE MRDVTGTITN IKEELAELEQ QIGATFEEET VLSFDMSLAT KELITQTVQK ARELETQKAQ LDDRFKVAQE QLEEQEENIR QIQKQMLADE ERSALVEKEK SFQDAAFIGM GAERMKRKYE EKAGAAMQKK KQWQRVCLLL LLINTGVLFT SLFLDNRALL FISVIVFVGI VLALVLYKDP SSGLQEELLT LQQSAGGRQS EEAMSVRYQL EKDEEVRKLF ERESYKLQQM ERAYDKVVSS YEEWERETFR ISEQVNVYKK RYTFPEFYTY AHILPAFERM EKMQQLYREL EKQGTRKSSL YEMISQFEHK LETVIGSAEY SKLHEAQSRM QNEKEKRQTC KQLKEKLAEW QEEYEFMQEQ LKQLLVERDS LWHIAESTNE EMFLEAGTLA EKREDAEKQV GRLLPQIDLL EQRLTSLSLT EHYEADGYEE KLKQELTAAQ NCLAQEKELT ERIAKHRMEI ANLEEGSTYG NLLHEWEMKK AQVREQVKKW AAYAAAKTVL TKTKQYYHEV HLPRILQKSE EYFVYLTGGR YSKIFSPSEA EPFIVERNDG MRFYSHELSQ ATAEQLYLSL RFALAKTFEH DYPFIIDDSF VHFDAVRTNR TIELIKEIAK DRQVIFFTCH AHLLAYFTEK QIIKLTHKRK ENEL
|
| |