Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_3023 |
Symbol | |
ID | 4257573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | - |
Start bp | 3583980 |
End bp | 3585503 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638123701 |
Product | tryptophan halogenase |
Protein accession | YP_739064 |
Protein GI | 114048514 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.172651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCA GAAAAATAGC CATTATAGGG GGCGGGACTT CAGGTTGGTT AGCGGCGAAT CATTTAGGGC GAGTGTTGCA GGGGCGCTCT GAGCTATCTA TCACGCTGAT TGAATCTCCT GATATTCCTA TCATCGGAGT TGGTGAAGGT ACTGTACCGT CCATTCGGAA ATCACTACAG AGTTTTGGGA TCAGTGAATC AGAGTTTATT CGCTCATGCG ATGTGACATT TAAGCAGTCA ATTAAATTTG TCAATTGGTT AGATAAAGCC CGACATGGTA AGGATAATTT CTACCACCAT CTATTTGATA TCATCAATTT ACAAGGTGTA GACACTGTTT CTGCTTGGTT GGGCGAGAAA CAAGGCGATT TTGCTGATTA TGTTTCTCCC CAGCATCTTG TTTGTGAGGC CGCAAAAGCG CCAAAGCTAA TCACGACACC TGAATATGCT GGAGTGTTAG GCTATGCCTA TCATTTAAAT GCGGCAAAAT TTGCCAAATT ATTGGCTAAA AATGCGATTG AGAAATTCAA CGTTGAGCAC ATTTTTGCCA CTGTGCAGGA TGTTCGTTTA GGCGATGATG GTGCAATTTC ATCTTTGCTG ACTGAGCAAG GGACTCTCTC TTTCGATTTC TATATTGATT GCAGTGGCTT TGAATCGATT TTGTTAGCTA AGGCACTAAA AGTGCCATTT ATCAGTAAAG CGCATCAACT GTTTATTGAT ACTGCGTTAG TCGCGCAAAT TCCAACGCAA CCTACGGATA TCATCCCTCC TTATACTCAA GCAACGGCCC ATCGAGCAGG TTGGATATGG GATATCGCAT TGACCCAGCG CCGAGGCACG GGATTTGTCT ATTCGTCTAC TCATATGGCG CAGTCAGAGG CAGAACGAAA GTTTGACCAT TATCTCGGTG GCAAACTTGC CGATGTTCCG CATCGTAAAA TTCCAATGAC AGTGGGCCAT CGTCAACAGT TTTGGGTAAA AAACTGTGTA GCTTTAGGGT TAGCTCAAGG CTTTTTAGAG CCGATTGAAG CGACATCCAT ATTATTAACG GATTTCTCTG CACGTTTTCT GGCAGAACGT TTTCCAGTGC ATACAGATGA TGTTGATTAC TTAGCAAAGC GGTTTAATGA TACCGTGGGC TACGCATGGG AGCGGGTCGT TGAATTTGCC AAACTACATT ACTGTTTATC GGATAGAACT GACTCATCAT TTTGGCTCGA TAATCAGGCT TCAGAGACTA TTCCCGAAGG ATTAAAGCAA CGACTCGAGC TATGGAAATC CTATGGTCCT ATTGCTGAAG ATTTTCCATC TAAGTTTGAA GTATTTAACT TAGACAATTA TCTATACGTT CTTTATGGCA TGAAATATCA GACTAATTTG CGTGGCGGCT CATCGACAAC ACAGGCCTCT TTGAACATGT ATATGAACAA AATGTCGAAG GTAAAACAAC AGATGCGAGA TGGGTTGCCT GAGCATAGGG AATTGCTCGA TAAAATCTGT ACCTATGGTT TGCAATCTAT TTAA
|
Protein sequence | MTIRKIAIIG GGTSGWLAAN HLGRVLQGRS ELSITLIESP DIPIIGVGEG TVPSIRKSLQ SFGISESEFI RSCDVTFKQS IKFVNWLDKA RHGKDNFYHH LFDIINLQGV DTVSAWLGEK QGDFADYVSP QHLVCEAAKA PKLITTPEYA GVLGYAYHLN AAKFAKLLAK NAIEKFNVEH IFATVQDVRL GDDGAISSLL TEQGTLSFDF YIDCSGFESI LLAKALKVPF ISKAHQLFID TALVAQIPTQ PTDIIPPYTQ ATAHRAGWIW DIALTQRRGT GFVYSSTHMA QSEAERKFDH YLGGKLADVP HRKIPMTVGH RQQFWVKNCV ALGLAQGFLE PIEATSILLT DFSARFLAER FPVHTDDVDY LAKRFNDTVG YAWERVVEFA KLHYCLSDRT DSSFWLDNQA SETIPEGLKQ RLELWKSYGP IAEDFPSKFE VFNLDNYLYV LYGMKYQTNL RGGSSTTQAS LNMYMNKMSK VKQQMRDGLP EHRELLDKIC TYGLQSI
|
| |