Gene Information |
Locus tag | Shewmr4_2941 |
Symbol | |
ID | 4253512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 3513142 |
End bp | 3514677 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638119577 |
Product | tryptophan halogenase |
Protein accession | YP_735069 |
Protein GI | 113971276 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
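The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3513142
end_bp = 3514677

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS, so every codon except the final
# stop codon contributes one amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1536
print(protein_length)  # 511
```

Both results agree with the listed Gene Length (1536 bp) and Protein Length (511 aa).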
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.368226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.400178 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA AACGCATTGC AATTATTGGT GGCGGAACAG CGGGTTGGTT AGCCGCAAAC CATTTGGGCG CAGAGTTATG TCACGACCCG GAAATCGAGA TCACCCTGAT AGAATCTTTA GACGTACCAA CCATTGGCGT GGGGGAAGGA AGTGTGCCTT ATCTTGCCAA AGGGCTGAAA CGTTTTGGTA TTTCAGAGGC TGAGATGCTA CTGACTTGCG ATGCAACTTT TAAGCAAGGG ATTAAATTTG TTAATTGGTT AGACCCAGAA CGACATGGAG AAAATCATTA CTATCATAGT TTTGATACCC CCTATCCCGC GGGTGTCGAT GTATCGAATT ATTGGCTGGC AAATGCTGCT GACCGGCCCT TTGATGATGT CGGGATCCAA GCTCGGATTT GTGAGTTAGG ATTATCGCCT AAACGTAAAG GTTCTGGCGA TTATGAAGGC GCACTGTCCT ATGCCTATCA CTTCAACGCC TTGAAATTTG CAGTTCTTCT CGCTAACAAC GCCAAGTCGA GATTTAAGGT AAAGCATCAA TTTGCCACGA TTAAAGGTGC CGAAGTTTGT CAATCCGGCC GAATTACCCA TTTAGTGACT GCGGATGATA CTTCACTCGC ATTTGATTTT TATGTGGATT GCAGTGGATT TGCTTCAATC TTAATTGATA AAACCCTCAA AGTGCCTTTT GTGAGTAAAG CTGATGAGTT ACTAACTGAT ACAGTACTCG TTCAGCAGGT TGCACTGGGT GCCGATGAGG AGATTAACCC CTACACCACA GCGACAGCCC ATAAAGCAGG CTGGATTTGG GATATTCCAC TCACGACTCG GCGTGGTACT GGTTTTGTGT ATTCTAGTCG CCATATGAGT GATGACGAAG CGTTAGGCTT ATACGCCGAC TATCTAAGCA TCGATAAAGC TCAATTTAAT CCGCGTAAAA TTCCAATGAA GGTCGGCTAC AGAGAAACAT TTTGGCATCA GAATTGTGTT GCATTAGGGT TAGCGCAAGG CTTTGTGGAA CCGCTAGAAG CAACGTCGAT ATTGGTATCG GACTTTTCTG CTGAATTACT TGCACTTAAT TTCCCTCGAC ATCTGGAAGA TATCGATGTG CTCACGCCTC ACTATAATCA GGTCACGACC TATGTGTGGG AAAGAGTCGT GGACTTTATT AAATTGCATT ACTGCATTTC AGATAGAACA GACTCGGATT TCTGGCTCGA TAATAAAAAG AGCGAGACTA TATCTGAGGA GCTACATCAG CGACTTGCGC GTTTTGCCTT AAGACCTCCT TATGCATCTG ATTTTTTTGG GCGTTTTGAG TTGTTCGACC ATAAAAACTT CCTTTATGTG TTGTATGGAA TGAAGTTCAA CACGGCTGCT ACAGTCTTAT CAGCACAAGA AGTTCAGCAT TGTGATGCAT TAATGAAAGG AAATGATCAG CTAGTCGTTA AAGCGGCTTC AATGCTATTA AAACATCGAG ATTGGCTTGA AGGCTTAAAG CAGGCCTTTG AGCAGGCTCG TTCTACAAAG GTTTAA
|
Protein sequence | MKIKRIAIIG GGTAGWLAAN HLGAELCHDP EIEITLIESL DVPTIGVGEG SVPYLAKGLK RFGISEAEML LTCDATFKQG IKFVNWLDPE RHGENHYYHS FDTPYPAGVD VSNYWLANAA DRPFDDVGIQ ARICELGLSP KRKGSGDYEG ALSYAYHFNA LKFAVLLANN AKSRFKVKHQ FATIKGAEVC QSGRITHLVT ADDTSLAFDF YVDCSGFASI LIDKTLKVPF VSKADELLTD TVLVQQVALG ADEEINPYTT ATAHKAGWIW DIPLTTRRGT GFVYSSRHMS DDEALGLYAD YLSIDKAQFN PRKIPMKVGY RETFWHQNCV ALGLAQGFVE PLEATSILVS DFSAELLALN FPRHLEDIDV LTPHYNQVTT YVWERVVDFI KLHYCISDRT DSDFWLDNKK SETISEELHQ RLARFALRPP YASDFFGRFE LFDHKNFLYV LYGMKFNTAA TVLSAQEVQH CDALMKGNDQ LVVKAASMLL KHRDWLEGLK QAFEQARSTK V
|
| |
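The sequences above are displayed in space-separated groups of ten characters, so they need normalizing before any computation. The following is a minimal sketch, not a reference implementation: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard assignment that translation table 11 shares with table 1 for elongation codons, and the sample input is only the first three groups of the gene sequence rather than the full record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code; it
# differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Sample only: the first three 10-base groups of the gene sequence above.
sample = "ATGAAAATCA AACGCATTGC AATTATTGGT"
print(translate(sample))  # MKIKRIAIIG
```

The translated sample matches the first ten residues of the protein sequence in the record. Note that the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA), so no reverse complement is needed here even though the gene lies on the minus strand of the replicon.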