Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spea_3898 |
Symbol | |
ID | 5664282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | + |
Start bp | 4745376 |
End bp | 4746665 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641238562 |
Product | tryptophan halogenase |
Protein accession | YP_001503743 |
Protein GI | 157963709 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACTG ATAAAGCTGT TCAGAATGAG TTTGAAACGG ATGCCTGTGA AAGCATCGAT GTCGCCATTA TTGGAGCAGG CCCTTCGGGC AGCATCGCCG CCAGCTTGTT ACAGGCTCAA GGAAAGAAGG TAGTCGTGCT TGAGAAACAA TACTTCCCGC GCTTCTCTAT TGGCGAAAGC TTATTGCCCT GCTGCATGAC GGTGATCGAA GAGGCCAATA TGCTGGAAGC GGTAAATCAA GCTGGGTTTC AATTTAAAAA TGGCGCCGCC TTTAAATACC AAGATAGCTA TACCAGCTTC GACTTTACCG ATAAGTTTAC AGCGGGCCCC GGCACCACTT TTCAAGTTGA GCGCGCGCAA TTCGATAAAC TTCTAGCCGA TGAAGCGGTA AAACAGGGGG TCGATATTCG CTATGGTCAT AGCGTAGAGT CGATCGATTT AGCGGCTGAG CCAATGCTTA ACATCATTGA TGATAAAGGC GGCTCACAAA GCTTAAAAGC CAAGTTCGTG CTCGATGCTA GCGGCTTTGG CCGCGTGCTA CCTAGGTTAC TCGATCTTGA GAAGCCTTCT AGCCTGCCGA CCCGCAGCGC CGTGTTTAAC CACGTGCGAG ACAACATTAG CGATCCGAAC TTCGATCGTA ATAAGATCTT GATAAGCGTG CACCCAGATA ACCAAGATAT CTGGTACTGG TTGATCCCCT TCAGCGATGG TCGCTGCTCG GTAGGAGTTG TCGGAGAGCC GCATCAACTC GAGGGTTTAC ATGACGGCGT TGACTGCGAT ATTGACGGCG ATCTAAACAG CATACTCAAG ACCATGCTCA ACCAAGAGCC AGGGCTGAAA AAACTGCTGG CAAACGCCGA GTTAATTAAT GAGTCAGGCC TGCTAAAAGG CTACTCAGCC AACGTCACAA CCTTAGCCAC GGATAAGTTT GCCCTATTGG GAAATGCGGG GGAGTTTCTC GACCCGGTGT TTTCATCTGG AGTCACCATC GCCATGCAGT CGGCGTCGAT GGCCGCTAAG ACTCTGATAA AACAACTCGA TGGCGAGAGC GTCGATTGGC AGCAAGATTA CGCCGCCCCT TTGATGAGAG GGGTCGATAC CTTTAGAACC TATGTCGAAG CTTGGTACGA CTGTCGCTTT CAGGATGCGA TCTTCTTCAA GGATCCCGAT CCTAAGATCA AACAGATGAT CTGCTCGATT CTGGCAGGTT ATGCCTGGGA TGAGAAAAAC CCGTTTGTTA GCGAGTCTAA ACGTCGCCTT AATATGGTAG TCGAACTATG TCGCAGTTAA
|
Protein sequence | MPTDKAVQNE FETDACESID VAIIGAGPSG SIAASLLQAQ GKKVVVLEKQ YFPRFSIGES LLPCCMTVIE EANMLEAVNQ AGFQFKNGAA FKYQDSYTSF DFTDKFTAGP GTTFQVERAQ FDKLLADEAV KQGVDIRYGH SVESIDLAAE PMLNIIDDKG GSQSLKAKFV LDASGFGRVL PRLLDLEKPS SLPTRSAVFN HVRDNISDPN FDRNKILISV HPDNQDIWYW LIPFSDGRCS VGVVGEPHQL EGLHDGVDCD IDGDLNSILK TMLNQEPGLK KLLANAELIN ESGLLKGYSA NVTTLATDKF ALLGNAGEFL DPVFSSGVTI AMQSASMAAK TLIKQLDGES VDWQQDYAAP LMRGVDTFRT YVEAWYDCRF QDAIFFKDPD PKIKQMICSI LAGYAWDEKN PFVSESKRRL NMVVELCRS
|
| |