Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spea_1098 |
Symbol | |
ID | 5661497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | + |
Start bp | 1328192 |
End bp | 1329718 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641235644 |
Product | tryptophan halogenase |
Protein accession | YP_001500960 |
Protein GI | 157960926 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTA AAAGAATAGC GATTGTTGGT GGTGGTACAG CGGGCTGGCT GGCGGCAAAC CATCTTGGTA AAGCGCTGCA AGAAAATGAT CAGTTATCAA TTACGTTAAT CGAATCTCCG GATATCCCAA CCATTGGGGT CGGGGAGGGC ACGGTACCCG CAATACGTCA GTCGCTGCAA AGCTTTGGCA TTAGCGAGAC CGAGTTTATT CGCCGCTGTG ATGTCACCTT CAAACAATCG ATAAAGTTCG TTAACTGGCT CGACAAGAGC CAGCATGGGC AAGATAACTT TTATCACCAT CTGTTCGATA TGCCTGGCTC AGTGCTGCAA GATTTAACTG CTCGCTGGTT AGCGAACAAA AATAGTTGTT ACGCTGGTTC GGTATCGCCG CAGCATGAAG TCTGTGAAGC TTTTAGGGCA CCAAAGAATA TAAGCGATCC TGAATATGTG GGAAAGCTCG GTTATGCCTA TCACCTCAAC GCGGCTAAGT TTGCTAAATT GCTCGGTGAA AATGCTAAAC AGCGATTTCG AGTTGAGCAC CTTTGGGCTA ATGTGCAGGA AGTTATTCTT GGGAAAGACG GTGAGATAGT ATCTCTAGTC ACAGATAGTG CAGGCAAGCT GGATTTCGAT TTCTATATTG ATTGTAGCGG CTTTGCTTCA ATACTTATAG ATAAGGCGCT TAAGGTTCCT TTTTTAAATA AGGCTGAGCA GTTGTTTGTC GACAAGGCAG TAGTCGTTCA AGTTCCCACT GCCAATGACG ATGTTATCCC TCCCTTTACC ATTTCTACCG CCCATCAGGC TGGTTGGATT TGGGATATAG CCTTAAGTAA TCGCCGCGGT ATTGGCTTGG TCTACTCTTC TAAATATATG GATGATGAGA CCGCAATCAG CAAGCTAGAT ACCTATCTAG GCGGTACGCT TGCTGAGCAT CAGCATAGAA TCATTCCAAT GACAGTCGGT TACCGTGAGC GGTCATGGGA GAAGAACTGT GTGGCATTGG GCTTGGCGCA AGGCTTTTTG GAGCCATTAG AGGCGACATC GATTCTTTTG ACCGACTTTG CTGCCGGATT TTTAGCTCAA CGCTTTCCAG ATACTACTTC TCAGTTAGCC GCAATACAGC AGCGTTTTAA TCACGTTATG GGTTATGCCT GGGAGCGAGT AGTAGACTTT ATTAAGCTAC ATTATTGCCT GTCGGATCGT GAAGATTCAC AATTTTGGAT AGATAATCGT GATCCGGAAA CTATGTCTGT AGAGCTCAAA AATCGGTTAG CACAGTGGCA AGATTTTGTG CCCTGTAGAG AAGACTTCTT TAGCAGGTTC GAGGTATTTG ATTTAGAAAA TTACCTCTAT GTTCTGTACG GTATGCAGCA TACCCCTGAG CGGAGGCAAA GGAACCTTGA GCTAGGGGCT GAATCCTGTG CTTTACAGCA TAGACTGCAT ACTGTGGCTA AACAGCTTAC AGCCGAGTTA CCGCAGCATA GAGACTTGTT AGAGAAAATA AAGCGTTACG GTTTGCAGAA GGTATAG
|
Protein sequence | MDIKRIAIVG GGTAGWLAAN HLGKALQEND QLSITLIESP DIPTIGVGEG TVPAIRQSLQ SFGISETEFI RRCDVTFKQS IKFVNWLDKS QHGQDNFYHH LFDMPGSVLQ DLTARWLANK NSCYAGSVSP QHEVCEAFRA PKNISDPEYV GKLGYAYHLN AAKFAKLLGE NAKQRFRVEH LWANVQEVIL GKDGEIVSLV TDSAGKLDFD FYIDCSGFAS ILIDKALKVP FLNKAEQLFV DKAVVVQVPT ANDDVIPPFT ISTAHQAGWI WDIALSNRRG IGLVYSSKYM DDETAISKLD TYLGGTLAEH QHRIIPMTVG YRERSWEKNC VALGLAQGFL EPLEATSILL TDFAAGFLAQ RFPDTTSQLA AIQQRFNHVM GYAWERVVDF IKLHYCLSDR EDSQFWIDNR DPETMSVELK NRLAQWQDFV PCREDFFSRF EVFDLENYLY VLYGMQHTPE RRQRNLELGA ESCALQHRLH TVAKQLTAEL PQHRDLLEKI KRYGLQKV
|
| |