Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spea_2204 |
Symbol | |
ID | 5662597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | + |
Start bp | 2673302 |
End bp | 2674417 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641236807 |
Product | trimethyllysine dioxygenase |
Protein accession | YP_001502059 |
Protein GI | 157962025 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02410] trimethyllysine dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.20714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTATTC ACAAATGTGA ACGTGTAGAT GAAAATCTTG CGATTCAATT TTCTAATAAT AATGAAGCAA ACTTCAGTTT ATTTTGGTTA AGAGATCACA GTACCGATCA GTCCAGCCTT AACCCAGATA CATTACAACG AGATGTTGAA ACCTTTTCCT TACCGACAGT GCCACAGGTA GAACAGTTCC AGATTATCAA TAACGGCGCT CAACTGCGCA TTCATTGGCT AGATGGTGAT CTAGTCAGCG AGTTTGATGC TAGCTTTCTT TTTAATATGG CCTGTACGCC AGTTAATGAC CCCTCATATC AGTTATGGGC TAACGAGCTA CAAAACCAAG TCCCAGATTT TGATTTTGAA CAGGTAACCG CTAACGATGC TGCTTTTTTG CCAGTATTAG AAAGTATGGA TCGGTATGGT TTAGTGACAT TTTCAGGCAT GCCTTCAAAT ATGGAAGCGA CCAAAAAGTT ACTCAATCAA GTCGGGTATA TCCGAGATAC CGTATTTGGT AGTTTATGGG ACTTTTCTAA TAATGGCGCT CATAGTGACA GTGCCTATAC CAGTGTGGGG ATTGGGCTAC ATACCGACAG TACATATACA CTTGACCCGC CAGGCTTACA GTTATTGCAT TGTTTGGCCT TCGACGGCGA AGGGGCCTTT AACCAATTTG CCGATGGCTT TAAAGTTGCT CAAACCATTA AAAATGAAGA CCCTGCCGCT TATGAGACTC TGAGCAAGAT TAAGGTGCCT GCCCATTATA TTGAGCCCGG CATCCAATTA AGAGGCCAGC ATGAGGTTGT GCGCGAAGAT ATTAATGGCC AGTTTGAACA AATTTGTTTT AATAACTTTG ATCGCAGTCC CTTTATGCTA AGTGCTAGTG AGCAAAAGGC TTTTTATCAT GCTTATGGCC TGTTTCAACG TTTAATTAAC GACCCTAAAT ATCAGGTGAA CTTTCAATTA CAACCAGGAC GTGCGGTATG GTTTGATAAT TGGCGTGTTC TTCATGCGCG TAGCGCTTTT AGCGGCTTCC GACATCTTGC CGGCGGCTAC ACTAATCGAG AAGATTACAT AAGTAAAAAG CTAACCTTAA AAGGGAAAAC ACCATGGCAG GAGTAA
|
Protein sequence | MFIHKCERVD ENLAIQFSNN NEANFSLFWL RDHSTDQSSL NPDTLQRDVE TFSLPTVPQV EQFQIINNGA QLRIHWLDGD LVSEFDASFL FNMACTPVND PSYQLWANEL QNQVPDFDFE QVTANDAAFL PVLESMDRYG LVTFSGMPSN MEATKKLLNQ VGYIRDTVFG SLWDFSNNGA HSDSAYTSVG IGLHTDSTYT LDPPGLQLLH CLAFDGEGAF NQFADGFKVA QTIKNEDPAA YETLSKIKVP AHYIEPGIQL RGQHEVVRED INGQFEQICF NNFDRSPFML SASEQKAFYH AYGLFQRLIN DPKYQVNFQL QPGRAVWFDN WRVLHARSAF SGFRHLAGGY TNREDYISKK LTLKGKTPWQ E
|
| |