Gene Information |
Locus tag | Tery_3110 |
Symbol | |
ID | 4244201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4761458 |
End bp | 4762408 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638108123 |
Product | taurine dioxygenase |
Protein accession | YP_722716 |
Protein GI | 113476655 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
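The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the reported 951 bp agrees with that reading) and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4761458
END_BP = 4762408


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 951 316
```

Both results match the Gene Length and Protein Length fields of the record.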
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAA TACCTCAAAC TCAGGAAAAT AAACATTTAG AGTCAAATGT ATATAATAAC TTTAATGTTT ATCCCCTCGC TGGACGTATC GGAGCAGAAA TAGTTGGACT TGATCTTAAG CAGACTCTTA GCGATGAAAC AATTCATGAT ATTCGCCAAG TATTAATCAA GTATAAGGTA ATTTTCTTTA GACAGCAAGA GCTTACTGAA ATAAGTCAGG TTGCCTTTGC CCGTCAATTT GGGATTCTTA CCACTGCACA TCCACTACTT TCATCTTTAC CTGGTCATCC AGAGATCTTT GACTTCGATT ATGGACGTAT GGACAACCGA ACTAATCAAT GGCATACGGA TGTGACATTT ATTGATCGTC CTCCTTTTGC CTCAATTTTG CGTGCAGTTG AAATACCTGC CGTTGGAGGA GATACAATCT GGGCAAATAC TGTGACAGCT TATCAAGATA TGCCTATACC ACTGCGTAAC TTTGCTAACC AGCTTTGGGC GGTTCATAGC AATACATATA ATGACTATCT AGGGGCAACT GCAAATATAT CGAAAAAACG ACAAGAACTA GGTAAAATTT TCACTTCAAT TGAATACCAG ACATTACATC CAGTAGTTCA GGTTGTTCCT GATTCGGGTG AAAGAGGGCT GTTTATTGGT GCTTTTGTCC GCCAACTTCA AGGTTTTTCG ATAAATGAAT CAATGCAGAT ACTGAAGATA TTGCAATCCT ATATAATACG TCCGGAAAAT ACTGTACGTT GGCATTGGGA ACAAGGCGAT ATTGCTTTCT GGGATAATCG AGTAACGCAA CATTATGGGA TCAATGATTT TGGTTCCCAG CCTCGTCGTG TTCAACGGGT AACAATTGCG GGAAACTTAC CTATTAGCCT TGAAGGTATA GAAAGCAAGT CAGTTAAGGG GGATGCCTCT GCCTATAACA GACTGAAATA G
|
Protein sequence | MSKIPQTQEN KHLESNVYNN FNVYPLAGRI GAEIVGLDLK QTLSDETIHD IRQVLIKYKV IFFRQQELTE ISQVAFARQF GILTTAHPLL SSLPGHPEIF DFDYGRMDNR TNQWHTDVTF IDRPPFASIL RAVEIPAVGG DTIWANTVTA YQDMPIPLRN FANQLWAVHS NTYNDYLGAT ANISKKRQEL GKIFTSIEYQ TLHPVVQVVP DSGERGLFIG AFVRQLQGFS INESMQILKI LQSYIIRPEN TVRWHWEQGD IAFWDNRVTQ HYGINDFGSQ PRRVQRVTIA GNLPISLEGI ESKSVKGDAS AYNRLK
|
| |
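The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own procedure. It assumes that translation table 11 assigns the same amino acids to codons as the standard code during elongation (the tables differ in which codons may act as start codons), and it does not handle alternative start codons such as GTG or TTG, which is not an issue here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
prefix = "ATGAGTAAAA TACCTCAAAC TCAGGAAAAT"
print(translate(prefix))  # MSKIPQTQEN
```

Passing the full Gene sequence field to `translate` should give a string that can be compared against the Protein sequence field once its spaces are removed with `clean`, and `gc_percent` can be compared against the GC content field.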