Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4462 |
Symbol | |
ID | 4246115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6880579 |
End bp | 6882291 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109345 |
Product | bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II/unknown domain fusion protein |
Protein accession | YP_723922 |
Protein GI | 113477861 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAAC AACAAAACCA AAAATTTAAA TTTGACCCCA TTGATACTGC CCTAGCAGAC ATTAAAGCAG GTAAGTGCGT CGTAGTAGTT GATGACGAAC ATCGAGAAAA CGAAGGAGAT GTTATTTGTG CAGCTCAATT TGCTACTCCA AACATGATTA ACTTTATGGC AGTAAAAGCC AGAGGTCTAA TTTGTCTCGC ATTGACAGGC GATCGCCTCG ACCAACTAGA AGTTCCACTG ATGGTTACCA ACAATACTGA CAGTAACCAA ACCGCTTTCA CCGTTAGTAT TGATGCCTCC CCAGAATTGG GAGTCTCCAC AGGTATCTCA GCAGAAGACC GAGCCCGAAC TATCCAAGCT GTTATTAATT CTAACACCAA ACCAGAAGAT TTACGTCGCC CCGGTCATGT ATTCCCCATT CGAGCCAAAG AAGGAGGAGT CTTAAAAAGA GCAGGTCATA CAGAAGCTTC CATTGACTTA GCTAGACTAT CAGGTTTATA TCCAGCCGGA GTTATTTGTG AAATTCAAAA CCCAGATGGT TCCATGGCAA GATTACCTCA ACTAGTAGAA TATGCCCAAA CTCATAACCT CAAACTCATT AGTATTGCCG ACATCATCAG TTATCGCATC AAACACGATC GCTTTGTCTT CCGAGAAGCT GTTGCTAAAT TACCCTCTCA ATTCGGTAAC TTCCAAATTT ATGCTTACCG CGATACTCAA AATAATTTAG AACATATAGC TATTGTTAAA GGCAACCCAG CAGAATTTTC CCAAAGAGAT ATAATGGTGC GGGTCCACTC TGAGTGCTTA ACTGGAGATG CTTTTGGTTC TCTGCGTTGT GACTGTAGAA TGCAATTACA AGCAGCAATG AAAATGATTG AACATGCAGG TGCAGGAGTC GTTGTTTATT TACGACAGGA GGGCAGAGGG ATAGGGTTAG TTAATAAACT TAAAGCCTAT TCATTGCAAG ACATAGGATT AGATACTGTG GAAGCTAATG AAAGACTAGG CTTTCCAGCA GATTTACGCA ACTATGGTGT AGGAGCCCAA ATATTACATG ATCTAGGGGT CAATAAAATG CGTTTAATTA CAAATAACCC CCGTAAAATA GCAGGTTTAC ATGGTTATGG TATTGAAATA GTAGATCGAG TACCTTTGTT AATTGAAACG ACAGATTATA ATTCTGCTTA TCTAGCCACT AAGGCTCAAA AATTAGGTCA TATTTTGTTA CGGAGTTATT TAGTAACAAT TGCTATTAAT TGGAATAATC AAAAAATCGA AGAAGAAACT TTCGATAATT CTGATCATAT AAATATGAAG TCTTTAGCTC AACAGCGGTA TCAATATTTA GAAAAACTAC GCAGTTTGAT CAAAGAATAT GATTTTCTGT TGCAGGAAGA AACAAGGCCA GTAGCAACTG CAGTCTTTGC CCAAGCTCCT TTAATCGTTA ATTTTGGTTT AGAACAAGCA ACATTAACTA CATCTAAATG GTATCAAGAA TCAAATAATC CTTATTTGGT AGCGATCGCT AAAGTCTTAA CAGAAATAGC CCAATGGCAG AATATGTTAA AATTAGAATT TATCATTGCT TCTGGTTTAG ATCCTATGAT CGCCTTACAG ATAAAACTAG AACGTCAGAC CTTAGAAATT ACTGAATTAT CCACAGCTAT GGAACATTTA GAAACACAAA AAATTTACAG TTTAAAAATT TAG
|
Protein sequence | MNKQQNQKFK FDPIDTALAD IKAGKCVVVV DDEHRENEGD VICAAQFATP NMINFMAVKA RGLICLALTG DRLDQLEVPL MVTNNTDSNQ TAFTVSIDAS PELGVSTGIS AEDRARTIQA VINSNTKPED LRRPGHVFPI RAKEGGVLKR AGHTEASIDL ARLSGLYPAG VICEIQNPDG SMARLPQLVE YAQTHNLKLI SIADIISYRI KHDRFVFREA VAKLPSQFGN FQIYAYRDTQ NNLEHIAIVK GNPAEFSQRD IMVRVHSECL TGDAFGSLRC DCRMQLQAAM KMIEHAGAGV VVYLRQEGRG IGLVNKLKAY SLQDIGLDTV EANERLGFPA DLRNYGVGAQ ILHDLGVNKM RLITNNPRKI AGLHGYGIEI VDRVPLLIET TDYNSAYLAT KAQKLGHILL RSYLVTIAIN WNNQKIEEET FDNSDHINMK SLAQQRYQYL EKLRSLIKEY DFLLQEETRP VATAVFAQAP LIVNFGLEQA TLTTSKWYQE SNNPYLVAIA KVLTEIAQWQ NMLKLEFIIA SGLDPMIALQ IKLERQTLEI TELSTAMEHL ETQKIYSLKI
|
| |