Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3042 |
Symbol | |
ID | 4244692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4694429 |
End bp | 4696336 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638108072 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_722665 |
Protein GI | 113476604 |
COG category | [H] Coenzyme transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00899541 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATCTAA GCGAAATTAC TCATCCTAAA CAGTTACACA ATCTTTCAAT TCATCAACTA GAAGAAATTG CTAGACAAAT ACGAGAGAAA CATCTAGAAA CAGTTGCCAC AAGTGGAGGC CATCTAGGTC CAGGGTTAGG AGTTGTAGAG CTGACATTAG GTCTCTACCA AACTCTCAAT CTAGATCGGG ACAAAGTTAT TTGGGATGTA GGCCATCAAG CTTATCCTCA TAAAATTATC ACAGGTCGAT ATCACAATTT CCATACTTTG CGACAAAAAG ATGGTATTGC AGGTTATTTA AAACGATGTG AAAGTAAATT CGACCATTTT GGTGCTGGTC ATGCCTCTAC AAGTATCTCT GCTGGTTTGG GGATGGCCTT AGCCAGAGAT ATGAAAGGAG ATAACTTTAA AGTAGTTTCC ATTATTGGGG ATGGTGCTTT AACTGGTGGT ATGGCTTTAG AAGCCATTAA CCATGCAGGT CACTTACCCA AAACTAACAT ACTAGTTGTG TTAAACGATA ACGAGATGTC CATTTCTCCT AACGTAGGTG CAATTTCTCG CTATCTCAAC AAAATGCGCT TATCCCCACC GATACAGTTT CTTCAGGATA ACTTAGAAGA ACAATTCAAA CAGATTCCTT TTGTTGGTGA AACCTTTACA CCAGAAATGG AAGGCCTCAA AGGAGGAATG AAGCGTTTAG CAGTGTCAAA AGTGGGGGCT GTAATTGAAG AATTGGGCTT TACCTATATG GGTCCTGTTG ATGGGCATAA TTTAGAAGAA TTAATTACAA CTTTCAACCA AGCTCATCAA ATTCCCGGAC CAGTATTAGT TCATGTTGCC ACAACTAAGG GTAAAGGATA CCCAGTTGCT GAAGAGGATA AAGTTAGTTA TCATGCTCAA AATCCCTTTA ATTTGGCCAC GGGTAAAGCT TTACCAGCAA GTAAGCCAAA GCCTCCTAAA TACTCTAAAG TTTTTGCCCA TACTTTGGTG AAACTTGCGG AAAATAACCC CAAAATTATT GGTATTACTG CTGCTATGGC AACTGGCACA GGTTTAGATA AACTTCATGG GAAACTGCCA AAACAATATA TAGATGTTGG TATTGCAGAA CAACATGCTG TGACTTTGGC AGCAGGTTTA GCTTCTGAAG GAATGCGACC AGTGGTTTGT ATTTATTCAA CTTTCTTGCA ACGAGCTTAT GACCAAATTA TCCATGATGT CTGTATTCAA AAGTTACCTG TATTCTTCTG TTTAGACCGT GCTGGTATTG TAGGTGCAGA TGGTCCGACT CACCAGGGAA TGTATGATAT TGCTTATTTA CGTTGTATCC CAAATATGGT AGTTATGGCG CCTAAGGATG AGGGTGAGTT ACAACGGATG GTGTTGACTG GTATTAAACA TACGGATGGG GCGATCGCTA TGCGTTATCC TCGTGGTAAT GGTTATGGTG TGCCTCTGAT GGAAGAGGGT TGGGAAGCTA TTACTATTGG TAAGGGTGAG ATTCTGCGGA ATGGTGATGA TGTGCTGATA TTAGGTTATG GGTCTATGGT CTATTCGGCT ATGCAAACAG CAGAAATTCT CAGCGAGCAT GGTGTTGCTG CTACTGTCGT AAATGCTCGT TTTGTCAAGC CTTTGGATAC AGAATTAATT CTACCTTTGG CACAACGCAT TGGTCAGGTT GTGACTATGG AGGAAGGTTG TTTGATGGGT GGTTTTGGTT CGGCAGTTAC GGAAGCATTG ATGGATAATA ATGTACTTGT ACCTGTTTTA CGTTTGGGTG TACCTGATAA GTTGGTGGAT CATGCTAAAC CTGATGAGTC GAAGGCTGAT TTGGGTTTGA CTCCTTCTCA AATGGCAGAA CGTATTTTGC AGAGTTTTAA GCCTAGGTTA TCGACTATTA ATGTTTAA
|
Protein sequence | MHLSEITHPK QLHNLSIHQL EEIARQIREK HLETVATSGG HLGPGLGVVE LTLGLYQTLN LDRDKVIWDV GHQAYPHKII TGRYHNFHTL RQKDGIAGYL KRCESKFDHF GAGHASTSIS AGLGMALARD MKGDNFKVVS IIGDGALTGG MALEAINHAG HLPKTNILVV LNDNEMSISP NVGAISRYLN KMRLSPPIQF LQDNLEEQFK QIPFVGETFT PEMEGLKGGM KRLAVSKVGA VIEELGFTYM GPVDGHNLEE LITTFNQAHQ IPGPVLVHVA TTKGKGYPVA EEDKVSYHAQ NPFNLATGKA LPASKPKPPK YSKVFAHTLV KLAENNPKII GITAAMATGT GLDKLHGKLP KQYIDVGIAE QHAVTLAAGL ASEGMRPVVC IYSTFLQRAY DQIIHDVCIQ KLPVFFCLDR AGIVGADGPT HQGMYDIAYL RCIPNMVVMA PKDEGELQRM VLTGIKHTDG AIAMRYPRGN GYGVPLMEEG WEAITIGKGE ILRNGDDVLI LGYGSMVYSA MQTAEILSEH GVAATVVNAR FVKPLDTELI LPLAQRIGQV VTMEEGCLMG GFGSAVTEAL MDNNVLVPVL RLGVPDKLVD HAKPDESKAD LGLTPSQMAE RILQSFKPRL STINV
|
| |