Gene Information |
Locus tag | Tery_4537 |
Symbol | |
ID | 4246191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7000589 |
End bp | 7001437 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109414 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_723990 |
Protein GI | 113477929 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0574079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAA CTAAACTTGT ATCCAAGTCT CACTCGGAAG ATCAAACAGT TGTCAATATT TCTGAACAGA CAACGGTAGG AGGTACAAAT TTTTTAATTA TTGGAGGTCC GTGCACTGTT GAAAGTATGG AACAAATGAC AGCAGTAGCA AATAAGTTAA TAATGGCACC TGTGCAAATG TTCCGGGGAG GTGTCTACAA ACCCCGCACT TCTCCCTACT CATTTCAGGG GTTGGGAATA GAAGGATTAA AAATTTTGGC ATCCGTTCGC CAACGCTACA ATATCCCAGT AGTAACAGAA GTTATGTCTA TTCCGCAAAT AGAGTCAGTA GTTGCTTATG CAGATATGCT GCAAATAGGT AGTCGCAATA TGCAAAACTT TGAGTTACTC AAAGCAGTTG GGAAAGCTGG CAAATCTGTG ATTTTGAAAC GGGGGCTAGC TGCAACTATT GAGGAATTTT TAATGGCAGC TGAATATATT CTCAGTTATG GCAACCCTAA TGTGGTCTTA TGTGAAAGAG GTATTCGTAG TTTTGATAGT TATACTCGTA ATGTTTTAGA TTTGGGTGCG GTAGCTGCAT TGAAACAGTT GACTCATTTG CCAGTTATTG TCGATCCTTC TCATGCTGCT GGTAGAAGGG AGTTGGTGGC ATCCTTGGCA AAAGCTGCGG TTGCCTGCGG AGCGGATGGT TTGATGATAG AGTGTCATCC TATACCGGAG AAGTCGGTTT CGGATGCCCA GCAAGCATTG TCGTTGGAAG ATATGGTGAA GTTAGTTCAA AGTTTACGCC CTATTGCTAC TGCAGTAGGG CGATATATTT GTGAACTTGA GGAGTTAGCT GTGGCTTAA
|
Protein sequence | MSKTKLVSKS HSEDQTVVNI SEQTTVGGTN FLIIGGPCTV ESMEQMTAVA NKLIMAPVQM FRGGVYKPRT SPYSFQGLGI EGLKILASVR QRYNIPVVTE VMSIPQIESV VAYADMLQIG SRNMQNFELL KAVGKAGKSV ILKRGLAATI EEFLMAAEYI LSYGNPNVVL CERGIRSFDS YTRNVLDLGA VAALKQLTHL PVIVDPSHAA GRRELVASLA KAAVACGADG LMIECHPIPE KSVSDAQQAL SLEDMVKLVQ SLRPIATAVG RYICELEELA VA
|
| |
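The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the original record: the helper names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical. It assumes coordinates are 1-based and inclusive, that the gene sequence is listed in coding orientation (it begins with ATG even though the gene is on the minus strand), and that translation table 11 assigns the same amino acid to each codon as the standard code.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycling through TCAG).
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates from the record: Start bp 7000589, End bp 7001437.
length = gene_length(7000589, 7001437)
print(length)            # 849
print(length // 3 - 1)   # 282 (codons minus the stop codon)

# First three blocks of the gene sequence from the record.
print(translate("ATGTCTAAAA CTAAACTTGT ATCCAAGTCT"))  # MSKTKLVSKS
```

Passing the full "Gene sequence" value to `translate` and `gc_percent` in the same way gives values that can be compared with the "Protein sequence" and "GC content" fields.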