Gene Information |
Locus tag | Tery_4068 |
Symbol | |
ID | 4242096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6281793 |
End bp | 6282737 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638108971 |
Product | O-succinylbenzoate synthase |
Protein accession | YP_723552 |
Protein GI | 113477491 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000591111 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGCATTATA AACTAAAAAT TTTTCCCTAT AAACGGAACT TCAAACAACC TTTAAAAACA AGTCATGGTA TTTGGCGTAT TAGAGAAGGA ATAATTTTAA AACTAACCAA TAAAACTGGA GAAATAGGTT TAGGTGAAAT TGCACCTTTA AGTTTTTTTG GTTCGGAAAC TTTGTCGGAA GCTTTAGATT TTTGTCACCA GTTACCACCA AAAATTACAA CAGAAACTAT TTTTTCTATT CCCGATAATT TACCTAGTTG TAAATTTGGC TTTGAGTCAG CTTGGGAAAA TTTTCAAGAA GTACAAGTAC AGGAAAAAGA AGAATTAAAC CCGAAATTTT ACAGTGCTCT ATTACCAGCA GGAAAAGCAG CTTTAGAAAC TTGGCAGCAA CTTTGGCAAA AAGGATACCG CACTTTCAAA TGGAAAATTG GACTTGATCA AATTCAAACA GAAATAAAGA TATTTCAGCA GTTAGTTAAA GAACTCCCTA CCGGAATAAA TTTAAGATTG GATGCCAATG GAGGATTAAA TTTTGAAGAA GCAAAACAAT GGTTAGAAAT ATGCGAAAAT ATCAATATAA TTGAGTTACT TGAGCAACCT TTACCTGTTG ATAAATTTAT CGAAATGTTG GATTTAAGTA ATTTCTATTC TACCCCTATT GCCTTAGATG AATCTGTTGC TAACCTCAGC AAAATGCAAC AATATTATCA ACAAGGTTGG CAAGGAATAT TCGTAATTAA ACCTGCTATT TTTGGTTCCC GTATTGATCT CTGTAACTTT TGTCAAAATT ATCTCATTGA TGCAGTTTTT TCTTCAGTAT TTGAAACAAA AATTGCTAGA AAAGCATCAT TGCAGCTTGC CACAAAATTA CAACCAAACT TAAGAAAAAA CCGTGCTTTT GGTTTTGGTA TCACCCACTT TTTTGATGCA GATATAGAAA ATTAA
Protein sequence | MHYKLKIFPY KRNFKQPLKT SHGIWRIREG IILKLTNKTG EIGLGEIAPL SFFGSETLSE ALDFCHQLPP KITTETIFSI PDNLPSCKFG FESAWENFQE VQVQEKEELN PKFYSALLPA GKAALETWQQ LWQKGYRTFK WKIGLDQIQT EIKIFQQLVK ELPTGINLRL DANGGLNFEE AKQWLEICEN INIIELLEQP LPVDKFIEML DLSNFYSTPI ALDESVANLS KMQQYYQQGW QGIFVIKPAI FGSRIDLCNF CQNYLIDAVF SSVFETKIAR KASLQLATKL QPNLRKNRAF GFGITHFFDA DIEN
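
The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence ends in a stop codon that is not counted in the protein length (the listed sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the final stop codon,
    if present, does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * (bases.count("G") + bases.count("C")) / len(bases)


# Values taken from the record above.
length = gene_length(6281793, 6282737)
print(length, protein_length(length))
```

With the record's coordinates this reproduces the listed Gene Length (945 bp) and Protein Length (314 aa). `gc_percent` can be applied to the Gene sequence field as printed, since it strips the block spacing before counting.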