Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4376 |
Symbol | |
ID | 4246029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6743830 |
End bp | 6744918 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109263 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_723840 |
Protein GI | 113477779 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0936456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.529135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA GTATTAAACT TATTTCCAAT CGTGCCTTAT TTTACTTGGC CATTCTACCT GTAAGCTGTG GTGTTTTTGC TTGGCAGGGT TGGAGTTGGT GGAGTTGGGT AAGTAGACCT GTAGTTTCAC CAACATCCTC TACTCAGTCT TCACAAGCTA ATGCTATAAG AATTAAAATT CCTGTGGGAA CTTATGGTCA ACAAATAGGT GAGTATTTAG AAGATGCTGG TATTATTCGC TCTGCAACAG CTTGGAATTT ATGGGTAAAA TGGTTGAGTC TACAAAATCC CAATCTTGAG TTTAAAGCTG GAACTTATAA TTTATTGCCT ACAGAACCAC TAAGCGCGAT CGCAGATAAA ATTCTACAGG GAGATGTAGT TAAACTCAGC TATGTCATTC GTGAAGGATG GTCAATTCAA CAAATGGCTG CATATTTGGA TGATGAAGGT TTTTTTCCAG CTGCTGATTT TATTGCAGCA ACAAAAAATA TTCCCTATGA TAAGTTTCCA TGGTTACCAA CTAATATACC TCATCTAGAG GGTTATTTAT TCCCAGATAC TTATAAAATA GTAGCGGATA ATATTACTCC AGAAGCTATT ATCAATCAAA TGATAGGACA GTTTGAACAA GTAGCTTTGC CAGTTTATCA GAAAAACCAG AACAATACAA CAAAATTGAG TCTTCATGAA TGGGTAAGTT TAGCAAGTAT TGTAGAAAAG GAAGCTGTAG TTGCACAAGA ACGTGGTTTA ATTTCGGGGG TGTTTAATAA CCGTTTGGAA CAGGGTATGA GGTTAGCAGC AGACCCAACA GTAGAATATG GTCTTGGTAT TCGTCAAACG AAAGATAAGC CTCTTACTTA TAGTCAGATT GAAACTCCTT CACCTTATAA TACTTATATG AATACTGGGT TACCACCAAC TCCTATTTCT AGTCCAGGTA AGGCCAGTTT GGAAGCAACT CTTAATCCAG AAGATACAGA ATATTTGTAT TTTATGGCTC GCTATGATGG TACCCATATT TTTAGTCGTA CTGCTAGAGA ACATGAGGCT GCTATTGCAG AGGTAGAGAG ATTGTTATCA TCTCAGTAA
|
Protein sequence | MKKSIKLISN RALFYLAILP VSCGVFAWQG WSWWSWVSRP VVSPTSSTQS SQANAIRIKI PVGTYGQQIG EYLEDAGIIR SATAWNLWVK WLSLQNPNLE FKAGTYNLLP TEPLSAIADK ILQGDVVKLS YVIREGWSIQ QMAAYLDDEG FFPAADFIAA TKNIPYDKFP WLPTNIPHLE GYLFPDTYKI VADNITPEAI INQMIGQFEQ VALPVYQKNQ NNTTKLSLHE WVSLASIVEK EAVVAQERGL ISGVFNNRLE QGMRLAADPT VEYGLGIRQT KDKPLTYSQI ETPSPYNTYM NTGLPPTPIS SPGKASLEAT LNPEDTEYLY FMARYDGTHI FSRTAREHEA AIAEVERLLS SQ
|
| |