Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00531 |
Symbol | thiG |
ID | 4778154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 53333 |
End bp | 54253 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085553 |
Product | thiazole synthase |
Protein accession | YP_001016075 |
Protein GI | 124021768 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAC ATGATGATGG GCACAAGCTA AAGCCAATCG GCTCCACCGG GACACTCCGC AATGACGTTC CAACAACCTC AACAGCGCCA ATGGTTCTGA CGCACTCCTC ACCTGACCTG CTCACAATCG GCAAACGCAA CTTCCACAGC CGACTTTTTG CAGGCACAGG GAAGTACCCA AGCCTAAAGG TGATGCAACA GAGTCTGCGC TCTTCAGGAT GCGAAATGGT GACGGTGGCA GTTCGCCGTG TACAGGCAAT GACATCAGGC CACGCCGGCC TGATCGAAGC CATCGACTGG TCAAAAATAT GGATGCTGCC CAACACCGCT GGCTGTGCAA CTGCAGAGGA AGCCATTCGT GTAGCCCGAC TCGGCCGCGA ACTCGCCAAA CTGGCAGGCC AGGAGGACAA CAAATTCGTC AAGCTCGAAG TCATCCCCGA CAGCCGCCAC CTCCTACCTG ACCCTTTCGG TACCCTTGAG GCCGCTGAAC AATTGGTGAA AGAAGGCTTC GAGGTTTTGC CATACATCAA TGCCGATCCC CTCTTGGCTA AGCGCCTTGA AGAGGTGGGC TGTGCGACGG TCATGCCACT GGGCTCACCG ATTGGATCAG GTCAAGGTCT GAGGAATGCC GCCAACATCT CCCTAATCAT TGAAAACGCC CGAGTGCCAG TCGTTGTGGA TGCTGGCATC GGTGTCCCCA GTGAAGCTGC ACAGGCCCTG GAAATGGGTG CCGACGCAGT GCTGGTGAAC AGCGCAATTG CCTTAGCCGG TGATCCAATC ACCATGGCCG AAGCAATGCG TTGGGCGATT CAGGCCGGTA GGCAGGCTTA CAGATCAGGT CGCCTACCCG AACGCACCGC AGCCTCGCCA AGCTCACCGA CAACAGGGAT TATTACGGAA GCCAAGACTA AACAATTATG A
|
Protein sequence | MEKHDDGHKL KPIGSTGTLR NDVPTTSTAP MVLTHSSPDL LTIGKRNFHS RLFAGTGKYP SLKVMQQSLR SSGCEMVTVA VRRVQAMTSG HAGLIEAIDW SKIWMLPNTA GCATAEEAIR VARLGRELAK LAGQEDNKFV KLEVIPDSRH LLPDPFGTLE AAEQLVKEGF EVLPYINADP LLAKRLEEVG CATVMPLGSP IGSGQGLRNA ANISLIIENA RVPVVVDAGI GVPSEAAQAL EMGADAVLVN SAIALAGDPI TMAEAMRWAI QAGRQAYRSG RLPERTAASP SSPTTGIITE AKTKQL
|
| |