Gene Information |
Locus tag | Francci3_3076 |
Symbol | thiG |
ID | 3904277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3644041 |
End bp | 3645054 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637880397 |
Product | thiazole synthase |
Protein accession | YP_482162 |
Protein GI | 86741762 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis |
TIGRFAM ID | |
|
|
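The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 3644041
END_BP = 3645054


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues for an unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1014
print(protein_length(length))  # 337
```

Both results match the Gene Length (1014 bp) and Protein Length (337 aa) fields. The strand field does not affect this calculation, since the length depends only on the interval.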
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCAGC ACCCCCTACG TCCTGCCGGT TCCCGTCCCG GCGATTCTCC CCCCGACGGT TCCTGTCCCG ATGGCCTCGC CGGTGGAGGT TCCGCCGTCG GCGGCGGGGG TGGTGGTGAG GCCGGACCCG GTGCCGCGGG GGCGGACGAT CTCGTCATCG CCGGGCGGGC CTACGGCAGC CGGCTGCTGA TCGGCACCGG GAAGTTCGCG AGTCACCGGG TGATGCGGGA CAGCCTCGTC GCATCCGGTG CCACCATCGT CACCGTGGCG CTGCGCCGGG TCGACATCGG CCGGATCGGT GACGGCGACG TTCTCGACTT CATCCCGCCG TCGATGACCC TGCTGCCCAA CACCTCGGGC GCGGTGGACG CGGCCGAGGC GCTGCGGCTG GCCCGCCTCG GGCGGGCGGC GACGGGCACC GGTCTGGTCA AGCTGGAGGT CACCCCGGAT CCGCGCACCC TGGCGCCGGA TCCGCTGGAG ACCCTGCGCG CCGCCGAGCT CATGGTGGCC GACGGCTTCA CGGTCCTGCC CTACTGTTCG GCCGACCCGG TGCTGGCCCG CCGGCTGGAG GAGGTCGGCT GCGCCACCGT GATGCCGCTG GGCAGCTGGA TCGGCTCGAA CCGCGGGCTG CGCACCCGGG ACGCGATCGA GGCAATCGTG GCGAGCGCCG GGGTGCCGGT GGTCGTGGAC GCCGGGCTCG GGGTGCCCTC GGACGCGGCC GAGGCGATGG AGATCGGCGC GGCCGCGGTG CTGGTCAACA CCGCTATCGC GGTGGCCGCC GATCCGGCGC GCATGGCGCG GGCCTTCGCG CTCGCCACCG CCGCCGGCCG GTTGGGTTTC CTGGCCGGCC GAGGGGCTGC CGGCCCGGCC ACGGTGGCGT CCGCCTCCTC CCCGCTCACC GGGTTCCTCG GCGCCCACCC GTCACCGGCG TCCCACCCGT CACCGGCGTC GCCCGTGCCC TCGGTGTCCC GGGCGACGTC CCCGGCTGCG GTGGTGGGCG AGGCGTCCAG GTGA
|
Protein sequence | MSQHPLRPAG SRPGDSPPDG SCPDGLAGGG SAVGGGGGGE AGPGAAGADD LVIAGRAYGS RLLIGTGKFA SHRVMRDSLV ASGATIVTVA LRRVDIGRIG DGDVLDFIPP SMTLLPNTSG AVDAAEALRL ARLGRAATGT GLVKLEVTPD PRTLAPDPLE TLRAAELMVA DGFTVLPYCS ADPVLARRLE EVGCATVMPL GSWIGSNRGL RTRDAIEAIV ASAGVPVVVD AGLGVPSDAA EAMEIGAAAV LVNTAIAVAA DPARMARAFA LATAAGRLGF LAGRGAAGPA TVASASSPLT GFLGAHPSPA SHPSPASPVP SVSRATSPAA VVGEASR
|
| |
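The sequence fields can be sanity-checked in a similar way. The sketch below is a hedged illustration, not the database's own method: it strips the display spacing between 10-base blocks, computes a GC fraction, and tests whether a sequence begins with a start codon and ends with a stop codon from NCBI translation table 11. Whether the GC content field in the record was computed over exactly this gene sequence is an assumption, and the helper names are hypothetical.

```python
# Start and stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are a table 11 start codon and stop codon respectively."""
    seq = clean(seq)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in TABLE11_STOPS
    )


# The first two 10-base blocks of the gene sequence in the record.
prefix = "GTGTCGCAGC ACCCCCTACG"
print(round(gc_fraction(prefix), 2))  # 0.7
```

To apply this to the full record, pass the complete Gene sequence value to `gc_fraction` and `looks_like_cds`. The record's sequence begins with GTG and ends with TGA, which is consistent with a table 11 coding sequence whose initiator codon is translated as methionine.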