Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1836 |
Symbol | thiG |
ID | 5670238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2205245 |
End bp | 2206156 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240757 |
Product | thiazole synthase |
Protein accession | YP_001506180 |
Protein GI | 158313672 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.138321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.169503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGC GGGTCACCGA GGTCCGACGC GAGCCGGACA AGTCCGATCA TCCGGAGTCC GATCATTCGG AGTCCGATCA TCCGGAGCGC GGTCATCCCG ACCCGTTCCG GATCGCCGGC ACCGTCTACG CCAGCCGGCT CCTCGTCGGC ACCGGCAAGT TCGCGAGCCA TCCGGTCATG CGCGACAGCC TGGTCGCCTC GGGGGCGGAC ATCGTCACCG TCGCCCTGCG CCGGGTCGAC CTGAGCCGCG CGGGGGAGGG CGACGTGCTC GACTTCGTCC CGGCCGGCAT GACGCTGCTG CCGAACACCT CCGGCGCGCA GGACGCGGCC GAGGCGCTGC GGCTGGCCCG GCTCGGCCGC GCGGCGACCG GGACGTCCCT GGTGAAGCTG GAGGTCACGC CGGATCCGCG CACCCTCGCG CCGGACCCGA TCGAGACGCT GCGCGCCGCC GAGCTGATGG TCGCCGACGG GTTCACCGTG CTCCCGTACT GCTCGGCCGA CCCGGTGCTG GCACGCCGGC TCGAGGAGGC CGGCTGCGCC ACGGTGATGC CGCTGGGTAG CTGGATCGGT TCCAACCGCG GCCTGCGCAC CCGCGACGCG ATCGAGGCGA TCGTGGAGAC CGCCGGGGTC CCGGTGGTGG TGGACGCCGG CATCGGCGCG CCCTCCGACG CCGCCGAGGC GATGGAGATC GGGGCGGACG CGGTGCTCGT CAACACGGCG ATCGCGATCG CCGCCGACCC GGTCGCGATG GCCCGGGCCT TCGCGCTCGC GACCATCGCC GGGCGGATGG CCCACCTCGC CGGCAGGCCG CGGGCGGGCA GCGCCACCGT GGCCGAGGCG TCCTCTCCGC TCACCGGTTT CCTGGGCGCG GTACCCGGCG GCCTGCCCGG TCTGCCCGGC GGGGGCGGCT GA
|
Protein sequence | MTQRVTEVRR EPDKSDHPES DHSESDHPER GHPDPFRIAG TVYASRLLVG TGKFASHPVM RDSLVASGAD IVTVALRRVD LSRAGEGDVL DFVPAGMTLL PNTSGAQDAA EALRLARLGR AATGTSLVKL EVTPDPRTLA PDPIETLRAA ELMVADGFTV LPYCSADPVL ARRLEEAGCA TVMPLGSWIG SNRGLRTRDA IEAIVETAGV PVVVDAGIGA PSDAAEAMEI GADAVLVNTA IAIAADPVAM ARAFALATIA GRMAHLAGRP RAGSATVAEA SSPLTGFLGA VPGGLPGLPG GGG
|
| |