Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3075 |
Symbol | thiH |
ID | 3904276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3642878 |
End bp | 3644044 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880396 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_482161 |
Protein GI | 86741761 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCCC CGCCAGGGGA GTTCGCGCGC GAACTTGCCG GTCACGACCT GGCCGGGCTG TCCGCGATGT CGCGGCGGGC CACGTCCGCC GAGGTGGACA CGGTGCTGCG GCGGGTTCGG CCCGGATCCG GTGGGTCGGC CACCCCCCGG ATCGGTCTCG CGGAGCTGGC GATCCTGCTC TCACCCGCGG CCACCGCCCG GTTGGAGGAG CTGGCGATTG CCGCCCGGGA GGCGACCCTG CTCCGGTTCG GTCGGGCGGT CCGGCTGTTC GCCCCGATGT ACGTGTCGAA CGCCTGCCTG TCGTCCTGCA CCTACTGCGG CTTCGCCAAG GGCCTGCCGG TGGTGCGCCG CACGCTGGAC CTTGAGGAGG TGGTGGCGGA GGGACGCCTG CTGGTCGACC GGGGGTTCCG TCATCTGCTG CTCGTCTCCG GCGAGCATCG CGTCGAGGTA TCGGCCGACT ACCTCGTCGC GGTGGTCGAG CGGCTACGGC CGATGTTCCC CTCGATCAGC ATCGAGACCC AGACCTGGTC GGACGACACC TATGCCCGGC TGGTGCGTGC CGGGTTGGAG GGAGTCGTCC ATTACCAGGA GACCTACGAC CGGGCCCGCT ACGCCGAGGT GCACGTCGCC GGCTGGAAGC GTGACTACGA CCGGCGGCTG TCCTCGTTCG AACGGGCCGC GCGGGCCGGG GTGCGCCGGC TCGGGATCGG CGCGCTGCTC GGGCTCGCCC CGGACTGGCG CGCTGACGTG CTCGCGCTGG CCGCGCACGC CTCCTTTCTC TCCCGTCGCT ACTGGCGGAC CGAGGTCTCG ATCGCGCTGC CGCGCATCAA GCCGAGTGCC AGCGGGTTCC CGCCGGTCGT CGTCGTTGAC GACGCGGAGT TCGTCCAGGC GTACGCGGCC CTGCGCCTGT TCGAGCCGGA CCTCGCGCTG TCGCTGTCGA CCCGGGAGCC GGCGGTGCTG CGCGACGGGC TCGTCCGCAT CGCCGTGACC ACCATGAGCG CCGGCTCCTC CACCGAACCC GGCGGCTACA CCACGCCCGG CACGGCCCAG GAACAGTTCT CGATCTCCGA CGAGCGTTCG CCGGCACAGA TCGCCGCGAT GCTCGCCGAA GCCGGGTACG AGCCGGTGTG GAAGGACGCC TTTCCGCTTG TCGACCCGGT GCGGTGA
|
Protein sequence | MSAPPGEFAR ELAGHDLAGL SAMSRRATSA EVDTVLRRVR PGSGGSATPR IGLAELAILL SPAATARLEE LAIAAREATL LRFGRAVRLF APMYVSNACL SSCTYCGFAK GLPVVRRTLD LEEVVAEGRL LVDRGFRHLL LVSGEHRVEV SADYLVAVVE RLRPMFPSIS IETQTWSDDT YARLVRAGLE GVVHYQETYD RARYAEVHVA GWKRDYDRRL SSFERAARAG VRRLGIGALL GLAPDWRADV LALAAHASFL SRRYWRTEVS IALPRIKPSA SGFPPVVVVD DAEFVQAYAA LRLFEPDLAL SLSTREPAVL RDGLVRIAVT TMSAGSSTEP GGYTTPGTAQ EQFSISDERS PAQIAAMLAE AGYEPVWKDA FPLVDPVR
|
| |