Gene Information |
Locus tag | Franean1_1837 |
Symbol | thiH |
ID | 5670239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2206156 |
End bp | 2207322 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240758 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001506181 |
Protein GI | 158313673 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
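The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates of Franean1_1837 as listed in the record above.
print(feature_lengths(2206156, 2207322))  # (1167, 388)
```

Under these assumptions the result matches the listed Gene Length (1167 bp) and Protein Length (388 aa).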
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.151757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.173638 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCC CGGCAGGGCT GTTCGCCCGC GAGCTCGCCG CGCTCGACAT CCCGGCGCTC GCCCGTGTCT CGGTCGAGGC CGACGAGGCG CGGGTCGACG CCGTCCTGCG CCGGGCCGTC GCTGCCGGGC GGCCCGACGC CGGCGGCCGG CTCGACCTCG CCGACCTCGC CGTCCTGCTG TCGCCGGCCG CGACCGGGCG GCTGGAGGAG CTGGCGCAGG CGGCGCGGGA GACGACGCTG CGCCGGTTCG GCCGGGCGGT GCGGCTGTTC GCCCCGCTGT ACGTGTCGAA CGCCTGCCTG TCGTCCTGCA CCTACTGCGG GTTCGCCAAG GGGCTGGAGG TGGCCCGGCG CACCCTGACG GTCGACGAAG CCGAGGCCGA GGCACGCCTG CTGGCCGACC GCGGCTTCCG GCACATCCTG CTGGTCTCCG GGGAGCACCG CGTCGAGGTC TCCGCCGGGT ACCTGGTGGA CGTCGTCGAG CGGCTGCGAC CGTTCGTCCC CTCGATCTCG GTGGAGACCC AGACCTGGTC GGACGACACC TACAGCCGGC TGGTCGTGGC CGGGCTCGAC GGCGTCGTCC ACTACCAGGA GACCTACGAC CGGGAGCGCT ACGCGCAGGT GCACGTGGCC GGGTGGAAGC GCGACTACGA CCGCCGGCTG TCCTCCTTCG AGCGGGCGGC CCGCGCCGGC GCCCGCCGTC TGGGCCTCGG CGTCCTGCTC GGCCTGGCGC CGGACTGGCG GGCCGACGTC CTCGCGCTCG CCGCGCACGC CTCGTTCCTC GCCCGCCGCT TCTGGCGGAC GGAGGTCTCG GTGGCGCTGC CGAGGATCAA GCCGAGCGCC AGTGGCTTCC CACCGACCGT CGTCGTCGGC GACGCCGAGT TCGTCCAGGC GCACGCGGCG CTGCGGCTGT TCGAACCGGA CGCGGCGATC TCGCTGTCGA CCCGCGAGCC GGCGGCCCTG CGTGACGGCC TGGTCCGCAT CGCGGTGACC ACGATGAGCG CCGGCTCGTC CACCGAGCCA GGTGGGTACG GGCGGCCCGG GACGGCGCAG GAGCAGTTCT CCATCTCCGA CGAGCGATCC CCGGCGGACG TCGCCGCGAT GCTCGTCGGC GCCGGCTACG AGCCTGTCTG GAAGGACGCG TTCCCGCTGG TCGACGCCGC CGGCTGA
|
Protein sequence | MASPAGLFAR ELAALDIPAL ARVSVEADEA RVDAVLRRAV AAGRPDAGGR LDLADLAVLL SPAATGRLEE LAQAARETTL RRFGRAVRLF APLYVSNACL SSCTYCGFAK GLEVARRTLT VDEAEAEARL LADRGFRHIL LVSGEHRVEV SAGYLVDVVE RLRPFVPSIS VETQTWSDDT YSRLVVAGLD GVVHYQETYD RERYAQVHVA GWKRDYDRRL SSFERAARAG ARRLGLGVLL GLAPDWRADV LALAAHASFL ARRFWRTEVS VALPRIKPSA SGFPPTVVVG DAEFVQAHAA LRLFEPDAAI SLSTREPAAL RDGLVRIAVT TMSAGSSTEP GGYGRPGTAQ EQFSISDERS PADVAAMLVG AGYEPVWKDA FPLVDAAG
|
| |
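The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be re-derived from the gene sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written out by hand in NCBI order; translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts, which this sketch does not model (the gene here begins with ATG, so that does not matter for this record).

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_30 = "ATGGCCAGCC CGGCAGGGCT GTTCGCCCGC"
print(translate(first_30))  # MASPAGLFAR
```

The ten residues obtained from the first thirty bases agree with the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.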