Gene Information |
Locus tag | Cpin_0626 |
Symbol | |
ID | 8356737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 762761 |
End bp | 763870 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644962777 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003120325 |
Protein GI | 256419672 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
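
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that does not appear in the protein. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 762761
end_bp = 763870

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a stop codon,
# which is not translated into an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the Gene Length (1110 bp) and Protein Length (369 aa) fields.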
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGAT TCAAAGACAT TTTTGACCAG CACGACTGGG ACGATATAAA AGTCTCCATT TATGCCAAAA CAGCCCGTGA TGTAGAAGCT GCGCTCTATA GTAACAAACG GACGCTGGAG GACTTTAAGG CGCTGATATC ACCGGCTGCG GCGCCTTATC TTGAACAGAT GGCCCAACTC AGCCGGCAGC TGACGCAACA ACGTTTTGGC AACACCATGC AATTGTACAT ACCGCTGTAC CTCAGTAATG AATGTCAGAA TATCTGTACC TACTGCGGTT TCAGTCTGGA TAACAAGATT GCCAGAAAGA CGCTGAATAA GGACGAAATA CTGCGGGAAG TAGCCGTTAT TAAGGCAATG GGATACGATC ATGTATTGCT GGTAACCGGA GAGGCAAATC AGACGGTAGG GCTGCAGTAT TTCCAGGAGG CACTGGAGAC CATCCGGCCG CATTTTGCGA ACATTTCCAT GGAGGTACAA CCCATGGACG AAGCGGACTA TGCAGCGCTA AAACCTCATG GTTTGCATGG CGTGCTCGTC TACCAGGAAA CCTATCATCA GGCAGATTAT AAGCTGCATC ATCCAAAGGG TAAGAAATCC AACTTCCACT ACCGGCTGGA TACACCGGAC AGGCTCGGCA GAGCAGGTAT CCACAAAATG GGGTTGGGGG TGCTGATCGG ACTGGAAGAC TGGCGTACAG ACAGCTTCTT TACGGCCTTA CACCTGCAAT ACCTGGAAAA GACCTACTGG CAAACAAAGT ACAGTATTTC ATTTCCAAGG TTGCGCCCCT GTTCAGGCGG ATTGCCTCCT AAAGTGGAAA TGAACGACCG GGAACTGGTA CAGCTGATCT GTGCCTACCG GCTACTGGAT CAGGAAGTGG AATTAAGCCT CTCTACCCGC GAAACGCCGA GGTTCCGGGA CAATGTTATC AAACTGGGTA TTACAGCGCT CAGTGCAGGC TCTAAAACGA ATCCCGGAGG GTATGCCACG GATCTGTCTT CCCTGGAGCA ATTTGAGATA TCGGATGACC GTAGTCCGGC GAGCATCGGT GGTATGCTGC GGGCGCAGGG ATATGAGCCG GTCTGGAAGG ATTGGGACGA AGGCTATTAA
|
Protein sequence | MSGFKDIFDQ HDWDDIKVSI YAKTARDVEA ALYSNKRTLE DFKALISPAA APYLEQMAQL SRQLTQQRFG NTMQLYIPLY LSNECQNICT YCGFSLDNKI ARKTLNKDEI LREVAVIKAM GYDHVLLVTG EANQTVGLQY FQEALETIRP HFANISMEVQ PMDEADYAAL KPHGLHGVLV YQETYHQADY KLHHPKGKKS NFHYRLDTPD RLGRAGIHKM GLGVLIGLED WRTDSFFTAL HLQYLEKTYW QTKYSISFPR LRPCSGGLPP KVEMNDRELV QLICAYRLLD QEVELSLSTR ETPRFRDNVI KLGITALSAG SKTNPGGYAT DLSSLEQFEI SDDRSPASIG GMLRAQGYEP VWKDWDEGY
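
The relationship between the two sequences can be spot-checked by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which this sketch does not model). `translate_cds`, `CODON_TABLE`, `gene_prefix`, and `gene_suffix` are hypothetical names, and the two fragments are the first 30 and last 15 nucleotides of the gene sequence above.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence (both in frame).
gene_prefix = "ATGTCAGGAT TCAAAGACAT TTTTGACCAG"
gene_suffix = "GACGA AGGCTATTAA"

print(translate_cds(gene_prefix))  # MSGFKDIFDQ
print(translate_cds(gene_suffix))  # DEGY
```

Both results match the start (MSGFKDIFDQ) and end (DEGY) of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to check the complete 369 aa product.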