Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0279 |
Symbol | thiH |
ID | 5607013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 322193 |
End bp | 323326 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640935778 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001476517 |
Protein GI | 157368528 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00544631 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGATG ATTTCAGTAG CCGCTGGCAA CAGTTGGATT GGGACGATAT CTCGCTGCGT ATTAACAGTA AAACGGCGCG TGATGTCGAG CATGCGCTGA ATGCGGAAAA ACTGTCGCGA GAAGACTTTA TGGCACTGAT TTCCCCTGCC GCCGCGTCCT ACCTGGAGCC ACTGGCGCAG CGTGCGCAGC AGATGACCCG TCAACGTTTT GGCAACGTGG TTAGCTTCTA CGTGCCGCTG TACCTGTCCA ATCTGTGCGC CAACGACTGC ACCTACTGCG GCTTTTCGAT GAGCAACCGC ATCAAGCGTA AAACCCTGGA TGCCGCCGAG ATCGAACGTG AGTGTCTGGC GATTAAGGCG TTGGGCTTTG AGCATTTGCT GTTGGTCACC GGGGAACACC AGACCAAGGT CGGCATGGAC TATTTCCGCC AACACATTCC AGCGATCCGC CGCCATTTCA GCTCGCTGAT GATGGAAGTA CAACCGCTGG AGCAGCAAGA GTACGCCGAA TTAAAGGCTC TGGGGTTGGA TGGCGTGCTG GTGTATCAGG AAACCTATCA TCCCGCGACC TACCTGCAGC ATCATCTGCG CGGTCAGAAA CAGGACTTTC ACTGGCGGTT GGCCACACCG GATCGCCTGG GCCGCGCCGG GATCGACAAG ATCGGCCTCG GGGCCCTGAT CGGGCTTTCC AACAGTTGGC GTACCGACTG CTACATGCTG GCTGAACACC TGTTCTACCT GCAACAGACT TACTGGCAGA GCCGCTACTC GATCTCGTTC CCGCGCCTGC GCCCTTGTGC CGGAGGTATC GAACCGGCGT CGATCATGAG TGAACCGCAA TTGGTGCAGC TAATCTGTGC CTTCCGGCTG TTCGCCCCCG ACGTGGAGCT GTCTTTGTCG ACGCGTGAGT CGCCGTTTTT CCGCGATCAT ATGATCCCGG TGGCGATCAA CAGCGTCAGC GCCGGCTCCA AAACCCAGCC TGGCGGCTAT GCCGACGATG TGCCGCCGGA GCTGGAACAG TTTGAACCGC ACGATGGCCG TACCCCACAG CAGGTGGCAC AAGCCATCAG CGACGCTGGT TTACAGCCGG TGTGGAAAGA CTGGGACGGA TACCTGGGGC GCAGCCCGCA GTAA
|
Protein sequence | MADDFSSRWQ QLDWDDISLR INSKTARDVE HALNAEKLSR EDFMALISPA AASYLEPLAQ RAQQMTRQRF GNVVSFYVPL YLSNLCANDC TYCGFSMSNR IKRKTLDAAE IERECLAIKA LGFEHLLLVT GEHQTKVGMD YFRQHIPAIR RHFSSLMMEV QPLEQQEYAE LKALGLDGVL VYQETYHPAT YLQHHLRGQK QDFHWRLATP DRLGRAGIDK IGLGALIGLS NSWRTDCYML AEHLFYLQQT YWQSRYSISF PRLRPCAGGI EPASIMSEPQ LVQLICAFRL FAPDVELSLS TRESPFFRDH MIPVAINSVS AGSKTQPGGY ADDVPPELEQ FEPHDGRTPQ QVAQAISDAG LQPVWKDWDG YLGRSPQ
|
| |