Gene Information |
Locus tag | Spro_1081 |
Symbol | |
ID | 5606831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 1189053 |
End bp | 1190501 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640936600 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001477313 |
Protein GI | 157369324 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
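The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values in this record). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1189053
END_BP = 1190501


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 1449 bp
print(protein_length(length_bp), "aa")  # 482 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.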
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00012217 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AGAGCCAATC TGTGCGCTTG CGCTTTATCA AGATCCTCTC GACCAGTATT CGCAACGTCC TGAAGCAGTA TGATGAAACA CTGGCGGTTG TCCGTCACTG GGATCATATC GAAGTTCGCG CCAAAGATGA AAACCAGCGG CCGGTGATTG CTGACGCGCT GACGCGCATT CCCGGTATCC ACCACATTCT GGAAGTGGAA GACCGCGACT ATACCGATAT CCACCATATT TTCGAGCAGA CGCTGGAAGC CTACCGCGAG CAACTGGAAG GCAAGACCTT CTGCGTGCGC GTTAAGCGCC GCGGCAAGCA GGACTTCAAT TCGCAGGACG TGGAGCGTTA CGTCGGTGGC GGTCTGAACC AGCATATTGA AAGCGCGCGC GTTAAGCTGT CGCATCCGCA GGTGACGGTT AACCTGGAAA TCGAAAACGA CAAGCTGATG CTGGTGAAAG CCCGCCGTGA AGGCATCGGC GGCTACCCGG TAGGCACCCA GGAAGACGTG CTGTCGCTGA TTTCCGGTGG TTTCGACTCG GGTGTTTCCA GTTATATGCT GATGCGCCGC GGTTGCCGTG TGCATTATTG CTTCTTCAAT CTGGGTGGCG CGGCGCATGA AATTGGCGTG CGCCAGGTAG CTCACTATCT GTGGAACCGC TTTGCCAGTT CGCACAAGGT GCGCTTTATC GCCATCGATT TTGAACCTGT GGTTGGCGAG ATCCTGGAAA AGGTCGAAGA CGGCCAGATG GGCGTGGTGC TCAAGCGCAT GATGGTGCGT GCCGCTTCGC AGATTGCCGA ACGTTATGGC GTGCAGGCAT TGGTGACCGG TGAAGCGCTG GGGCAGGTAT CCAGCCAGAC GCTGACTAAC CTGCGCCTGA TCGACAACGC GTCCGATACG CTGATCCTGC GTCCGCTGAT CTCCCACGAC AAAGAGCACA TCATCAAAGT GGCGCGTGAG ATCGGCACCG AGGACTTCGC CAAGACCATG CCGGAGTACT GCGGTGTGAT CTCGAAAAGC CCGACGGTGA AGGCGATTAA AGCCAAGATC GAAGAAGAAG AAGGGCACTT TGATTTCAGC ATTCTTGATC GCGTGGTTAG CGAAGCCAAG AACGTGGATA TTCGCACCAT CGCCGAGCAG ACTCAGGAGC AGGTTACCGA AGTCGAAACC GTGGCGGAGT TCGATGCTGA CCAGGTGATT CTGGATATTC GTTCTAACGA CGAGCAGGAA GATAAACCGC TGAAGCTGGA TCAGGTTGAG GTGAAACCGC TGCCGTTCTA CAAGCTCAGC ACCCAGTTTG GCGATTTGGA TCAGAGCAAA ACTTACCTGC TGTACTGTGA GCGCGGCGTG ATGAGCCGCC TGCAGGCGCT GTACCTGCTG GAGCAGGGGT TCACCAACGT GAAGGTTTAC CGCCCATAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILSTSI RNVLKQYDET LAVVRHWDHI EVRAKDENQR PVIADALTRI PGIHHILEVE DRDYTDIHHI FEQTLEAYRE QLEGKTFCVR VKRRGKQDFN SQDVERYVGG GLNQHIESAR VKLSHPQVTV NLEIENDKLM LVKARREGIG GYPVGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FASSHKVRFI AIDFEPVVGE ILEKVEDGQM GVVLKRMMVR AASQIAERYG VQALVTGEAL GQVSSQTLTN LRLIDNASDT LILRPLISHD KEHIIKVARE IGTEDFAKTM PEYCGVISKS PTVKAIKAKI EEEEGHFDFS ILDRVVSEAK NVDIRTIAEQ TQEQVTEVET VAEFDADQVI LDIRSNDEQE DKPLKLDQVE VKPLPFYKLS TQFGDLDQSK TYLLYCERGV MSRLQALYLL EQGFTNVKVY RP
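The protein sequence can be derived from the gene sequence with translation table 11, as listed in the record. The sketch below is a minimal translator, not the database's own pipeline. It relies on the fact that the sense-codon assignments of table 11 are the same as those of the standard code; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAGTTTA TCATTAAATT GTTCCCGGAA"))  # MKFIIKLFPE
print(translate("CGCCCATAA"))                         # RP
```

The two fragments reproduce the first ten residues (MKFIIKLFPE) and the last two residues (RP) of the listed protein sequence; the final TAA codon is the stop.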