Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2726 |
Symbol | |
ID | 4253297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 3254212 |
End bp | 3255666 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 638119361 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_734854 |
Protein GI | 113971061 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00293542 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.227571 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTA TTGTAAAGCT GTACCCAGAA ATCATGATGA AGAGCAAGCC CGTGCGCATG CGCTTCACCA AAATGCTTGA AACCAACATC CGTAACGTGC TCAAAAAAGT TGATGAAGAT GCCAAAGTGC AACGTCAATG GGACCGTATT TGGGTAAAGG TGCCAAATGA TAAACCTGAA TTAGCTCAGG CCTTTGGTGA GCGTTTAGCC TGTATTCCTG GGATCGCCCA TGTGGTGCAA GTGGATGAAT ACAGCTTTAC CTCAGTCGAC GATATCTACC AGCAAGTCTT ACCCGTTTAC CGTGACCAAA TTGCCGGTAA AACCTTCTGT GTGCGCGTCA AACGTACTGG CTCCCACGAT TTTAACTCTA TCGAAGTCGA GCGTTATGTC GGTGGTGGTT TAAACCAGTT TACCGATGCG ATTGGCGTGC GTTTAAAGAA CCCAGAAGTG ACAGTTAACC TCGAAATCGA GGGCGATAAA CTGTATATGG TGACTAAGCG TATCGAAGGC TTAGGCGGCT TCCCAATGGC GACACAGGAA GATGTGTTGT CTTTGATTTC GGGCGGTTTT GACTCAGGCG TGTCGAGCTA CCAATTTATT AAGAAGGGCG CTCGTACCCA TTACTGTTTC TTCAACCTCG GCGGTGCGCA GCATGAAATT GGCGTGAAAC AAGTCGCTTA CCATTTGTGG AAAACCTATG GTGAATCCCA CAAGGTGAAG TTTGTATCTG TGCCGTTCGA GCCTGTGGTG GCCGAGATTT TAGAGAAAAT CGACAACGGT CAAATGGGCG TGGTGCTCAA GCGTATGATG ATGCGCACCG CGGCGCGTAT TGCCGAGCGT ATGGGCATTC AGGCGATTGT GACAGGTGAG AGTTTAGGCC AAGTATCGAG CCAAACTTTA ACCAATTTAA ACGTCATTGA CCGCTGCACC GATATGCTGA TCCTGCGCCC GCTGATCGCC ATGGACAAGC AGGACATCAT CAACGAATGT CGCCGTATCG GTACCGAAGA TTTTGCTAAA TCTATGCCTG AATATTGCGG CGTGATTTCG CAAAAGCCAA CCGTGAAGGC GGTACTGGCC AAGGTTGAGG CCGAGGAGAC TAAATTCTCT GAGGACTTGA TTGACCGTAT CGTTGAGCAG GCCGTGGTCA TTGATATCAG GGAGATAGCA GAACAAATGA ATACGCGTAT CACTGAAACT GAAACCGTTG TTGCTATCGA CACCAATGAA GTGGTGATTG ATATTCGCGC CCCAGAGGAA GAAGAGAACA AGCCGCTAGA GATTGAAGGC GTGGAAATCA AGCGCATTCC TTTCTTCAAA TTAGCGACTC AGTTTGCCGA TCTCGATAAG CAGAAGACTT ACCTGTTGTA CTGTGAGCGT GGTGTGATGA GTAAATTACA GGCGCTATAC CTGATTGAGC AAGGTTATCA TAATGTTAAG GTCTACCGCC CTTAA
|
Protein sequence | MKFIVKLYPE IMMKSKPVRM RFTKMLETNI RNVLKKVDED AKVQRQWDRI WVKVPNDKPE LAQAFGERLA CIPGIAHVVQ VDEYSFTSVD DIYQQVLPVY RDQIAGKTFC VRVKRTGSHD FNSIEVERYV GGGLNQFTDA IGVRLKNPEV TVNLEIEGDK LYMVTKRIEG LGGFPMATQE DVLSLISGGF DSGVSSYQFI KKGARTHYCF FNLGGAQHEI GVKQVAYHLW KTYGESHKVK FVSVPFEPVV AEILEKIDNG QMGVVLKRMM MRTAARIAER MGIQAIVTGE SLGQVSSQTL TNLNVIDRCT DMLILRPLIA MDKQDIINEC RRIGTEDFAK SMPEYCGVIS QKPTVKAVLA KVEAEETKFS EDLIDRIVEQ AVVIDIREIA EQMNTRITET ETVVAIDTNE VVIDIRAPEE EENKPLEIEG VEIKRIPFFK LATQFADLDK QKTYLLYCER GVMSKLQALY LIEQGYHNVK VYRP
|
| |