Gene Shewmr4_2726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2726 
Symbol 
ID4253297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3254212 
End bp3255666 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content48% 
IMG OID638119361 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_734854 
Protein GI113971061 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00293542 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.227571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA TTGTAAAGCT GTACCCAGAA ATCATGATGA AGAGCAAGCC CGTGCGCATG 
CGCTTCACCA AAATGCTTGA AACCAACATC CGTAACGTGC TCAAAAAAGT TGATGAAGAT
GCCAAAGTGC AACGTCAATG GGACCGTATT TGGGTAAAGG TGCCAAATGA TAAACCTGAA
TTAGCTCAGG CCTTTGGTGA GCGTTTAGCC TGTATTCCTG GGATCGCCCA TGTGGTGCAA
GTGGATGAAT ACAGCTTTAC CTCAGTCGAC GATATCTACC AGCAAGTCTT ACCCGTTTAC
CGTGACCAAA TTGCCGGTAA AACCTTCTGT GTGCGCGTCA AACGTACTGG CTCCCACGAT
TTTAACTCTA TCGAAGTCGA GCGTTATGTC GGTGGTGGTT TAAACCAGTT TACCGATGCG
ATTGGCGTGC GTTTAAAGAA CCCAGAAGTG ACAGTTAACC TCGAAATCGA GGGCGATAAA
CTGTATATGG TGACTAAGCG TATCGAAGGC TTAGGCGGCT TCCCAATGGC GACACAGGAA
GATGTGTTGT CTTTGATTTC GGGCGGTTTT GACTCAGGCG TGTCGAGCTA CCAATTTATT
AAGAAGGGCG CTCGTACCCA TTACTGTTTC TTCAACCTCG GCGGTGCGCA GCATGAAATT
GGCGTGAAAC AAGTCGCTTA CCATTTGTGG AAAACCTATG GTGAATCCCA CAAGGTGAAG
TTTGTATCTG TGCCGTTCGA GCCTGTGGTG GCCGAGATTT TAGAGAAAAT CGACAACGGT
CAAATGGGCG TGGTGCTCAA GCGTATGATG ATGCGCACCG CGGCGCGTAT TGCCGAGCGT
ATGGGCATTC AGGCGATTGT GACAGGTGAG AGTTTAGGCC AAGTATCGAG CCAAACTTTA
ACCAATTTAA ACGTCATTGA CCGCTGCACC GATATGCTGA TCCTGCGCCC GCTGATCGCC
ATGGACAAGC AGGACATCAT CAACGAATGT CGCCGTATCG GTACCGAAGA TTTTGCTAAA
TCTATGCCTG AATATTGCGG CGTGATTTCG CAAAAGCCAA CCGTGAAGGC GGTACTGGCC
AAGGTTGAGG CCGAGGAGAC TAAATTCTCT GAGGACTTGA TTGACCGTAT CGTTGAGCAG
GCCGTGGTCA TTGATATCAG GGAGATAGCA GAACAAATGA ATACGCGTAT CACTGAAACT
GAAACCGTTG TTGCTATCGA CACCAATGAA GTGGTGATTG ATATTCGCGC CCCAGAGGAA
GAAGAGAACA AGCCGCTAGA GATTGAAGGC GTGGAAATCA AGCGCATTCC TTTCTTCAAA
TTAGCGACTC AGTTTGCCGA TCTCGATAAG CAGAAGACTT ACCTGTTGTA CTGTGAGCGT
GGTGTGATGA GTAAATTACA GGCGCTATAC CTGATTGAGC AAGGTTATCA TAATGTTAAG
GTCTACCGCC CTTAA
 
Protein sequence
MKFIVKLYPE IMMKSKPVRM RFTKMLETNI RNVLKKVDED AKVQRQWDRI WVKVPNDKPE 
LAQAFGERLA CIPGIAHVVQ VDEYSFTSVD DIYQQVLPVY RDQIAGKTFC VRVKRTGSHD
FNSIEVERYV GGGLNQFTDA IGVRLKNPEV TVNLEIEGDK LYMVTKRIEG LGGFPMATQE
DVLSLISGGF DSGVSSYQFI KKGARTHYCF FNLGGAQHEI GVKQVAYHLW KTYGESHKVK
FVSVPFEPVV AEILEKIDNG QMGVVLKRMM MRTAARIAER MGIQAIVTGE SLGQVSSQTL
TNLNVIDRCT DMLILRPLIA MDKQDIINEC RRIGTEDFAK SMPEYCGVIS QKPTVKAVLA
KVEAEETKFS EDLIDRIVEQ AVVIDIREIA EQMNTRITET ETVVAIDTNE VVIDIRAPEE
EENKPLEIEG VEIKRIPFFK LATQFADLDK QKTYLLYCER GVMSKLQALY LIEQGYHNVK
VYRP