Gene Swoo_3473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwoo_3473 
Symbol 
ID6117805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella woodyi ATCC 51908 
KingdomBacteria 
Replicon accessionNC_010506 
Strand
Start bp4247702 
End bp4249156 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content44% 
IMG OID641635026 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001761834 
Protein GI170727808 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0256811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000477485 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTTA TCGTAAAACT GTTCCCTGAA ATCATGATGA AAAGCAAACC GGTAAGAATG 
CGATTTACCA AGATGCTTGA AACCAATATT CGTAATGTAC TTAAGAAGGT CGATGAGACC
GCTAAAGTGA AGCGTGAGTG GGACAAAATC ATGGTGTTGG TGCCAGACGA TAGACAAGAC
TTAGTGGAGG CTTTCGCTGA GCGTCTTGCT TGTATTCCTG GTATCGCTCA TGTGCTTCAG
GTAAGCGAGA GCACATTCGA ATCGGTTGAT GATATCTATC AACAAACTTT AGCGGTTTAT
AAAGATGAAC TTGCGGGTAA AACCTTTTGT GTGCGGGTTA AGCGTGTCGG TAATCATGAT
TTTAGATCCA TCGAAGTTGA GCGTTATGTC GGTGGTGGGC TAAATCAGTT TACTGAGGCG
GCAGGTGTTA GATTAAAAAA TCCTGATATG ACGATTAATC TTGAGATAGA TAAAGAGAGT
CTATACCTAG TAAGTAAACG CATTGAGGGC TTAGGTGGTT ACCCTATGGC GACTCAGGAA
GATGTATTGT CGCTTATCTC TGGCGGATTT GATTCAGGTG TTTCCAGTTA TCAGTTTATT
AAACGTGGCT CTCGTACCCA CTACTGCTTC TTCAACTTAG GTGGCGATCA GCATGAGATA
GGTGTGAAGC AGGTGGCTTA TCATCTATGG CAGAAATATG GTGAGTCCCA TAAGGTCAAG
TTTATCTCAA TCCCCTTTGA TCCTGTGGTA ACAGAGATCC TTGAGAAGAT AGATAATGGC
CAGATGGGGG TTATTCTCAA GCGTATGATG ATGCGAGCTG CGGCTAAAGT TGCCAGAAAG
ATGGGAATAC AAGCGTTAGT AACTGGTGAG GCGATGGGGC AAGTATCAAG CCAAACCTTG
ACTAACCTAA GCATCATCGA CCGATGCACA GATCAGCTTA TTCTACGTCC ACTCATCGCC
ATGGATAAGC AGGACATTAT TAACTTAAGT CGTAAAATTG GTACAGAAGA TCTCTCTAAG
TCTATCCCCG AGTATTGCGG TGTGATCTCT CAGCGCCCAA CGGTTAAGGC TGTACTCTCT
AAGATTGAAG CTGAGGAGCT GAAGTTTTCT GAAGACCTTA TTGACCGTGT TATTGAAGCT
GCTGAGGTTA TCGATATCAG AGAGATAGCT ACTAGTATGG ACACTAAGAT CACCTCAACG
GAAACCGTTG GCGATATTAA CTCAGGTGAA GTGATCATCG ACGTTCGTGC TCCAGAAGAG
GAGGAGCAGT CACCGCTTGA GGTTGAGGGC GTTGAAGTGA AGGCGATCCC TTTCTTTAGA
TTAGCGACTA AGTTTGCCGA TTTGGATAAG AGTAAAACTT ATCTGCTTTA CTGCGACAGA
GGCGTGATGA GTAAGCTTCA GGCGCTATAC CTTCAAGAAC AAGGCTATGA TAACGTTAAA
GTTTACCGCC CTTAA
 
Protein sequence
MKFIVKLFPE IMMKSKPVRM RFTKMLETNI RNVLKKVDET AKVKREWDKI MVLVPDDRQD 
LVEAFAERLA CIPGIAHVLQ VSESTFESVD DIYQQTLAVY KDELAGKTFC VRVKRVGNHD
FRSIEVERYV GGGLNQFTEA AGVRLKNPDM TINLEIDKES LYLVSKRIEG LGGYPMATQE
DVLSLISGGF DSGVSSYQFI KRGSRTHYCF FNLGGDQHEI GVKQVAYHLW QKYGESHKVK
FISIPFDPVV TEILEKIDNG QMGVILKRMM MRAAAKVARK MGIQALVTGE AMGQVSSQTL
TNLSIIDRCT DQLILRPLIA MDKQDIINLS RKIGTEDLSK SIPEYCGVIS QRPTVKAVLS
KIEAEELKFS EDLIDRVIEA AEVIDIREIA TSMDTKITST ETVGDINSGE VIIDVRAPEE
EEQSPLEVEG VEVKAIPFFR LATKFADLDK SKTYLLYCDR GVMSKLQALY LQEQGYDNVK
VYRP