Gene Sbal223_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2998 
Symbol 
ID7089066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3539906 
End bp3541360 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content45% 
IMG OID643461883 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002358907 
Protein GI217974156 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.174595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000707392 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTTTA TTGTAAAGCT GTACCCAGAA ATCATGATGA AGAGCAAACC CGTAAGGATG 
CGCTTCACCA AAATGCTTGA AACCAACATA CGTAATGTGC TCAAAAAAGT CGACGAAGAC
GCTAAAGTTC AACGTCAGTG GGACCGCATT ATGGTTATGG TACCAAAGGA TAAGCCTGAA
TTGGCCCAAG CCTTCGGTGA GCGTCTGGCG TGTATTCCTG GCATTGCCCA CGTAGTGCAA
GTGGATGAGT ACAGCTTTGA ATCTGTGGAC GATATTTATC AGCAAGTCTT GCCTGTTTAC
CGTGACCAAC TCGCGGGTAA AACCTTCTGC GTGCGAGTGA AGCGCACTGG CGATCACGAT
TTTAATTCTA TCGAAGTGGA GCGTTACGTC GGCGGCGGCT TAAATCAGTT TACCGATGCT
TTAGGTGTGC GTTTAAAGAA TCCTGATATT ACTGTTAACC TTGAAATTGA ACGCGATAAT
CTTTACATGG TCACTAAGCG TATTGAAGGC TTAGGTGGTT TCCCGATGGC GACACAGGAA
GATGTGTTGT CTTTGATTTC AGGTGGTTTT GACTCTGGTG TGTCTAGCTA TCAATTTATC
AAAAAGGGCG CGCGTACCCA TTACTGCTTC TTCAATCTGG GCGGCGCACA GCATGAAATT
GGCGTGAAGC AAGTGGCTTA CCATTTGTGG AAAACCTACG GCGAATCCCA CAAGGTGAAG
TTTATCTCTG TGCCTTTTGA GCCTGTGGTC GCTGAGATTT TAGAGCGTAT CGATAATGGC
CAAATGGGCG TTGTGCTTAA GCGTATGATG ATGCGCACCG CGGCGCGTAT TGCCGACCGT
ATGGGCATTC AAGCCTTAGT AACCGGTGAA AGTTTAGGTC AAGTTTCTAG CCAAACGCTG
ACCAACTTAA ACGTGATTGA CCGCTGTACT GAGCTGCTGA TTTTACGTCC GCTGATTGCC
ATGGATAAAC AAGACATTAT CAACGAGAGC CGTAAAATCG GCACCGAAGA TTTTGCCAAG
TCTATGCCTG AATATTGTGG CGTGATTTCG CAAAAGCCAA CGGTTAAAGC GGTATTAGCT
AAAGTAGAAG CGGAAGAGAA GAAGTTCTCT GAAGATTTGA TCGATCAAAT CATCGCCCAG
TCTGTCACTA TTGATATCAG GGAGATCGCA GAACAAATGG ATACTCGGAT TACTGAGACT
GAAACGGTCG CCAGTATTGA TACCAACCAA GTGGTCATTG ATATTCGCGC CCCTGAAGAA
GAGGAAAGCA AACCGTTGCA AATTGAAGGC ATTGAGATTA AACGTATCCC ATTCTTCAAG
TTAGCGACTC AATTTGCCGA TCTCGATAAG CAAAAGACTT ATTTGCTGTA TTGTGAGCGT
GGGGTGATGA GTAAATTGCA GGCATTGTAT TTAATCGAGC AAGGCTACAC TAACGTCAAA
GTGTATCGCC CATAA
 
Protein sequence
MKFIVKLYPE IMMKSKPVRM RFTKMLETNI RNVLKKVDED AKVQRQWDRI MVMVPKDKPE 
LAQAFGERLA CIPGIAHVVQ VDEYSFESVD DIYQQVLPVY RDQLAGKTFC VRVKRTGDHD
FNSIEVERYV GGGLNQFTDA LGVRLKNPDI TVNLEIERDN LYMVTKRIEG LGGFPMATQE
DVLSLISGGF DSGVSSYQFI KKGARTHYCF FNLGGAQHEI GVKQVAYHLW KTYGESHKVK
FISVPFEPVV AEILERIDNG QMGVVLKRMM MRTAARIADR MGIQALVTGE SLGQVSSQTL
TNLNVIDRCT ELLILRPLIA MDKQDIINES RKIGTEDFAK SMPEYCGVIS QKPTVKAVLA
KVEAEEKKFS EDLIDQIIAQ SVTIDIREIA EQMDTRITET ETVASIDTNQ VVIDIRAPEE
EESKPLQIEG IEIKRIPFFK LATQFADLDK QKTYLLYCER GVMSKLQALY LIEQGYTNVK
VYRP