Gene Sbal_2023 details


Gene Information

Locus tag: Sbal_2023
Symbol:
ID: 4844261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS155
Kingdom: Bacteria
Replicon accession: NC_009052
Strand:
Start bp: 2342995
End bp: 2344947
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 48%
IMG OID: 640119242
Product: thiamine-phosphate pyrophosphorylase
Protein accession: YP_001050394
Protein GI: 126174245
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
        [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
            [TIGR00693] thiamine-phosphate pyrophosphorylase
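
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The constant and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 2342995
END_BP = 2344947
GENE_LENGTH_BP = 1953
PROTEIN_LENGTH_AA = 650


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2344947 - 2342995 + 1 = 1953 bp, and 1953 / 3 = 651 codons, one of which is the stop codon.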


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGGTA TTCATGGGGA TAATGCGCCC ATGAATACCG AGCATCCCGC GTTTGTTTGG 
ACCATAGCTG GTTCAGATAG TGGCGGTGGT GCCGGTATTC AGGCGGACTT AGCGACAATT
CAAGATTTGG GCTGTCACGG TTGCAGTGTG GTCACCACAG TGACGGCGCA AAGTTCTGTG
GCGGTCACTT TAGTCGAGCC CGTATCAGCG GCGATGTTAA TGGCTCAGCT GACGACTTTA
CTTTCCGACT TACCACCCAA AGCAATTAAA ATCGGCTTAC TGGCAGATCA AACTCAAGTG
GCATTGCTGG CAGATTGGAT CGCGAGTTTT AAAATCCACT ATCCATCTGT GCCTGTGATT
GTCGATCCTG TGATGGTCGC CAGTTGTGGC GATGCATTAG CAGTAGATAA CTGTCAGGAT
ATAAAAAGTG CGGCTAAATC AGCCTTAGAT TTTAAGCCTT TCAAAGGTTT AATCGAACTT
ATCACGCCCA ATGTGCTTGA ACTTGGGCGG TTAACTCACA GTGATGTTTC AACGAAAGCG
CAATTCGCTG CCGCGGCACT GGCCTTATCC CAGAGCCTTG ATTGCAGTGT GCTCGCCAAA
GGGGGCGATG TGAGCTTTGG CAGCACTGAC ATTCTTGATG ATACTCATGC TCAAACTCAC
GATAACACTT ATGCTCAAAC TCAGGCTAAC GTTCATGTTA TCGCTCTTGA TAGCAACGGC
TGGGACCTTG AGCTTGCCGA GGATTATCTA GTTTGTCGTC AAGTGCGCGC GAGCTCTAAA
CTACATCAAA ATGGGCGTTT CTGGTTAGCA AGCCAGCGGG TTAATACCCC TCATAACCAT
GGTAGCGGCT GTACTTTGTC ATCGGCCATT GCCGCCGTGT TAGCGCAGGG GTTTGTATTA
CAAGATGCGG TGGTGGTTGC CAAAGCCTAC GTGAGTCAAG GCTTAAGCGC GGCGATTGGT
TTAGGGCAAG GTCCAGGGCC GTTGGCCCGT ACGGGTTGGC CTAATGATGT GTCCCGCTAT
GCCAAGATAA ATCTGTGTGA TAGCAATTTT ATTAGTCATC AACTCAACCA ACACCTTGAT
GTTGGTAATG ATTTAGTTGC AACAGTTTTA TCCGCAACAG ATCAGGCAAC CGCTCAGGTA
AGAATAGCCT CGACGCAACC TCAAAATATT TTATCCCACG GTTTTAAAGT GCTCGATGCC
GATCTTGGTG TTTACCCCGT AGTGAATGAC TTAACCATGC TGGAGAGTTT GTTAGCTGCG
GGCGTTAAAA CCCTGCAGTT ACGGATAAAA ACCGACATCA GCGAGTTAAC TACTGCAGGG
TTAGCCGAAT CTGATTTAGG TAAATCTGCG CTAAGTAGAT GTGAGTCAGG CAAATCTAAG
TCAGGCGAAC CTGAGTTAAT TGGCTCCGAA TTAGAAGCAC AAATTCAAAC GGCCATTGCC
TTAGGTAAGC ATTTTAATGC GCAGCTTTTT ATCAATGATC ACTGGCAGTT AGCGATAAAA
TACCATGCCT TTGGGGTACA TTTAGGCCAA GAAGATCTCG CCGTTACCGA CTTAGCGGCC
ATTCAAGCCG CGGGGCTCGC GCTAGGCATA TCGAGCCACA GTTATTTCGA GCTGTTATTG
GCGCACCAAT ACTCGCCATC CTACATAGCG CTTGGGCATA TATTCCCAAC CACGACGAAG
CAAATGCCTT CGGCGCCCCA AGGGCTCGCA AAACTTAAAC ACTATGTGGC GTTACTCCAA
GACCATTATC CCTTGGTCGC CATTGGCGGT ATTGACTTAA CAAATCTGGC AAAGGTGAAA
GCAACGGGGG TGGGCAATAT TGCTGTGGTG CGCGCAATAA CGAAAGCTAA GGATCCGTTA
GCCGCCTTTG CAGAGTTGAG CCAAGCTTGG GAGCAATGTA GCTTGTCTGA AGAACTGGCT
GTAAAGCATG AGTTGGATGC AAAGCATGAG TAA
 
Protein sequence
MLGIHGDNAP MNTEHPAFVW TIAGSDSGGG AGIQADLATI QDLGCHGCSV VTTVTAQSSV 
AVTLVEPVSA AMLMAQLTTL LSDLPPKAIK IGLLADQTQV ALLADWIASF KIHYPSVPVI
VDPVMVASCG DALAVDNCQD IKSAAKSALD FKPFKGLIEL ITPNVLELGR LTHSDVSTKA
QFAAAALALS QSLDCSVLAK GGDVSFGSTD ILDDTHAQTH DNTYAQTQAN VHVIALDSNG
WDLELAEDYL VCRQVRASSK LHQNGRFWLA SQRVNTPHNH GSGCTLSSAI AAVLAQGFVL
QDAVVVAKAY VSQGLSAAIG LGQGPGPLAR TGWPNDVSRY AKINLCDSNF ISHQLNQHLD
VGNDLVATVL SATDQATAQV RIASTQPQNI LSHGFKVLDA DLGVYPVVND LTMLESLLAA
GVKTLQLRIK TDISELTTAG LAESDLGKSA LSRCESGKSK SGEPELIGSE LEAQIQTAIA
LGKHFNAQLF INDHWQLAIK YHAFGVHLGQ EDLAVTDLAA IQAAGLALGI SSHSYFELLL
AHQYSPSYIA LGHIFPTTTK QMPSAPQGLA KLKHYVALLQ DHYPLVAIGG IDLTNLAKVK
ATGVGNIAVV RAITKAKDPL AAFAELSQAW EQCSLSEELA VKHELDAKHE