Gene Sbal195_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_3301 
Symbol 
ID5755105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp3891468 
End bp3892571 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content50% 
IMG OID641289634 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_001555723 
Protein GI160876407 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00274465 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000125479 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGCTGC ACAGTATAGA AGAGATCATC GAAGATATTC GTCAAGGCAA AATGGTTATT 
TTGATGGATG ACGAAGACAG AGAAAACGAA GGTGATTTGA TCATGGCGGC CGAAATGGTA
ACGCCAGAAG CGATTAACTT TATGGCGAAA TATGGCCGTG GACTCATTTG CCAGACCATG
ACTAAAGCCC GTTGCCAGCA GTTAAATCTG CCCTTAATGG TGACGAATAA CAACGCCCAG
TTCTCGACTA ACTTTACAGT TTCTATTGAA GCAGCCGAAG GCGTGACTAC CGGTATTTCG
GCCCACGACC GCGCGGTAAC GGTAAAAACG GCCGTGGCTA AAGAGGCTAA AGCGTCTGAT
TTAGTGCAAC CAGGGCATAT CTTCCCGTTA ATGGCACAGG ACGGCGGCGT ATTAACCCGC
GCAGGGCACA CTGAAGCTGG TTGTGATTTA GCCCGTCTTG CGGGACTTGA GCCATCGGGC
GTTATCGTTG AGATTTTGAA CGAAGACGGC ACTATGGCAC GCCGCCCAGA TTTAGAGATT
TTCTCCGAGT TGCACGGCAT CAAAATCGGC ACCATCGCGG CATTGATCGA GTATCGCAAC
ACCAAAGAAA CCACGGTTGT GCGTGAAGCT AAATGCAAAC TACCGACCCG TTTCGGTGAG
TTCGACATGG TGACTTTCAG AGACACTATC GACAATCAAC TGCATTTTGC CTTAGTCAAA
GGTGAGGTGA AGAGTGATTG TTTAGTGCGC GTGCATCTGC AAAACACTTT CAACGATTTA
CTCCATTCAG AGCGCGATCA GCAACGCAGC TGGCCACTCG AAAAGGCGAT GGAGCGTATT
TCTGCAGAAG GTGGCGTATT GGTTTTATTA GGGAATCAAG AACATCCCTG TGAAATCCTC
TCTAAGGTGA AAGCCTTTGA AGCCGAAGAT CAAGGTCAAG CGCCTGCTTC TGCAAAATGG
CAGGGGACGT CGCGCCGTGT GGGTGTGGGT TCGCAAATCC TCGCTAGCCT TGGCGTGACT
AAGATGCGCC TGCTCAGCTC GCCTAAACGT TACCATTCAC TTTCGGGCTT TGGCCTTGAA
GTGACTGAGT ATGTGGCGGA CTAA
 
Protein sequence
MALHSIEEII EDIRQGKMVI LMDDEDRENE GDLIMAAEMV TPEAINFMAK YGRGLICQTM 
TKARCQQLNL PLMVTNNNAQ FSTNFTVSIE AAEGVTTGIS AHDRAVTVKT AVAKEAKASD
LVQPGHIFPL MAQDGGVLTR AGHTEAGCDL ARLAGLEPSG VIVEILNEDG TMARRPDLEI
FSELHGIKIG TIAALIEYRN TKETTVVREA KCKLPTRFGE FDMVTFRDTI DNQLHFALVK
GEVKSDCLVR VHLQNTFNDL LHSERDQQRS WPLEKAMERI SAEGGVLVLL GNQEHPCEIL
SKVKAFEAED QGQAPASAKW QGTSRRVGVG SQILASLGVT KMRLLSSPKR YHSLSGFGLE
VTEYVAD