Gene Sbal195_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4290 
Symbol 
ID5756121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp5070309 
End bp5071409 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID641290646 
ProductO-succinylbenzoate synthase 
Protein accessionYP_001556708 
Protein GI160877392 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1441] O-succinylbenzoate synthase 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0281023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTTAA CTTCGCTTAA CCTGTATCAA TATCGACTAC CGCTCGATGT ATTATTGCCC 
GTAGGCAAGC AACGCATCGA CCACAGGGCT GGATTAGTAT TGCAAGCCTG CGCCAGCGCT
AATGTTGGTG GCGAATACAG ACAAGTCGAA GTTGAAATCG CCCCACTCTC TGGCATAGAT
ATCGATGAAC AAGTCTTAGT GGGATTTAGC CGCGAAACAC TGGCTCAAGT GCAAACGGCG
TTAACTGAGT TATTACCCCT ACTCTGCGGT CTGCATATCG ATACCCTGCT GGATTACGCC
GAGCAAAGTC CTTACCCTTC CTTGGCCTTT GGTTTAAGTC TGTTGCATGC CAAGTTAATG
GGTAAGCTCG ATGCTATCCG CCCACAAACC GCCACAGTCC CTTTGATTTA TCATCCAACG
GATGCGGGCA AAGACTTGCT CGATAACAAA GTTGCGGCGC TCGGCATGCA TGTTCATTCG
GTGAAAGTTA AAGTCGCGCA GACCTCAATC GAAGACGAAC TGAGCCTGAT TTATGGCATT
TTACGCACCC GCCCTGATCT CAAGCTACGT TTAGACGCAA ACCGAGGGTT TACACTGGAA
CAAGCGATCG AATTCGCTGC TTGCCTACCC CTGGACACCA TCGAATACAT AGAGGAACCT
TGCCAGAATC CACAGGATAA TCTGGCATTT TATCAAGCAA TTGGTATGCC TTACGCCCTC
GATGAATCAC TCAACGATCC CGATTATCAG TTTGCGATGC AAAATGGCCT GACTGCGCTC
ATCATCAAAC CTATGCTGCT CGGCAGTATT GAAAAGCTAG CTAACTTAAT CGATACGGCG
CAAAGCTATG GTGTGCGTTG CATTATCAGT TCAAGTCTTG AAAGCAGTTT AGGCATTAGC
GATCTGGCCC ATTTAGCCGC GATTCTCACG CCCGATGAAA TCCCAGGACT CGATACCCTC
AGCGCCTTTA GCCAAGATTT AATCGTGTCA TCGGGTAAAA AACACTGCCT CACACTCGCC
CAACTTGAAC TCATTGCACA ACACGCCATT GATTCAACGC AGCAAAGTGA TTTGGGTGCA
GCCAAGCGTG AGGAAAACTA G
 
Protein sequence
MILTSLNLYQ YRLPLDVLLP VGKQRIDHRA GLVLQACASA NVGGEYRQVE VEIAPLSGID 
IDEQVLVGFS RETLAQVQTA LTELLPLLCG LHIDTLLDYA EQSPYPSLAF GLSLLHAKLM
GKLDAIRPQT ATVPLIYHPT DAGKDLLDNK VAALGMHVHS VKVKVAQTSI EDELSLIYGI
LRTRPDLKLR LDANRGFTLE QAIEFAACLP LDTIEYIEEP CQNPQDNLAF YQAIGMPYAL
DESLNDPDYQ FAMQNGLTAL IIKPMLLGSI EKLANLIDTA QSYGVRCIIS SSLESSLGIS
DLAHLAAILT PDEIPGLDTL SAFSQDLIVS SGKKHCLTLA QLELIAQHAI DSTQQSDLGA
AKREEN