Gene Sbal223_3066 details


Gene Information

Locus tag: Sbal223_3066
Symbol:
ID: 7088976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3633873
End bp: 3635549
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 50%
IMG OID: 643461950
Product: Carotenoid oxygenase
Protein accession: YP_002358974
Protein GI: 217974223
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
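
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3633873, 3635549)
print(length, protein_length(length))  # prints: 1677 558
```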


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.061004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000132751
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATAGAC GTCAATTTCT AGCAGGCTCA GCCGCCCTAA TGACGACCGC TGTTTGGGCG 
AGTTCAGTGG ATAAATTACT GGCGCTGGAT AACGCACAAG CGGTCGCCCG TGAGCCATGG
GCCATAGGTT TTAATGGGGC GGAAGGCGAC TATCAGCCGC TCACTATGCA GGTTTCGGGC
GAGTGGCCAG CGCAATTGAG CGGCAATCTA TATCGTAATG GGCCAGCGAA AATGTCTCGG
GCAGGTAAGG CATATCAGCA TTGGTTCGAT GGCGATGGCA TGGTGCAGCA GTTTAGTGTG
GCCAATGGGC AAATTCAGCA TCAGGCTAAG TTTATCCAAA CAAAGAAGTT TAAAGTGGAG
GAGGCCGCTG GCGCGTTTAA GTACACCACC GCAGGCACAG TGATCCCAAA TGCTTTACCC
ATTCGCAATA ACGATGACTT GAATGTGGCT AACACCAGTT TATTACCGCG CCACGGCGAG
TTGCTCGCTT TGTGGGAAGC GGGTTCACCC TATCGACTCG ATGCGCAGAC CTTAACTACG
CTCGGTGTTA AAAGCTTTGG CGACAAGTTT AAGGGCCTAC CCTTTTCCGC CCATCCACTC
GATGATGGAC AAGGTGGACT GTGGAATTTT GGCGTCTGGT ATGTCGGCGG CGAGAAGCAA
TTACTTGTGT ATCAAGTGGC AGCCGATGAC AGCATTGCCA GAATGGAAGT GATTGAATTA
CAACAGGCGA GTTATTTACA TGCCTTTGCG CAGAGTGAGA ATAAGCTAGT GTTTTATATT
TCATCCTGCG TGTATGAAGA GGGCGAAACT TATATCGATG CTTTTAAATG GCGACCCGAA
TTGCCATCGC AATTACTGGT TATCGACAAA GCCGACTTTA AAGCGCGCCA GTTACACCCG
CTGCCTGCTG GATTTGTGTT TCACTTTGGT CAAGCCATTG AGAAGAATGA CGAACTGTCA
GTGCAACTTT GCCTCTATCC CAATTCGGCG ATTTTAACTC AAGGGATGAA GGGGCTACTG
ACGGGCGCCC GTAAAGCAAA AACACCCCAT GCCGAGTTAG TGACGATCAC AGTACCGCTG
ACGGCACCAC TTACAGCAGC TAGCAAAACG CAGTTTGCGG GGACGTTATC TCCCTCTGCA
CGTATTAACC GCAGCGGCGT GATGATGGAG TTTCCCCAGT TTGCGCAGCA CACTATGAGT
AACAAAGAGT CTGCACAGCA AACGAGCAAT GCCGAAAAGC TTGCAGAGAA AACGGGTGGG
GCTTCGGAAT TTGTCCCGTT ATTTGGCGTT GGAGCAAGAT CAGATTCTGC TGCTAACTCT
GGCTCCAGCT CTAACTTTAA ATATAAAGCA GAAGCCGAAT CTGAATCAGG ACTGAGTAAT
ACTCTGTATT GTATTCATAA TGCAAAAGAC AGCAGCCAGC ACGGTGTGGA TACACTTGGC
GACGCGAGCT ATAGCGCGTT TTATCTTGGC AAAGGCAAGA TTGCCGAAGA GCCGCTCTAT
ATCCCCGCCA CTGAGCAGCA TGAAGCCTAT TTATTGATGA CTTGGCTCGA TTACCACAAT
GCACAGTCTG GCTTGTCTTT GTTTCGTGCC AGTGACATCA GTGCGGGCCC CATCGCTTCG
GCTCAGATGA ACAGGGTATT GCCATTAGGG TTTCACGGTT GTTTCATTAA TGCCTAA
 
Protein sequence
MDRRQFLAGS AALMTTAVWA SSVDKLLALD NAQAVAREPW AIGFNGAEGD YQPLTMQVSG 
EWPAQLSGNL YRNGPAKMSR AGKAYQHWFD GDGMVQQFSV ANGQIQHQAK FIQTKKFKVE
EAAGAFKYTT AGTVIPNALP IRNNDDLNVA NTSLLPRHGE LLALWEAGSP YRLDAQTLTT
LGVKSFGDKF KGLPFSAHPL DDGQGGLWNF GVWYVGGEKQ LLVYQVAADD SIARMEVIEL
QQASYLHAFA QSENKLVFYI SSCVYEEGET YIDAFKWRPE LPSQLLVIDK ADFKARQLHP
LPAGFVFHFG QAIEKNDELS VQLCLYPNSA ILTQGMKGLL TGARKAKTPH AELVTITVPL
TAPLTAASKT QFAGTLSPSA RINRSGVMME FPQFAQHTMS NKESAQQTSN AEKLAEKTGG
ASEFVPLFGV GARSDSAANS GSSSNFKYKA EAESESGLSN TLYCIHNAKD SSQHGVDTLG
DASYSAFYLG KGKIAEEPLY IPATEQHEAY LLMTWLDYHN AQSGLSLFRA SDISAGPIAS
AQMNRVLPLG FHGCFINA
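
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned and how the gene sequence relates to the protein sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meanings). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(listing.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


# First line of the gene sequence listing (60 bases).
first_line = clean(
    "ATGGATAGAC GTCAATTTCT AGCAGGCTCA GCCGCCCTAA TGACGACCGC TGTTTGGGCG"
)
print(translate(first_line))  # prints: MDRRQFLAGSAALMTTAVWA
```

The translated residues match the first twenty residues of the protein sequence listing. Passing the full cleaned gene sequence to `gc_content` would give a figure to compare with the GC content field.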