Gene Sbal223_1732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1732 
Symbol 
ID7088266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2042587 
End bp2043747 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content47% 
IMG OID643460636 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_002357660 
Protein GI217972909 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00561061 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCGTAAGC GCCATATCGC ATTTGAAAAG 
GAAAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTGGCTCC TTACAGCCTA
GGCCACGGTG CCAATTGGGA AGACTCACTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT
GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA
GCACTGTATA CCGCCAAAGT CACCGCCGAT ACCAACGAGT TTTACCGTAA CGCCTATGCC
GATGAAGTGG TTTTTGTTCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT
AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGTGGCA CGACTTATCA ACTTAAATTC
AACGATTACA GCAATGTGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT
AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTACTG CGAGCGCGAT
TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC TTTAGTGTGT
AAGTTTGGCG ATAAGTACCA ACTCACCACA CTTGAATGGC ATCCCTTTGA TTTAGTGGGT
TGGGATGGCT GTGTTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC
AAAATCCATC TGCCGCCTTC TGATCACTTG GTCTTTACTG CCCATAACTT TGTGATTTGT
AACTTCGTGC CGCGCCCTTA CGATTTCCAC CCGAAATCGA TTCCTGCGCC TTATTACCAC
AACAACATCG ATAGCGACGA AGTTTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG
GGCATTGAAG CGGGTTACAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG
GGTCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTCGAC
ACGTTCGCCC CACTTAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC
CGCTCTTGGC TAGAAGACTA G
 
Protein sequence
MPFYVKQGQI PRKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL 
GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TNEFYRNAYA
DEVVFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP
KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG
WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQNCMSKDYN RSWLED