Gene Sbal195_2728 details


Gene Information

Locus tag: Sbal195_2728
Symbol:
ID: 5754501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 3234528
End bp: 3235688
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 48%
IMG OID: 641289036
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001555156
Protein GI: 160875840
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
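
The gene and protein lengths listed above are consistent with the start and end coordinates. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(3234528, 3235688)
print(length_bp, protein_length(length_bp))  # 1161 386
```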


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.315515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.247333
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG 
GAGAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTGGCTCC TTACAGCCTA
GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT
GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA
GCACTTTATA CCGCCAAAGT CACCGCCGAT ACCGACGAGT TTTACCGTAA TGCCTATGCC
GATGAGGTGC TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT
AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGCGGCA CGACTTATCA ACTTAAATTC
AACGATTACA GCAATGTGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT
AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTACTG CGAGCGCGAT
TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC CTTAGTATGT
AAGTTTGGCG ATAAGTACCA ACTCACCACA CTAGAATGGC ATCCCTTTGA TTTAGTGGGT
TGGGATGGCT GTGCTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC
AAAATCCATC TGCCGCCTTC TGATCACTTA GTCTTTACCG CCCATAACTT TGTGATTTGT
AACTTCGTGC CGCGTCCCTA CGATTTCCAC CCGAAATCGA TTCCGGCGCC TTATTACCAC
AACAATATCG ATAGCGACGA AGTCTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG
GGCATTGAAG CGGGTTATAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG
GGCCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTCGAC
ACCTTCGCCC CACTTAAATT AACCGAACAT GTACAACATT GCATGAGCAA AGACTACAAC
CGCTCTTGGC TAGAAGACTA G
 
Protein sequence
MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL 
GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA
DEVLFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP
KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG
WDGCAYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQHCMSKDYN RSWLED
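
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one hedged way to do that in Python. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons (the two differ in which start codons are permitted). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are illustrative, and only the first row of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG"
print(translate(first_row))  # MPFYVKQGQIPHKRHIAFEK
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and GC content.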