Gene Sputcn32_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_1671 
Symbol 
ID5079329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp1903570 
End bp1904730 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content47% 
IMG OID640498820 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001183195 
Protein GI146292771 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.991879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTT ATGTAAAACA GGGGCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG 
GAAAACGGTG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGCTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA CATGCCCACC AAAGCATTAG AAGTGGCGCC CTACAGCCTA
GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT
GCCGACCGTG AAGGTAATTT CTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGATTTA
GCTCTCTATA CCGCCAAAGT CACCGCCGAT ACCGACGAAT TTTACCGTAA CGCCTATGCC
GATGAAGTGG TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT
AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGTGGCA CCACTTATCA GCTTAAATTC
AACGATTACA GCAATGTGCG ACTTTTCGTC ATCGAGTCAT TCTCTATGGT GGAAGTGCCT
AAGCATTTCC GTAACGAGTA TGGTCAGCTG TTAGAATCGG CACCTTATTG CGAGCGCGAT
TTGCGTGTGC CCACATTGCA AGATGCTGTA GTTGAGCGCG GCGCCTTCCC CTTAGTGTGT
AAGTTTGGCG ATAAATACCA GCTCACCACA CTTGAATGGC ATCCCTTTGA TTTAGTGGGT
TGGGATGGCT GCGTTTACCC TTGGGCATTC AACATCACAG AATACGCACC CAAGGTAGGT
AAAATCCATT TACCACCTTC TGATCACTTA GTCTTTACCG CCCATAACTT TGTGATTTGT
AACTTCGTGC CACGCCCCTA CGATTTCCAC CCAAAATCGA TTCCCGCGCC TTATTACCAC
AACAATATAG ATAGCGACGA AGTTTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG
GGCATTGAAG CGGGTTATAT GACCTTACAT CAAAAAGGTG TGGCCCATGG CCCGCAACCG
GGTCGCACCG AAGCCTCTAT TGGTAAAAAA GAAACCTATG AATATGCAGT GATGGTGGAC
ACATTCGCCC CACTGAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC
CGCTCTTGGC TAGAAGACTA G
 
Protein sequence
MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL 
GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA
DEVVFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP
KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG
WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQNCMSKDYN RSWLED