Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_1671 |
Symbol | |
ID | 5079329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | - |
Start bp | 1903570 |
End bp | 1904730 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640498820 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001183195 |
Protein GI | 146292771 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.991879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTT ATGTAAAACA GGGGCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG GAAAACGGTG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGCTTTTC CAATATTTAT TCCAATAAAT ATCACCACAA CATGCCCACC AAAGCATTAG AAGTGGCGCC CTACAGCCTA GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT GCCGACCGTG AAGGTAATTT CTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGATTTA GCTCTCTATA CCGCCAAAGT CACCGCCGAT ACCGACGAAT TTTACCGTAA CGCCTATGCC GATGAAGTGG TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGTGGCA CCACTTATCA GCTTAAATTC AACGATTACA GCAATGTGCG ACTTTTCGTC ATCGAGTCAT TCTCTATGGT GGAAGTGCCT AAGCATTTCC GTAACGAGTA TGGTCAGCTG TTAGAATCGG CACCTTATTG CGAGCGCGAT TTGCGTGTGC CCACATTGCA AGATGCTGTA GTTGAGCGCG GCGCCTTCCC CTTAGTGTGT AAGTTTGGCG ATAAATACCA GCTCACCACA CTTGAATGGC ATCCCTTTGA TTTAGTGGGT TGGGATGGCT GCGTTTACCC TTGGGCATTC AACATCACAG AATACGCACC CAAGGTAGGT AAAATCCATT TACCACCTTC TGATCACTTA GTCTTTACCG CCCATAACTT TGTGATTTGT AACTTCGTGC CACGCCCCTA CGATTTCCAC CCAAAATCGA TTCCCGCGCC TTATTACCAC AACAATATAG ATAGCGACGA AGTTTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG GGCATTGAAG CGGGTTATAT GACCTTACAT CAAAAAGGTG TGGCCCATGG CCCGCAACCG GGTCGCACCG AAGCCTCTAT TGGTAAAAAA GAAACCTATG AATATGCAGT GATGGTGGAC ACATTCGCCC CACTGAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC CGCTCTTGGC TAGAAGACTA G
|
Protein sequence | MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA DEVVFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD TFAPLKLTEH VQNCMSKDYN RSWLED
|
| |