Gene Information |
Locus tag | Spea_2754 |
Symbol | |
ID | 5663147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | - |
Start bp | 3365183 |
End bp | 3368101 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641237396 |
Product | collagenase |
Protein accession | YP_001502609 |
Protein GI | 157962575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
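The coordinates and lengths above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated in it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3365183
END_BP = 3368101
REPORTED_GENE_LENGTH_BP = 2919
REPORTED_PROTEIN_LENGTH_AA = 972


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2919 972
```

Under these assumptions, 3368101 - 3365183 + 1 = 2919 bp, and 2919 / 3 - 1 = 972 aa, which agrees with the reported gene and protein lengths.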
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATCA AATCATCTAA ATCACAAGCT AAAAAGGGTC AGTTTAAAGT TCATGCTTTA GCCTTGGCTT GTTCAGCTAT TCTGTTTCCT AGCGCTGCGT TAGCAACATC TGAAGTCGCA CCAACTGAAA AGGCTTCATC TAAAGCCGCT CACGGAAAAG ATCTGAAAAG TCGCTTTGAA AAGGGCGCTA ATGTGCCTGG TAAGCAAAAA AATGCTGAAA AGACTCAACG TCAGAATATC GATAAGAGTA AAGAACAGAA AGTTGAAGGA CCAATGGGGG CTGTTGGTTC CGTAGGTCCT GCAGCTGAAT CTGAAGCAGA GGTATGTAGT AGTGAGCTTG CAGCATTAAG TGGCCAAGAG TTATTTAATT ATGTGAGAGA AGCCGATATT TCTTGTATCT CTGCGCTTTA TTCACGTAAC GATGCAGTAT CTGTTGCAGC TTACCAGACT GAAAATGTGG TTTCGGTTGC TAATCAAGCG GCCTCTATGG CAGCGACTTA CGACAGCAGC ACAGGTTATG AAATGCGTAA CTTGTTCTAC TTCTTACGCG GTGCATTTTA CATTGAGTTC TATAACGACG ACCTAACTTA TAGTGATACC CTAGCCGCTG ATGCCGTTTA CGGTGCGTTA GTTGAGTATT CAAAGAACCC TAAGCTTTTT GAGATCACTC ATTCTGCTGG TGACACTTTA ATGGAGTTTT TCACTTCATG GTCGAGCGCC GATCGAATCC TTGAAAGCGT GCCTTTAATT ACAGATTACC TGCAGATGTT TAATGCTGAC TTCTTAGCAA GTAATCGCCA CCGCGCAGCG ATGACAAGTG CACTGACTAC ACTTTATTAC GGTAGTTGGG CTGAGGATTA CAATAAGCTA GCCATGGAGC ATGGTGAGCT AATAGATGCG CTACTGAATA TTGCGACTGC TGATTATATT ATTAACTCTG ATTACCAGTA TGAATCGACA GATGCTTTTC ATGAGTTTGG TCGCTTCTAT GAGTATCAAG CGTATTGGGA TCTACCACAA AGCTTAAAAA CACGCCTAAA TGACGGTGTA GAGCTCTACA TGAGCAAGTT TGAACGTATG TCTGCTGAGT GGGCTGACGG TGCAGGCTAT CTAGATTATT ATAACCCAGG TGATTGCGAA CAGTTTGGGA TATGTGGCTG GGAAGAGGAG CTTGAACAGA CGGCATTACC TATCAACTAC AACTGTAGTG ATACCATCTA TATTCGTGCT CAGCAGTTAA CTAATGATGA GCTACAAGCC TCTTGTGATC TTATGGGAGG AGAAGAAACA CTGTTCCATA ATGTACTCGC AACGGGTTAC CAGCCAGTTG CCGATGATTT GAATGAAAGC TTAGAAGTTA ATATTTTCGA TAGCTACGAT GACTATGCCC AGTATGCCGG TGTGATTTTT GGTATTAGTA CTGACAATGG TGGTATGTAC CTAGAGGGGA CACCTTCACA AGAAGGTAAT CAGGCGCGTT TTATTGCGCA TGAAGCAACC TGGACTGAAG ATATTTTAGT TTGGAACCTT AAGCACGAAT ATGTTCATTA TCTAGATGGC CGCTTTAATC AATATGGTGC GTTTAACTAC TTTGATATCG ATACGGGTAA ATCTGTTTGG TGGTCAGAAG GTCTAGCCGA GTACATATCT AAGCAGAACC GTAATGACGG TGCTGTCGAT ATGGGGCGCT CTCAGGCCTA TAGCCTCAGC GAAATCCTAG CGAATACCTA TAACAGTGGT TCAGATCGCG TTTATAGCTG GGGTTATCTA GCGGTGCGTT TCCTATTTGA AAACCACAGA 
AGTGATGTTG ACGAATTATT AGTGCTAGCA CGTGGTGGTG ACGCCGATGG TTGGTTGGCT TATATCGATA ACACTATTGG TCAAAACTAT AACAGCGAGT GGAACACTTG GTTGACTTCA GTGACCAGTA ATGATGAGTC AATTGCTATT GATATAACTG ATCCAATTGA TTCAGATGGT GACGGCGTAG CCGATGACCA AGACGCTTTC CCCCATGATC CGAGTGAAAC CCGCGATACT GATGGTGACG GTGTTGGTGA TAATGCAGAT GCGTTCCCAA CCGATGCTAG CGAAACGGTC GATACCGACG GTGATGGCGT GGGTGACAAC GGTGATGTAT TCCCAACAGA TCCTACTGAG TGGGCAGATT CGGATGGTGA TGGCATCGGT GATAATGCAG ATACTGATGA GCCAAATAAG CCTGTCGAGC ATTGCGGCGT AGCAACCATT AGAGATGGCA AGCTAACTCA AGATAAAGTT GAGTGTGTTG AAGGTACAGA TATTAACTAT TTCTACACCT ATGTAGAGGA AGATAACACG CCGCTTTATA TCAGCACCTC TGGCGGTGAA GGTGATGTGG ATCTGTTCTT CAATCAAGAT ATCTGGGCTA AGCCATCGGA CTTTGATGCA AGATCTGAAA ACGATGGTAA CGAAGAATCG CTAAGCGTGA TTGCTAATAG TGGTTGGGTA TATGTGAGCC TACTCGCACA CCAAGACTTT GACGGTGTCA GCCTTAAGGT TAGCCTAACA GAGGAGGGCG ATACAGGCAC AACTCCTGTA GCAGTTGCTG ATGCATGTGC AACACAATCT CCATATAGCT ATGGTGGCGT TGAGTTTGGC GAGGCTATCT GTATCGAAGA TGGTCACTCA AGCTACTACT TCTATGTGCC TACAGGTACT GAAGAGGTTA CTGTTGCAAG TGGTCATGGT AGCGGTAATG TGAATCTATA TGGTAATAGC CAGACTTGGG CTGGTCCTGA CTCATTCGAA GTTAAGTCTG AAAATGCGGA TAATGTTGAA AGCCTAACAA TCTCAGCTCC AGCAGAAGGT TGGTACTACA TCAGTGCCGA TGGTGCGCCA TCGAGCGAAG GTGCAAGCTT GGTGGTTAAC ATTAAGTAA
|
Protein sequence | MSIKSSKSQA KKGQFKVHAL ALACSAILFP SAALATSEVA PTEKASSKAA HGKDLKSRFE KGANVPGKQK NAEKTQRQNI DKSKEQKVEG PMGAVGSVGP AAESEAEVCS SELAALSGQE LFNYVREADI SCISALYSRN DAVSVAAYQT ENVVSVANQA ASMAATYDSS TGYEMRNLFY FLRGAFYIEF YNDDLTYSDT LAADAVYGAL VEYSKNPKLF EITHSAGDTL MEFFTSWSSA DRILESVPLI TDYLQMFNAD FLASNRHRAA MTSALTTLYY GSWAEDYNKL AMEHGELIDA LLNIATADYI INSDYQYEST DAFHEFGRFY EYQAYWDLPQ SLKTRLNDGV ELYMSKFERM SAEWADGAGY LDYYNPGDCE QFGICGWEEE LEQTALPINY NCSDTIYIRA QQLTNDELQA SCDLMGGEET LFHNVLATGY QPVADDLNES LEVNIFDSYD DYAQYAGVIF GISTDNGGMY LEGTPSQEGN QARFIAHEAT WTEDILVWNL KHEYVHYLDG RFNQYGAFNY FDIDTGKSVW WSEGLAEYIS KQNRNDGAVD MGRSQAYSLS EILANTYNSG SDRVYSWGYL AVRFLFENHR SDVDELLVLA RGGDADGWLA YIDNTIGQNY NSEWNTWLTS VTSNDESIAI DITDPIDSDG DGVADDQDAF PHDPSETRDT DGDGVGDNAD AFPTDASETV DTDGDGVGDN GDVFPTDPTE WADSDGDGIG DNADTDEPNK PVEHCGVATI RDGKLTQDKV ECVEGTDINY FYTYVEEDNT PLYISTSGGE GDVDLFFNQD IWAKPSDFDA RSENDGNEES LSVIANSGWV YVSLLAHQDF DGVSLKVSLT EEGDTGTTPV AVADACATQS PYSYGGVEFG EAICIEDGHS SYYFYVPTGT EEVTVASGHG SGNVNLYGNS QTWAGPDSFE VKSENADNVE SLTISAPAEG WYYISADGAP SSEGASLVVN IK
|
| |
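The gene sequence, protein sequence, and GC content fields can be related to each other with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and that the sequence contains only A, C, G, and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so the sketch uses the standard codon assignments and does not handle alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon assignments in the conventional TCAG ordering (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence shown above.
first_groups = "ATGTCAATCA AATCATCTAA ATCACAAGCT"
print(translate(first_groups))  # MSIKSSKSQA
```

The translated prefix matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein length to be checked the same way.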