Gene Information |
Locus tag | Shew_1685 |
Symbol | |
ID | 4920448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | + |
Start bp | 1935281 |
End bp | 1938187 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640163247 |
Product | collagenase |
Protein accession | YP_001093811 |
Protein GI | 127512614 |
COG category | |
COG ID | |
TIGRFAM ID | |
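The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 2907 bp) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1935281
END_BP = 1938187


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (2907 bp) and Protein Length (968 aa) rows of the table.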
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000629981 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000468189 |
Fosmid hitchhiking | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGACAACCA ATATGACAAA TAAAACATTA GGCGGCGCTC GCCGCTTCAC CCCAACCGCG CTGGCCTTGG CCTGCTCGCT ACTGGCCACC CCGCTGGTGC TGCACGCTCA AGATGTTCAG CCACAAGGCA TGCCGGCAAA GGATGTAAAG GGCCCGCTTT CGATAAAAGC CGCACCGTCG ACAAAAGCCG CGCCGCCGAC GAAAGCCGCA CCTGCCCAGA AGGTGAAGCC GTCTGCCGAC AAGGCCGCTG CGAGTTCTCG ATTCGACACG TCTGCGCGCA AGCCCATAGG GGCACGTGAT GCGGAGCAGG GCGAAGGCGT GAGGCCTGAG AAGCGTGATA GCAAATTGGA GAAGGGGAGT AAGGGGCAAT CGGCGCTGCC AGGCATGAAG GGCAATCAAG GCGCGACCGG GCCTGCGGCC GAGGCGGATG CCTGTAATGG TTACAGCCAG TTCGACAGCT TGAGTGGTCA GGCGCTGTAC GATTTTGTTC GCCAGGCGGA GTTTTCCTGT ATCTCCGAGC TCTATTCTCA CAACGATGCC ACCGCGGTCA AGGTGTATCA GGCCGACAAT GTGGTGGCCG TGGCGAATCT GGCCAAGGCA ACGGCTGCAA GTTACGATAG CAGCACCGGC GGCGAGCTGT TTAACCTGTT CTACTTCCTG CGAGGCGCCT TCTACATAGA GTATTACAAC GATGATCTCA GCTATGGTGA CAGCCGCGCC AGCGATGCCA TGCGTGAGCT GCTACTGGAA TATGCCAAGA ACCCGGCCTT TAACAGCCTG AGCGACATGC AGGGCAACAC ACTGCAGGAA TATTTCATCG CCTGGGACAG CTCCTACAAC TACTATGATT CGGTGGCCGT GATCACCGAT TATCTGAATC AGTTTAGCGA GCAGCATCTG GCCTCCTGGT ACCACAGAAG CGCGCTGACC AAGGCGCTCA CCACCCTGTA TCGCGGCAAC TGGGATGAAA AATATACCAA GGCGTCGATG GAGTATGATG CCCTGCGCGC CGCCTTGCTC AAGGTGGCGA CCTCGGACTA CATCATCAAC TCAGAGTATG CTTTCGAGTC CACGGATGCC TTTCACGAGT TTGGACGTTT CTACGAGTAT CAGGCCTACT GGAAACTACC GGATAGCCTG AAAACGGCCC TCAACGGCGG CGTGCAGCAA TATATGGCCA AGTTTAACCG TCTGTCTAAG CAGTGGGTGG ATGCGGCCGG TTATCTGGAT TACTACAACC CTGGCCAGTG CGACAGCTTC GGCATCTGCG GCTGGGAGGA TGAGGTGGAA GCCAGTGTGC TTGCTATTCA CTACAGCTGC TCGGACACCA TCAAGATCCG CGCTCAGCAG CTGACGGATC AGGAGCTGCA GGCCTCTTGC GAGCTCATGG GTGAGGAAGA GGGGCTGTTC CATCAGATCC TGGCGACCGG CGGCGAGCCA GTGGCGGACG ACTACAACGA AGATCTGCAG GTCAATATCT TCGATAGCTA TGACGACTAT GATGTCTACG CCGGCATCAT CTTCGGCATC AACACTGACA ACGGCGGCAT GTACCTGGAA GGCACGCCGT CGGATCCCGA CAACCAGGCA CGTTTTATCG CCCACGAGGC CACTTGGACC GACGACATCT TGGTGTGGAA CCTGAGACAC GAATATGTGC ACTACCTGGA TGGTCGCTTC AATCTTTACG GCGCCTTTAA CTACTTCGAT GTCGACACGG GTAAGTCTGT GTGGTGGGCC GAGGGCCTCG CCGAATACAT CTCCCATCAG AATCGCTACG ATGAGGCGAT CGATATCGGC 
CGCAGCCAAG AATTTAGCCT GAGCGAGATC TTGTCTAACA CCTATGACAG CGGTACCGAC AGGGTCTATC GCTGGGGCTA TCTGGCGGTG CGCTTCTTGT TCGAGCAGCA CAGAAGCGAT GTGGACGCTT TGTTAGTTCA CGCCCGCGCC GGTGATACCT CGGCCTGGCT CAGCTATATC GACAACACTA TCGGCACGAA CTATGACGCA GAGTGGAACA GCTGGCTGCT GACGGTGACT AGTAACGATA CGCTGCTCGA CGGTGGCATA GTGACCCCGG TCGACAGCGA TGGTGACGGG GTGATCGACA GCGAAGATGC CTTCCCCAAC GATCCGACCG AGTGGGCGGA TGAGAATGGT AACGGTATCG GCGATAACGC CGATGCGGCC AATGGTGGTG GGCAGACGGG GAACTGCGGC GCCGCAACCA TCAGCGACGG CAATATTACC CAGGAGCAGG CCGAGTGTGT GGCAGGCACA GGGGTGAACT ACTACTATAC CTATGTCGAG CAGGATAATA CTCAGCTCTA TATCAGCACC ACGGGCGGTG AGGGCGATCT GGATATCTAC TTTAACCAGC AGACCTGGGC GAGCCCGAGC GACTATGAGG TGAAGTCGCA AAACAGCGGC AACGAGGAGC TGATCAGCGT GGTCGCCAAT CGTGGTTGGG TCTATCTCTC GACGGTAGCC GTGACGCCAT TTGAGGGCGT TAGCCTCAAG ATCAGCCAGA GCGCCGACAC TACGCCGGAT AACGGTTCGC CTCAGGTGGC GGATGCCTGC CTAAGCCAAA GCCCTTATAG TTATGGCGGG GTAGAGTTTG GTCAGGCGGT GTGTGTGGAC GACGGCCATT CCAGCTATTA CTTCTATGTG CCGGCCAATA CGGCGGCTAT CGAGATCAAC ACGGCCCATG GTAGCGGCGA TCTGAACCTC TACGCCAACG CAACGACCTG GGCGAGCCCG ACGGCTTATC AGTTCAAGTC GGAAAACGTC GGCAACAGCG AGCAGATCCG CGTCGTTTCA CCTGCCGAAG GCTGGTTCTA TGTCAGCGCC GATGGAGCGC CCTCAAGCAG TGGCGCCAGT TTGCTGGTTA CCCTGGCTAG CAACTAA
|
Protein sequence | MTTNMTNKTL GGARRFTPTA LALACSLLAT PLVLHAQDVQ PQGMPAKDVK GPLSIKAAPS TKAAPPTKAA PAQKVKPSAD KAAASSRFDT SARKPIGARD AEQGEGVRPE KRDSKLEKGS KGQSALPGMK GNQGATGPAA EADACNGYSQ FDSLSGQALY DFVRQAEFSC ISELYSHNDA TAVKVYQADN VVAVANLAKA TAASYDSSTG GELFNLFYFL RGAFYIEYYN DDLSYGDSRA SDAMRELLLE YAKNPAFNSL SDMQGNTLQE YFIAWDSSYN YYDSVAVITD YLNQFSEQHL ASWYHRSALT KALTTLYRGN WDEKYTKASM EYDALRAALL KVATSDYIIN SEYAFESTDA FHEFGRFYEY QAYWKLPDSL KTALNGGVQQ YMAKFNRLSK QWVDAAGYLD YYNPGQCDSF GICGWEDEVE ASVLAIHYSC SDTIKIRAQQ LTDQELQASC ELMGEEEGLF HQILATGGEP VADDYNEDLQ VNIFDSYDDY DVYAGIIFGI NTDNGGMYLE GTPSDPDNQA RFIAHEATWT DDILVWNLRH EYVHYLDGRF NLYGAFNYFD VDTGKSVWWA EGLAEYISHQ NRYDEAIDIG RSQEFSLSEI LSNTYDSGTD RVYRWGYLAV RFLFEQHRSD VDALLVHARA GDTSAWLSYI DNTIGTNYDA EWNSWLLTVT SNDTLLDGGI VTPVDSDGDG VIDSEDAFPN DPTEWADENG NGIGDNADAA NGGGQTGNCG AATISDGNIT QEQAECVAGT GVNYYYTYVE QDNTQLYIST TGGEGDLDIY FNQQTWASPS DYEVKSQNSG NEELISVVAN RGWVYLSTVA VTPFEGVSLK ISQSADTTPD NGSPQVADAC LSQSPYSYGG VEFGQAVCVD DGHSSYYFYV PANTAAIEIN TAHGSGDLNL YANATTWASP TAYQFKSENV GNSEQIRVVS PAEGWFYVSA DGAPSSSGAS LLVTLASN
|