Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssed_0039 |
Symbol | |
ID | 5613695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | - |
Start bp | 45367 |
End bp | 47139 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640930862 |
Product | collagenase |
Protein accession | YP_001471780 |
Protein GI | 157373180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000449009 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0513701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGCC GTTCTCTTAT TGCCATCACG CTTATCTCTC CCACACTCAT GACATTAAAT GGGTGTAGCA GCTCTCCTAA CTTAATAGTA GACCAGACAC TCATCAGCCA GATTAATCTG GCGAATAAAG ATTCATTTTT TGATGCCAGC AACCCGCAGT TAAACAAAAA TGCATTTGAA CAGATAAGTC TTGCCTTAGC GACAACAAGC GATCAGCAAC AAGCCGATAA CCTTCTTTAT TATCTACGCG CGTTTAGCTA TTTTGGGCCC ATCGATGAGC TTGATGACAC AAGCTACGAA TCACTCACCT TCGGGCTTAA ACAACTCGCT AATAATGAGT TGTTGAGTCA CTCCTCTCGG CTACAGGAAC AATATGCTGT TACCCTGTAT CGTTACTACG CTAACAGTGA ACGTGCAACA CAACTTGCCA CCCTCTTACC TCAACTTGAC TCACAGTTAT CCTCACTAGG GCAGAGCGCT TCCAATACGG ATAATGACTA CGCACTTTGG GAGACGCTCA GAGCCTATGG TCTGTTATTT AATACAGCCA GAAAAGAAGC GGATGGGGAT CTTAACAAAT TATTGGTAGA ACAGGATTTG GCGCGTCCGC TACTGGAGTT TGCAGCATCA ACGACAAGCA TAAGATCCGA TAATGACTGG CCTAAGGCAA ATGCTTATTG GGCTTTAGGG CTGTACCGTT TAGCACTACC GGCGAGTGAG AATGATGAGC CTACCCCTTC AGAGCAGAAG ATAGACGATG CGGTTGCCGA TATAGCCAAG CAAGACATTA AGCTACGCGG TGACAAAGCC AAAGATACCT ATACCTTGGG CTATCATGTC AATGCTTTTG CGGGTAAAGA GGCATGCGAA AACAACAGTG AGCTCTGCCG CATACCAGAG TTAGAAGATG TTCTGCCTAT CAATCATTCC TGCTCTGACA GTTTATTTAT TCTCGCACAA GACTTAAATG AGCAAGAGCT TGCGACAAGC TGCACTAAAC TCACCTCGCA AGAGGCCAAC TTTCATCAAG TTCTGGAAAC CCGATATCAG CCTACCGCCA ATGACTTCAA CAATGCACTG CGTGTTGTGG CGTTTAAAAA TTGGAGCCAG TACAACGCAT ACGGACAACT ACTTTTCGAC ATAGATACTG ATAATGGTGG CATGTATATC GAGGGTACTC CGTCAAAGCC TGGTAACCAG GCAACATTCT TCGCCTACCG ACAGTTCTGG ATCGAGCCAG AATTTGCTAT CTGGAATCTA AACCACGAAT ACGTACATTA CCTCGACGGT CATTTCGTCA AATATGGTGG CTTCGGGCAT TTCCCGGAGA AGATGGTGTG GTGGTCGGAA GGCTTGGCCG AATACATATC CAAGGGCAAC GACAATCCAA ATACGTTAAA AGTGATCAAG AAAGATATCG ATAAGGCCCC CAGCCTTGAA GAGATCTTTG CCACTGAGTA TAAAGACGGG CAAGACAGAA CATATAAGTG GAGTTATATG GCAGTGCGCT TTCTGGTTGA AAACCACCAT TCAGATTTCG TTCAACTGAG TCATTATTTA AAAACGGATT ACTTTGAAGG TTACGCTGAG TTGATGGCTG AGCTCACTGA CCATCAGGCG CAATTTTCAG ATTGGCTAAA TGTACAGGTG GAACAGTTTG ATGATAGCGA AGAGAAAGCT AAGCCCAGAC TCAATAAGCA AAATAGATAT AGCTACCGGG ATTACCTGCG ACCCGCTCAC TTGGTAAAAG ATGATGCTCA CAGACATTAT TAG
|
Protein sequence | MFRRSLIAIT LISPTLMTLN GCSSSPNLIV DQTLISQINL ANKDSFFDAS NPQLNKNAFE QISLALATTS DQQQADNLLY YLRAFSYFGP IDELDDTSYE SLTFGLKQLA NNELLSHSSR LQEQYAVTLY RYYANSERAT QLATLLPQLD SQLSSLGQSA SNTDNDYALW ETLRAYGLLF NTARKEADGD LNKLLVEQDL ARPLLEFAAS TTSIRSDNDW PKANAYWALG LYRLALPASE NDEPTPSEQK IDDAVADIAK QDIKLRGDKA KDTYTLGYHV NAFAGKEACE NNSELCRIPE LEDVLPINHS CSDSLFILAQ DLNEQELATS CTKLTSQEAN FHQVLETRYQ PTANDFNNAL RVVAFKNWSQ YNAYGQLLFD IDTDNGGMYI EGTPSKPGNQ ATFFAYRQFW IEPEFAIWNL NHEYVHYLDG HFVKYGGFGH FPEKMVWWSE GLAEYISKGN DNPNTLKVIK KDIDKAPSLE EIFATEYKDG QDRTYKWSYM AVRFLVENHH SDFVQLSHYL KTDYFEGYAE LMAELTDHQA QFSDWLNVQV EQFDDSEEKA KPRLNKQNRY SYRDYLRPAH LVKDDAHRHY
|
| |