Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_1857 |
Symbol | |
ID | 3966900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 2347694 |
End bp | 2350645 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637920940 |
Product | putative metalloprotease |
Protein accession | YP_527329 |
Protein GI | 90021502 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.263954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.471387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCCA GCGCAGACTC GGTAACAAAC ACTCCCGCTC ACCACAAAAG CTTTAACTAC ATCCGCTCAC AGTATGTTGC ATCGCTTAAC TTAACCGTGC AGGAATTTGA GCACAAAGTA ACGGGAGCCA AACACTACCA CTTGGCCGCA GACAACCAAG AAAACGTATT TTTAGTGGCG TTGCGCACCG TGCCACAAGA TAGCCGTGGC GTAGCCCATA TTCTAGAGCA CACCGCACTT TGCGGCAGCG ATAAATACCC TGTGCGCGAC CCATTTTTTA TGATGACGCG CCGCTCGTTA AACACGTTTA TGAATGCCTT CACCAGCTCT GACTGGACTG CTTACCCGTT TGCGAGCATG AACAGCAAAG ATTACAACAA CCTATTAGAT GTGTATTTAG ATGCGGTGTT TTTCTCGCGC CTAGACCCTC TCGACTTTGC CCAAGAAGGC CACCGCCTCG AATTCGAAAC CGCAGACGAC CCAAGCTCTA AGCTGGTGTA CAAAGGCGTT GTGTTTAACG AAATGAAAGG CGCTATGAGC TCGGTCCCGT CGCAGCTTTG GCAAACTCTC ACTAAGTACT TGTTTCCAAA TAACACCTAT CACCACAACA GCGGTGGTGA CCCAGAAGCC ATTCCCGATT TAACCTACGA GCAGCTGATT ACGTTCTACA AAACCCACTA CCACCCAAGC AACGCTATTT TCATGACCTA TGGCGACAAA CCAGCGGCTG AACTACAAGA AAAATTTGAA ACCCAAGCGC TATGTAAGTT TGAACGGTTA GATAAAGAAA TTTCGGTGCC CGACGAAAAA CGCTACTACG CGCCGGTAAA AGTGCAAGAA AGCTACCCCT ACACCCAAGC AGACGCTGCC GACAAACATC ACGTTGTACT GGGTTGGTTA TTAGGTAAAT CCACCAATTT GCAAGAAGCG CTGCAAGCAC AGTTGCTATC AAGCGTGTTA ATGGATAACA GTGCAACTCC ACTTATGCAC GCACTCGAAT CTACCGAGCT TGGCTCTTCC CCCTCGCCCA TTTGCGGCCT AGATGATTCG CAAAAAGAAC TTACATTTAT TTGTGGCTTA GAGGGCTGCC AAGAAGGCAG CGCAGATGAT GTAGAGCAAT TAATTGAAAG CACCTTGCGC AAAGTAGCCG AAGAAGGCAT TCCACAAGAA GACGTTGAAT CTGCACTGCA TCAATTAGAA TTACACCAGC GCGAAGTGGG CGGCGACAGC TATCCATTTG GCCTTCAGCT AATAATGACC GCCCTTACCG CGGCTACACA CCGTGGCGAC CCAGTAAAAT TATTAGATGT TGATGCAGCA CTGGATCAGC TGCGTGAAGA TATAAAACAG CCAAACTTTA TTAAAACACT TATCGAAAAG TGGCTGCTAA ACAACCCGCA TAAAGTACGG TTAACGCTTA GCCCCAATGC CGAGCTGCAG GCGCGTAAAG AGCAAGCAGA GCTAGAGAAA CTTGCGCAAA TACAAGCGAA CCTAACCGAA GAAGATAAAC AAAACATTGT TGAGTTGGCC CAAGCGCTTA AGAAACGCCA AGACCAAATT GATGACGAAA GCATTCTGCC CAAAGTGACT CTTAGCGATG TACCCGCAGC AGAAGATGCA ACCACAGGTG AAACGACCAC ACTTGCGCCC AACAACCCCA CACCCGTTAC GCGCTATAGC GCAGGTACCA ATGGCCTTGT GTACCAACAA TTGATATACC CGTTGCCGCA GTTTTCGCCC GAACAGCTTA ACGTATTGCC GCTGTACAAC ACCTGCATTA CTGAGCTTGG GCTCGGTGAA AAAGATTACT TAAACAGCCA GCGCTGGCAA GCTAGTGTTA CAGGCGGCCT TTCTGCTTTT AGCTCAATTC GCGGCGATGT TAACGACCCT CATAAATTAA CCAGTTATTA CGCGTACTCA GGTAAAGCAC TTAATCGCAA CCAAGCAGCG CTTAGTGATG TATTAAAAGC ATTAATAGAA ACGCCACGCT TTGACGAAAC TGGCCGCATT GCCGAATTGG TATCGCAAAT ACGCGCCCAC CGCGAGCAAA GCGTTACCGG CAGTGGCCAC AGCTTAGCCA TGGCTGCAGC ATCTTCTGGC ATTTGCGCTG CGGCTAACTT GCAACATAAC TGGTCTGGCT TAGCCGGCAT TAAAGCGCTT AAACAGCTGG ACGACGGTTT GCGGTCCGAA GGCGGCCTAG AAGCGTTATC AAATTTATTT ACCAGTATTC ACGAATTAAT TACTGCGTCG CAATTTCAAG CTTTGCTTAT TGGTGAGCAT GCCCATTTAG ACGAGATGCA AGCCCAATTA GCAGCCCGTT TTACAAGCAA TACCCAAACC CCAGCTACAG GCTTTGACCT TGGCAAACAA GTTCGCCAAG TTAACGATGC GTGGTTAACC GATTCACAAG TAAACTTTTG TTCTAAAGCC TACGCCACAG TTCCTATGGA CCACCCAGAC GCGGCCGCAC TTGTTGTGCT CGCCGGCGTA ATGCGCAATA GTTTTTTACA CAAAGCCATT CGCGAGCAAG GCGGCGCATA CGGTGGTGGT GCAAGCCAAG ATAGCACTAT TGGCGCTTTC CGCTTTTACT CCTATCGCGA CCCGCGCTTA GAAGAAACCC TTGCAGACTT TGACCGCAGT GTAGAGTGGG CAATAACCGA GAATCACGAT AGACAAAAAG TAGAGGAAGC CATTTTAGGT ACTATTGGCT CATTAGATCG CTCCGAGTCC CCTGCGGGTA AAGCAAAACG CTGTTTTTAT CAAGAATTAC ACGGGCGCCC ACCCGAAGTA CGTCAACGCT TCCGCGAGCG AATTTTAGCC GTTGAAGCCA GTGATTTGCG CCGCGTAGCG GAACTGTATC TAAAACCCGA CAATGCACAC ATTGCAGTTG TGACGGATTT CAGTAAAAAA GACAAAATGG AAGCAGCGGG CCTTACAATC CAAACGCTGT AA
|
Protein sequence | MTASADSVTN TPAHHKSFNY IRSQYVASLN LTVQEFEHKV TGAKHYHLAA DNQENVFLVA LRTVPQDSRG VAHILEHTAL CGSDKYPVRD PFFMMTRRSL NTFMNAFTSS DWTAYPFASM NSKDYNNLLD VYLDAVFFSR LDPLDFAQEG HRLEFETADD PSSKLVYKGV VFNEMKGAMS SVPSQLWQTL TKYLFPNNTY HHNSGGDPEA IPDLTYEQLI TFYKTHYHPS NAIFMTYGDK PAAELQEKFE TQALCKFERL DKEISVPDEK RYYAPVKVQE SYPYTQADAA DKHHVVLGWL LGKSTNLQEA LQAQLLSSVL MDNSATPLMH ALESTELGSS PSPICGLDDS QKELTFICGL EGCQEGSADD VEQLIESTLR KVAEEGIPQE DVESALHQLE LHQREVGGDS YPFGLQLIMT ALTAATHRGD PVKLLDVDAA LDQLREDIKQ PNFIKTLIEK WLLNNPHKVR LTLSPNAELQ ARKEQAELEK LAQIQANLTE EDKQNIVELA QALKKRQDQI DDESILPKVT LSDVPAAEDA TTGETTTLAP NNPTPVTRYS AGTNGLVYQQ LIYPLPQFSP EQLNVLPLYN TCITELGLGE KDYLNSQRWQ ASVTGGLSAF SSIRGDVNDP HKLTSYYAYS GKALNRNQAA LSDVLKALIE TPRFDETGRI AELVSQIRAH REQSVTGSGH SLAMAAASSG ICAAANLQHN WSGLAGIKAL KQLDDGLRSE GGLEALSNLF TSIHELITAS QFQALLIGEH AHLDEMQAQL AARFTSNTQT PATGFDLGKQ VRQVNDAWLT DSQVNFCSKA YATVPMDHPD AAALVVLAGV MRNSFLHKAI REQGGAYGGG ASQDSTIGAF RFYSYRDPRL EETLADFDRS VEWAITENHD RQKVEEAILG TIGSLDRSES PAGKAKRCFY QELHGRPPEV RQRFRERILA VEASDLRRVA ELYLKPDNAH IAVVTDFSKK DKMEAAGLTI QTL
|
| |