Gene Sde_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1857 
Symbol 
ID3966900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2347694 
End bp2350645 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content49% 
IMG OID637920940 
Productputative metalloprotease 
Protein accessionYP_527329 
Protein GI90021502 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.263954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.471387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCA GCGCAGACTC GGTAACAAAC ACTCCCGCTC ACCACAAAAG CTTTAACTAC 
ATCCGCTCAC AGTATGTTGC ATCGCTTAAC TTAACCGTGC AGGAATTTGA GCACAAAGTA
ACGGGAGCCA AACACTACCA CTTGGCCGCA GACAACCAAG AAAACGTATT TTTAGTGGCG
TTGCGCACCG TGCCACAAGA TAGCCGTGGC GTAGCCCATA TTCTAGAGCA CACCGCACTT
TGCGGCAGCG ATAAATACCC TGTGCGCGAC CCATTTTTTA TGATGACGCG CCGCTCGTTA
AACACGTTTA TGAATGCCTT CACCAGCTCT GACTGGACTG CTTACCCGTT TGCGAGCATG
AACAGCAAAG ATTACAACAA CCTATTAGAT GTGTATTTAG ATGCGGTGTT TTTCTCGCGC
CTAGACCCTC TCGACTTTGC CCAAGAAGGC CACCGCCTCG AATTCGAAAC CGCAGACGAC
CCAAGCTCTA AGCTGGTGTA CAAAGGCGTT GTGTTTAACG AAATGAAAGG CGCTATGAGC
TCGGTCCCGT CGCAGCTTTG GCAAACTCTC ACTAAGTACT TGTTTCCAAA TAACACCTAT
CACCACAACA GCGGTGGTGA CCCAGAAGCC ATTCCCGATT TAACCTACGA GCAGCTGATT
ACGTTCTACA AAACCCACTA CCACCCAAGC AACGCTATTT TCATGACCTA TGGCGACAAA
CCAGCGGCTG AACTACAAGA AAAATTTGAA ACCCAAGCGC TATGTAAGTT TGAACGGTTA
GATAAAGAAA TTTCGGTGCC CGACGAAAAA CGCTACTACG CGCCGGTAAA AGTGCAAGAA
AGCTACCCCT ACACCCAAGC AGACGCTGCC GACAAACATC ACGTTGTACT GGGTTGGTTA
TTAGGTAAAT CCACCAATTT GCAAGAAGCG CTGCAAGCAC AGTTGCTATC AAGCGTGTTA
ATGGATAACA GTGCAACTCC ACTTATGCAC GCACTCGAAT CTACCGAGCT TGGCTCTTCC
CCCTCGCCCA TTTGCGGCCT AGATGATTCG CAAAAAGAAC TTACATTTAT TTGTGGCTTA
GAGGGCTGCC AAGAAGGCAG CGCAGATGAT GTAGAGCAAT TAATTGAAAG CACCTTGCGC
AAAGTAGCCG AAGAAGGCAT TCCACAAGAA GACGTTGAAT CTGCACTGCA TCAATTAGAA
TTACACCAGC GCGAAGTGGG CGGCGACAGC TATCCATTTG GCCTTCAGCT AATAATGACC
GCCCTTACCG CGGCTACACA CCGTGGCGAC CCAGTAAAAT TATTAGATGT TGATGCAGCA
CTGGATCAGC TGCGTGAAGA TATAAAACAG CCAAACTTTA TTAAAACACT TATCGAAAAG
TGGCTGCTAA ACAACCCGCA TAAAGTACGG TTAACGCTTA GCCCCAATGC CGAGCTGCAG
GCGCGTAAAG AGCAAGCAGA GCTAGAGAAA CTTGCGCAAA TACAAGCGAA CCTAACCGAA
GAAGATAAAC AAAACATTGT TGAGTTGGCC CAAGCGCTTA AGAAACGCCA AGACCAAATT
GATGACGAAA GCATTCTGCC CAAAGTGACT CTTAGCGATG TACCCGCAGC AGAAGATGCA
ACCACAGGTG AAACGACCAC ACTTGCGCCC AACAACCCCA CACCCGTTAC GCGCTATAGC
GCAGGTACCA ATGGCCTTGT GTACCAACAA TTGATATACC CGTTGCCGCA GTTTTCGCCC
GAACAGCTTA ACGTATTGCC GCTGTACAAC ACCTGCATTA CTGAGCTTGG GCTCGGTGAA
AAAGATTACT TAAACAGCCA GCGCTGGCAA GCTAGTGTTA CAGGCGGCCT TTCTGCTTTT
AGCTCAATTC GCGGCGATGT TAACGACCCT CATAAATTAA CCAGTTATTA CGCGTACTCA
GGTAAAGCAC TTAATCGCAA CCAAGCAGCG CTTAGTGATG TATTAAAAGC ATTAATAGAA
ACGCCACGCT TTGACGAAAC TGGCCGCATT GCCGAATTGG TATCGCAAAT ACGCGCCCAC
CGCGAGCAAA GCGTTACCGG CAGTGGCCAC AGCTTAGCCA TGGCTGCAGC ATCTTCTGGC
ATTTGCGCTG CGGCTAACTT GCAACATAAC TGGTCTGGCT TAGCCGGCAT TAAAGCGCTT
AAACAGCTGG ACGACGGTTT GCGGTCCGAA GGCGGCCTAG AAGCGTTATC AAATTTATTT
ACCAGTATTC ACGAATTAAT TACTGCGTCG CAATTTCAAG CTTTGCTTAT TGGTGAGCAT
GCCCATTTAG ACGAGATGCA AGCCCAATTA GCAGCCCGTT TTACAAGCAA TACCCAAACC
CCAGCTACAG GCTTTGACCT TGGCAAACAA GTTCGCCAAG TTAACGATGC GTGGTTAACC
GATTCACAAG TAAACTTTTG TTCTAAAGCC TACGCCACAG TTCCTATGGA CCACCCAGAC
GCGGCCGCAC TTGTTGTGCT CGCCGGCGTA ATGCGCAATA GTTTTTTACA CAAAGCCATT
CGCGAGCAAG GCGGCGCATA CGGTGGTGGT GCAAGCCAAG ATAGCACTAT TGGCGCTTTC
CGCTTTTACT CCTATCGCGA CCCGCGCTTA GAAGAAACCC TTGCAGACTT TGACCGCAGT
GTAGAGTGGG CAATAACCGA GAATCACGAT AGACAAAAAG TAGAGGAAGC CATTTTAGGT
ACTATTGGCT CATTAGATCG CTCCGAGTCC CCTGCGGGTA AAGCAAAACG CTGTTTTTAT
CAAGAATTAC ACGGGCGCCC ACCCGAAGTA CGTCAACGCT TCCGCGAGCG AATTTTAGCC
GTTGAAGCCA GTGATTTGCG CCGCGTAGCG GAACTGTATC TAAAACCCGA CAATGCACAC
ATTGCAGTTG TGACGGATTT CAGTAAAAAA GACAAAATGG AAGCAGCGGG CCTTACAATC
CAAACGCTGT AA
 
Protein sequence
MTASADSVTN TPAHHKSFNY IRSQYVASLN LTVQEFEHKV TGAKHYHLAA DNQENVFLVA 
LRTVPQDSRG VAHILEHTAL CGSDKYPVRD PFFMMTRRSL NTFMNAFTSS DWTAYPFASM
NSKDYNNLLD VYLDAVFFSR LDPLDFAQEG HRLEFETADD PSSKLVYKGV VFNEMKGAMS
SVPSQLWQTL TKYLFPNNTY HHNSGGDPEA IPDLTYEQLI TFYKTHYHPS NAIFMTYGDK
PAAELQEKFE TQALCKFERL DKEISVPDEK RYYAPVKVQE SYPYTQADAA DKHHVVLGWL
LGKSTNLQEA LQAQLLSSVL MDNSATPLMH ALESTELGSS PSPICGLDDS QKELTFICGL
EGCQEGSADD VEQLIESTLR KVAEEGIPQE DVESALHQLE LHQREVGGDS YPFGLQLIMT
ALTAATHRGD PVKLLDVDAA LDQLREDIKQ PNFIKTLIEK WLLNNPHKVR LTLSPNAELQ
ARKEQAELEK LAQIQANLTE EDKQNIVELA QALKKRQDQI DDESILPKVT LSDVPAAEDA
TTGETTTLAP NNPTPVTRYS AGTNGLVYQQ LIYPLPQFSP EQLNVLPLYN TCITELGLGE
KDYLNSQRWQ ASVTGGLSAF SSIRGDVNDP HKLTSYYAYS GKALNRNQAA LSDVLKALIE
TPRFDETGRI AELVSQIRAH REQSVTGSGH SLAMAAASSG ICAAANLQHN WSGLAGIKAL
KQLDDGLRSE GGLEALSNLF TSIHELITAS QFQALLIGEH AHLDEMQAQL AARFTSNTQT
PATGFDLGKQ VRQVNDAWLT DSQVNFCSKA YATVPMDHPD AAALVVLAGV MRNSFLHKAI
REQGGAYGGG ASQDSTIGAF RFYSYRDPRL EETLADFDRS VEWAITENHD RQKVEEAILG
TIGSLDRSES PAGKAKRCFY QELHGRPPEV RQRFRERILA VEASDLRRVA ELYLKPDNAH
IAVVTDFSKK DKMEAAGLTI QTL