Gene Sde_0031 details

Gene Information

Locus tag: Sde_0031
Symbol:
ID: 3968164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 34108
End bp: 36165
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 46%
IMG OID: 637919090
Product: oligopeptidase A
Protein accession: YP_525507
Protein GI: 90019680
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
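
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the record itself: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(34108, 36165)
print(length_bp, protein_length(length_bp))  # 2058 685
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.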


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAACC CTTTGTTAAA TCATCAGAAC TTGCCAGCAT TTTCTACTGT TAAGCCTGAA 
CATGTGGTGA ATGCAGTAGA AACAATTATT AAAGAAAATG AGCAAACTTT AGCCGAAGTG
CTTGCGCAAG AGGGTACGCC AACCTGGGAA ACCCTAGTGG CACCCTTAGA TGAAAAAAGC
GATAAGTTAG ACAAGGCTTG GGCGGTAGTG AGCCATCTAA ACTCAGTAGC TAACAGTGAT
GAGTTGCGCG AAGCTTACGA GGCGGGGGAG CAATTGCTTA CCCAGTACTA TGCCAAAATA
GGTCAAAACA AAGCGCTTTA TCAAGCCTAC GAGGCGCTGG CCCAAGAGCC ATATTTTAGC
AGCCTCAGCC AAGCCCAAAA GCAAACTGTA GAAAATGCGC TGCGCAATTT TCGCTTAGCG
GGTGTTGCTC TAGAGGCCGA ACAGCAAGCG CGTTTTACCG AAATTCAATC TGAGCTATCT
AGCCTAACTA CACAGTTTGC CAACAACGTG ATGGATGCAA CGCAAGGTTG GTACAAACAG
GTAACCGACG AAGCACACTT AGCAGGTATT CCTGCAATGG CGAAAGAAGC TGCGGCCAAT
GCAGCCAAAA AGAAAGATGT GCAAGGCTGG GTATTCACAC TTGATATCCC GTCTTACTTG
GCAGTAATAA CTCACGCCGA TAACCGAGCG CTGCGCGAAG AAATGTACCG CGCGTTTGCC
ACCCGCGCCT CGAGCGAAAG CCCAGTAGCG GATGAGCAAA AAGCAAAATG GGACAACACC
CCGCTGATAG AAAAAATTCT TGCTCTGCGC TTAGAAAAAG CGCAACTGTT AGGGTTTAAT
AATTATGCCG AAGTATCTAT TGCTCCCAAA ATGGCAGAGA GCACCGAGCA AGTAATTGGC
TTTTTAGAAG ACCTAGCCGC AAAGGCCAAA CCACAAGCCG CAAAAGAGAA AGCTACACTT
GAGCAGTTTG CTAAAACCGA ACTCGGCCTA GATGAATTAA ACGCATGGGA TGTGGCTTAC
GCGTCGGAAA AACTAAAAGA AAAAACCTTT AACGTTTCAC AAGAAGCATT GCGTGAATAT
TTTCCGCTAC AAAAAGTACA GGCGGGTATG TTTACGTTGG TAGAGAAACT ATTTGATGTG
CGTATTCAAG AAAATACCGA ATCAGACACC TACCACCCAG ATGTAAGTTA TTTCGATATT
TATCGCAACG ATACGTTAAT TGCCAGTTTT TACTGGGATT TATTCGCCCG CGAGAAAAAA
CGCGGTGGCG CGTGGATGGC CGATGGCCGC ATTCGTCGCA AAACGGCAGA GGGCTTGCAA
AAACCGGTTG CATTTTTAAC CTGTAATTTT AATGGCCCCG TTGGCGATAA GCCTGCGTTA
TTAACCCACG ACGAAGTGAC CACATTATTC CATGAGTTTG GTCACGGCTT GCACCACATG
CTTACCCAAA TAGATATCCC AGCAGTAAGC GGCATTAATG GCGTAGCGTG GGATGCCGTT
GAGTTGCCCA GTCAATTTAT GGAAAACTTC TGCTGGGAAA AAGAAGTACT GCCGTTAATT
TCTGGCCACT ACGCCACCAA CGAACCTTTG CCAGAAGACA TGCTTAATAA CATGATTGCC
GGCCGTAACT TTCAATCGGC TATGCAAATG GTACGTCAGT TAGAGTTTTC TTTATTTGAT
TTTGTGTTGC ATAAAGATTT TGGCACTGAA CAATTTAGCG ATGTGCAAAG TGTGTTAGAT
ACTATTCGCG ATAAAGTTTC GGTAATTGTT CCACCCAGCT TTAATAAATT CCAAAACAGT
TTTACACATA TTTTTGCCGG TGGTTATGCG GCGGGTTATT ACAGCTATAA GTGGGCAGAG
GTGTTATCGG CCGATGCGTT CGCGGCGTTT GAAGAAGAAG GTGTACTTAA CGAAGCAACC
GGCAAGCGCT TTTTGCACAG TATATTAGAG CAGGGCGGTT CACAGCAGGC GATGGTATTG
TTTGAGAATT TCCGCGGTAG AAAACCCAGT GTGGACCCGC TTTTGCGCCA TTCTGGCATT
GTAGAGGTAG CAGCGTGA
 
Protein sequence
MSNPLLNHQN LPAFSTVKPE HVVNAVETII KENEQTLAEV LAQEGTPTWE TLVAPLDEKS 
DKLDKAWAVV SHLNSVANSD ELREAYEAGE QLLTQYYAKI GQNKALYQAY EALAQEPYFS
SLSQAQKQTV ENALRNFRLA GVALEAEQQA RFTEIQSELS SLTTQFANNV MDATQGWYKQ
VTDEAHLAGI PAMAKEAAAN AAKKKDVQGW VFTLDIPSYL AVITHADNRA LREEMYRAFA
TRASSESPVA DEQKAKWDNT PLIEKILALR LEKAQLLGFN NYAEVSIAPK MAESTEQVIG
FLEDLAAKAK PQAAKEKATL EQFAKTELGL DELNAWDVAY ASEKLKEKTF NVSQEALREY
FPLQKVQAGM FTLVEKLFDV RIQENTESDT YHPDVSYFDI YRNDTLIASF YWDLFAREKK
RGGAWMADGR IRRKTAEGLQ KPVAFLTCNF NGPVGDKPAL LTHDEVTTLF HEFGHGLHHM
LTQIDIPAVS GINGVAWDAV ELPSQFMENF CWEKEVLPLI SGHYATNEPL PEDMLNNMIA
GRNFQSAMQM VRQLEFSLFD FVLHKDFGTE QFSDVQSVLD TIRDKVSVIV PPSFNKFQNS
FTHIFAGGYA AGYYSYKWAE VLSADAFAAF EEEGVLNEAT GKRFLHSILE QGGSQQAMVL
FENFRGRKPS VDPLLRHSGI VEVAA
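
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is one possible way to reproduce that translation and is not taken from the source database. It assumes the sequence is pasted in the 10-base blocks shown on this page, builds the codon table from the NCBI amino-acid string for table 11 (bases ordered T, C, A, G), and stops at the first stop codon. The name `translate` is a hypothetical helper.

```python
from typing import Dict, List

# NCBI ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11 in that order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE: Dict[str, str] = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (block spacing, line breaks) is removed first. Alternative
    start codons are not special-cased, which is sufficient here because
    the gene begins with ATG.
    """
    seq = "".join(dna.split()).upper()
    residues: List[str] = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence on this page.
print(translate("ATGTCTAACC CTTTGTTAAA TCATCAGAAC"))  # MSNPLLNHQN
```

Passing the full gene sequence to `translate` should yield a string that can be compared with the protein sequence after removing its block spacing.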