Gene Sde_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3604 
Symbol 
ID3966466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4564876 
End bp4567197 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content48% 
IMG OID637922701 
Producthypothetical protein 
Protein accessionYP_529071 
Protein GI90023244 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0522431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00718988 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAGCC TGCCTCTTTA CCCCGCCAGC CCTAGCGGCG TACCTCAGGA ATTAACCAAA 
GCCAAAGCCA GTTATAAGCG CCAAGCCATG TATGCCATGG CGGGCTTGTT GGTATTTATG
CTGTTGTATA TTGCGTTAAT GGTGAGCTTT GGTTTTATTA GTTACCAAGG CTTCTTGGCG
CTAAGTAGTG TGTTCGATCT GGTTACCCTA TTTATTAGCG TTGTTTCGCT AATGCTTGCT
GTGTTTATGG CAAAATCGCT GTTTGCTGTG CGCAAATCTG GCGACCCGCG CGGTATAGAA
GTTACTGCCG AGCAAGAGCC AAAATTATTT GAATTTTTAA ATACACTCGC CGATGAAGTG
GGCGCGCCTA AACCGCACCG TGTATTTCTT ACCCCAGAAG TAAACGCCGC AGTGTTTTAC
GATTTATCGC TGTGGAATTT ACTTTTTCCC TCTAAGAAAA ATTTAATTGT TGGCTTGGGT
TTGGTTAATG TCTTAAACCT TGGCGAATTT AAAGCTGTGC TCGCCCACGA ATTTGGGCAT
TTTGCACAGG GCTCGATGAT GGTGGGGCGC TGGGTGTATA TAGCGCAACA AATTATTGGC
CATATGGTGG CTACCCGCGA CTGGTTAGAT AAAACCGTAA GTTTTATTTC GCGCATTGAT
TTACGTATAG CCTGGGTTGG GTGGTTGTTG TCGTTGGTAA TGTGGGCCAT GCGCAGTGTA
GTGGATACCC TGTTTCGCGT GGTAATTATT GCCGAGCGCG CGCTTAGCCG CGAAATGGAA
TTTAATGCCG ATTTAGTGGC GGTAAGCGTA ACCGGTAGTG ATGCGCTGGT GAATGCACTA
CACAAATTGC AAGCTGCCGA TCACGCTTGG CAAACGGCAT TGAACATAGC CGGCCGCGAA
GCTGGCTCGG GTAAATTAGT AGACGATTTA TTTTTGGCGC AACAAGAAGC CATTAACCAA
TTGCGCAGAG TTATGGCCGA CGATACCTAT GGGGCTACAC CAGAATTACC GGCAGAAAAC
CTGCGCCCTG CGCACCGCGT GTTTGATGCC GATTCTGCGC GGCCACCGCA AATGTGGGCA
ACACACCCTG CAAACCGCGA CCGAGAAGAT AATGCCAAAA GCGTATATGT ACCCGCCGAA
ATAGATTCTC GTTCTGCGTG GTTAGTGTTT AGCGATGCGC AGGCCATAAG AAATAAAGTA
AGTGTAGATA TTTTTAACAC CGAAAAAGCC AAAGAGTGGG AAAAAACCTG CCCACAGCAA
GCTGTACAAG CGCGCTTTAG TCGCGCTTCG TATTCGCCGG AATATCGCGG TACGTATTTG
AATCGCAGTT GGGCTAGAAA TTTTGAATCG GCGGAAGAAA TTTTTTCCTT TGGTTCTGAG
AAGGCTAGCG CGAAAGAATC CTTGGCCGAT ATTTACCCGC AATCTCTGCG CGACGATTTA
GAAGCCGCGC GCAGCTTAGA TATAGAACGC TCCACCTTAC AGGCATTGGC TTCGGGTGAG
CTTAAACCTT CTGGGGGTAT TATTCGGCAC CGCGGAGAAG AGCTTAAAAA AGGGGATATT
CCCAATGCAA TTGCAGCCAT TAGCAAAGAG CGCAAAGTGG TTGCCGACAG GTTGAAATTA
CATGATGCAT GTTGCCGCCG CGCCCACCTG CAAGCCGCAA AAGAGCTGGG GCAAGATTGG
GATGTTTATT TGCAAAGCTT GGTGGCGTTA GTGCATTGCA CCGAACATTT AAGTGCAAAG
GTCGAAGACG AACTTGCGTT TATGGTAAAC ACTTGGCAAG TAATAACAGC CGACGGCCAA
ATAGGCTTTT TTGAAAAACG CCGTATGCTT AAAGCCTGTA ATGGTGTGCA GGCGGTAATG
GAGGAAGTGT CGGTTGCCTT GGCTAAAATT ACGCTGCCTG CGGCAATATT AGAAGAAATT
GGCATTCACA GCTGGAGCAA AGAATGCCCC AAGTTTGAAT TGGTGGATGT AAATAAAAAG
AACTGGGGTC AGTGGTGCCC CGCCGCTGCA GAGCAAATGG AAAATATAAA TCACGCGCTA
AGTGTGCTGA ATAATATTGC GCTCGAAACG CTTATTACTA CCGAGGCAGA ATTAAAACAG
CATATTGAGC AGGGCACCCA GCCCAGCAAG GCGCCATACC CTGGGGCTGC GCCGCGCAAT
CATCCAAAAT TAATGCCGGG TGACGAGCAT GTGCTACAGC GCAAACTCGA TTTGTGGAAT
CGCTTTCAGC TTGCGCACGG TTTAGCGCCA ACCCTTGCGC GCTTGTTGGT ATCTATGGGT
ATTGTTGGGG GTACTATATA CAGCGGTGTG ATGTTTATTT AG
 
Protein sequence
MNSLPLYPAS PSGVPQELTK AKASYKRQAM YAMAGLLVFM LLYIALMVSF GFISYQGFLA 
LSSVFDLVTL FISVVSLMLA VFMAKSLFAV RKSGDPRGIE VTAEQEPKLF EFLNTLADEV
GAPKPHRVFL TPEVNAAVFY DLSLWNLLFP SKKNLIVGLG LVNVLNLGEF KAVLAHEFGH
FAQGSMMVGR WVYIAQQIIG HMVATRDWLD KTVSFISRID LRIAWVGWLL SLVMWAMRSV
VDTLFRVVII AERALSREME FNADLVAVSV TGSDALVNAL HKLQAADHAW QTALNIAGRE
AGSGKLVDDL FLAQQEAINQ LRRVMADDTY GATPELPAEN LRPAHRVFDA DSARPPQMWA
THPANRDRED NAKSVYVPAE IDSRSAWLVF SDAQAIRNKV SVDIFNTEKA KEWEKTCPQQ
AVQARFSRAS YSPEYRGTYL NRSWARNFES AEEIFSFGSE KASAKESLAD IYPQSLRDDL
EAARSLDIER STLQALASGE LKPSGGIIRH RGEELKKGDI PNAIAAISKE RKVVADRLKL
HDACCRRAHL QAAKELGQDW DVYLQSLVAL VHCTEHLSAK VEDELAFMVN TWQVITADGQ
IGFFEKRRML KACNGVQAVM EEVSVALAKI TLPAAILEEI GIHSWSKECP KFELVDVNKK
NWGQWCPAAA EQMENINHAL SVLNNIALET LITTEAELKQ HIEQGTQPSK APYPGAAPRN
HPKLMPGDEH VLQRKLDLWN RFQLAHGLAP TLARLLVSMG IVGGTIYSGV MFI