Gene Sde_3397 details


Gene Information

Locus tag: Sde_3397
Symbol:
ID: 3966123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4330475
End bp: 4332679
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 45%
IMG OID: 637922494
Product: putative non-ribosomal peptide synthetase
Protein accession: YP_528864
Protein GI: 90023037
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
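
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in Protein Length; the record does not state either convention, so both are labeled as assumptions.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4330475
end_bp = 4332679
gene_length_bp = 2205
protein_length_aa = 734

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and does not contribute a residue.
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.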


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.804151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACG CCACTGTAGA ATCTTCTGGT TTGCTCGCGA GCGCATCCGC ACAACAAAGC 
TTAACACCTG CACAATTAGG TATTTGGTTG GGGCAGCAAC GTTTAGGGCG CAGCGGGCTT
TACAATGCTG CAGAAGTGTT GTGGTTTGAA GGTGAGCTTA ATATTAATTT GTTTAGCCGC
GCTATTAGCT ACGTATTTGG TCAAGCAGAA GGTTTGCATC AGCGTTTTGT TTTAATGGGC
GAGCGGGCTG TACAAGAGGG GCATTATTCT CCCGAGCATT GTCGTGCGGA AGTAGTCGAT
TTAAGTAAGC AAGGGTTAAG CCTAACTAAA ACACGTGAAA TTATGCTAAG TGAAGCGAGG
CTAGATCAAT CTAAGTGGTT TGATTTGGCC GAGCAGGTGC CGTTTAAAAA TGTAGTGTAT
AAATTATCCA ATAAGCACAC CGCATGGTAT ATGCGAATTC ATCATATTGC CTGCGACGGC
TATGGCTTTG CACTGTTAAG CCAAAAGGTG GTTGATGTAT ATAACGCTTG GCATCAGCAG
GCAAAACCTA CCGCCAATAC GAATGGTGCC AAACAAGTCG CCGGTAGTCA TAAGTTGTTT
GGTGATTATT CCAAGGTGGT GAACGAAGAG CTCGGTTATG TAAAGTCTGA TAGGCACGTT
CAAGCAGAAA AATTCTGGCA AGACTACTTA GAAAACGTAC CTACACCAAT TAGCCTAGCC
GAACGCGACG CGCCCTACGA TTGTTTGCCA ACCAAAGCAG AAGCAGTTAT TCAACGCGAA
GCATTTAATC AATTAAAAAA GCTTGCTTTG CAGCTCGGTA CAACCTGGCC AGAAATAATT
TACGCACTGG TTGCAGCGCA ACTATTTAGA AAAACGGGTG CCACCGAAAT AGTACTTGGC
TTACCCGTAA TGAATCGCTT GGGTCGAGCA AGTATTAATG TACCGGCGAT GGTAATGAAT
ATAATCCCTT TGCGCATAAG TATCGCGCGA GGCGGTAGCC TAAAAAAAAT AATTACAACT
GTGCAGGCAC AAATTAAACA GACTCGGCCC CATCAACAAT ACAGATACGA AGATATTCGT
GCGCGCAGGC AGGCGAATAA TTTAATTGGC CGAGTGTATA GCGCTGTTGT TAATGTTATG
CCTTTTGATA GGCCACTAAC AATGCTTAAT TGCTCTGTTA CTACGCAAAC GCTAGCGGCT
GGCCCCGTAG AAGATTTAGC CTTCTCTGCA GTGGCATTAA GTGATGGCGG CTTGCGGTTG
GCGGTTGAAG GCAACGCCCA TTTATATTCA GAGCCTGCGC TGCAGGCGCT ACTTTCTGAG
TTGAGTAAAA CGTTTTGCGA GCTGTGCGAA GATGTTGCAA ATCAAACTAG TGAGCAATAC
TTACTGGCCG AGCAAGTAAT AGATATAAAC AACTTGTCGT TAATTAGAGG TGATGCGCTT
CCTTTCGTAC CTAGCTCTGT ATTGCACCTA TTCTTACAAC AGGTTGAAGC CAATCCAAAC
CACGCCGCAC TTGTATGTGG GGCTGTGGAA CTAAGCTACC AACAGCTGGC CAATAAAATA
GCGGTAGCAG CGTTCGGCTT GGTTAATTTA GGTGTAAATG AAGGGGCTGT AGTTGCGATT
GAATTGCCGC GAGGTATTGA AGCGATAATA ACGTCGTTTG CTTGCTTAAT ACTTAACGCT
TGTTATGTGT TTGTTGATCC TAGCGGCCCG CAGGCGAGAA ATTCGCGCAT AGTGTCGGAT
GCCACCCCAA GTTTAATTGT GTGTGACGAT GCACAAGTAG AAATTGTAGA CGTTAAAGCT
AATTCAGATA AAATCGGAAA TACTGCAACA CGTTATGGTA TGCGCGCGGC TTGTTACACG
AGTTTATTGC AACATAACCA ACCTGCACCC GACTATATAA ACACACTATG CTCGCAAGTG
GACGCCGATG CCAGTGCCTA TATGGTTTAC ACATCGGGTT CTACCGGTAA CCCCAAAGGG
GTGATTATTT CTAATCGCGC ATTGGCCGAA TATGTTTGTT CTGCAATTTC TCGCTACAAC
ATCACTTCGG CAGATAGGGT ATTGCACTTT GCACCCTTAC ACTTTGATGC CAGTGTAGAA
GAGATATTTT GCAGCCTTTG TGCCGGTGCA TCATTAATTA TTCGTAGTGA AGATATGGCG
CAAAGTTTTG AAATATTTGA GCGCGAGTGT AACCAAAAAA AATAA
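
The GC content field can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming the sequence is pasted in the same layout (10-base blocks separated by spaces and line breaks). `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool. Only the first line of the gene sequence is used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample input: the first line of the gene sequence in this record.
fragment = clean_sequence(
    "ATGAGTAACG CCACTGTAGA ATCTTCTGGT TTGCTCGCGA GCGCATCCGC ACAACAAAGC"
)

print(len(fragment), "bases in the fragment")
print(f"GC fraction of the fragment: {gc_fraction(fragment):.2f}")
```

To compare against the GC content listed in the record, pass the full gene sequence to `clean_sequence` instead of the single line used here.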
 
Protein sequence
MSNATVESSG LLASASAQQS LTPAQLGIWL GQQRLGRSGL YNAAEVLWFE GELNINLFSR 
AISYVFGQAE GLHQRFVLMG ERAVQEGHYS PEHCRAEVVD LSKQGLSLTK TREIMLSEAR
LDQSKWFDLA EQVPFKNVVY KLSNKHTAWY MRIHHIACDG YGFALLSQKV VDVYNAWHQQ
AKPTANTNGA KQVAGSHKLF GDYSKVVNEE LGYVKSDRHV QAEKFWQDYL ENVPTPISLA
ERDAPYDCLP TKAEAVIQRE AFNQLKKLAL QLGTTWPEII YALVAAQLFR KTGATEIVLG
LPVMNRLGRA SINVPAMVMN IIPLRISIAR GGSLKKIITT VQAQIKQTRP HQQYRYEDIR
ARRQANNLIG RVYSAVVNVM PFDRPLTMLN CSVTTQTLAA GPVEDLAFSA VALSDGGLRL
AVEGNAHLYS EPALQALLSE LSKTFCELCE DVANQTSEQY LLAEQVIDIN NLSLIRGDAL
PFVPSSVLHL FLQQVEANPN HAALVCGAVE LSYQQLANKI AVAAFGLVNL GVNEGAVVAI
ELPRGIEAII TSFACLILNA CYVFVDPSGP QARNSRIVSD ATPSLIVCDD AQVEIVDVKA
NSDKIGNTAT RYGMRAACYT SLLQHNQPAP DYINTLCSQV DADASAYMVY TSGSTGNPKG
VIISNRALAE YVCSAISRYN ITSADRVLHF APLHFDASVE EIFCSLCAGA SLIIRSEDMA
QSFEIFEREC NQKK
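
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. This is a minimal sketch in plain Python, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons). `translate` is a hypothetical helper name, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGTAACG CCACTGTAGA ATCTTCTGGT"))

# Last line of the gene sequence; it begins at base 2161, so it is in frame.
print(translate("CAAAGTTTTG AAATATTTGA GCGCGAGTGT AACCAAAAAA AATAA"))
```

The two fragments translate to the first ten and last fourteen residues of the protein sequence listed above, with the final TAA codon read as a stop.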