Gene Sde_3748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3748 
Symbol 
ID3966783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4747108 
End bp4748379 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content50% 
IMG OID637922845 
Productregulatory protein, ArsR 
Protein accessionYP_529215 
Protein GI90023388 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.857978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGGT CCCTCATCGT CGCCACGGCG ACCACATTGT GCCTTGTTAG TGGCTGTACT 
ACCAACTCTG TGACAGGCGA ACGCCAGTTC CATACTATGT CGTTTGAGCA ACAAGTTGCC
TTGGGTTCTG AACAGTACAA CCCCTCGCAG CAGCAGCAGG GCGGGCGCTA CGTAGTAGAC
CCAGAGCTAA ACCTGTATGT AAGTAACGTA GGCCAAAAAC TGGCAGCTTA CTCCAAGGTA
AAACTGCCCT ACGAGTTTGT GGTATTGGAT AACGATGTAC CCAACGCATG GGCGCTGCCC
GGTGGCAAAA TTGCCGTTAA CCGCGGCCTA CTTATTTTAC TTGAAGACGA AGCCCAGCTC
GCAGCGGTAT TGGGCCACGA AATTATTCAC GCCGCGGCAG AGCACGGCGC ATCGCAAATG
ATGAAAGCAC AAATGCTGGG CCTTGGTGTT GCAGCGGCCG GCCTAGCCAG CAAAGATAGC
GACTACGCCA CCTATATTGG CCTCGGCACC GCAGTAGGCG CGCAATTGTA CCAAGCCCAC
TACGGTCGCT CGCAAGAGTT AGAATCGGAC AAATATGGCA TTGAGTACAT GGTTAAGGCC
GGCTACGACC CGCAAGCCGC TGTAGAGCTA CAGCAAACTT TTGTGAAATT ATCTGAAGGT
CGCCAAAGTG ACTTATTAAG CAATTTGTTT GCTAGCCACC CACCTTCACA AGAGCGCGTA
GAACGAAATC GCGAGCTGGC GGCTAAGCAT TCAGGCGGCG TGCGCAATAA AGCCGCGTAC
CAAAAGGCCA TTAGCCAACT TAAAAAAGAT AAACCCGCCT ACGATGCGCA CAACGCCGCA
CAAGAAGCAG CAAGTAAAGA AGACTTTGCA AGCGCCCTAA GCAACGTAAA TAAAGCCATT
AAATTACAAC CCCAAGCGGC GTTGTTTTAC ATTACCAAAG GCAATGTTTT ACGCGCACAA
AAAAATAACA CCGAAGCCTT GGCGGCTTAC CAAAAGGCGC GTCAGTTAAA CCCAGATTAC
GTAATGGGGT ATTTAGGCGA AGGAATTACT GCATTTAACT TAGATAAAAA GAGTGTTGCC
AAAACGGCAC TGGAAAGCAG TATGAAGATA TTGCAAACCC CAATTGCAGC ATTCCACCTT
GGTGAAATAG CGCGCGAGAA AGGCGATAAG CAAACTGCCC TAAGCTACTA TCAATTTGCC
GCCAACGACA AAGGTGAATT GGGTCAAGCT GCGCAAGAAC GCGCTGCACA AATGCAAGGT
TTAGCTCAAT AG
 
Protein sequence
MFRSLIVATA TTLCLVSGCT TNSVTGERQF HTMSFEQQVA LGSEQYNPSQ QQQGGRYVVD 
PELNLYVSNV GQKLAAYSKV KLPYEFVVLD NDVPNAWALP GGKIAVNRGL LILLEDEAQL
AAVLGHEIIH AAAEHGASQM MKAQMLGLGV AAAGLASKDS DYATYIGLGT AVGAQLYQAH
YGRSQELESD KYGIEYMVKA GYDPQAAVEL QQTFVKLSEG RQSDLLSNLF ASHPPSQERV
ERNRELAAKH SGGVRNKAAY QKAISQLKKD KPAYDAHNAA QEAASKEDFA SALSNVNKAI
KLQPQAALFY ITKGNVLRAQ KNNTEALAAY QKARQLNPDY VMGYLGEGIT AFNLDKKSVA
KTALESSMKI LQTPIAAFHL GEIAREKGDK QTALSYYQFA ANDKGELGQA AQERAAQMQG
LAQ