Gene Sde_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3775 
Symbol 
ID3966830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4775707 
End bp4777329 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content47% 
IMG OID637922872 
Producthelix-turn-helix, AraC type 
Protein accessionYP_529242 
Protein GI90023415 
COG category[S] Function unknown 
COG ID[COG4104] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCGG CGCGACTCGG TGATATAGAT ACCGGTCACC CGCCCTCGCC CCCAACGCCC 
ATTATTAATG GCAGCACCAA CGTACTTATC AATTCTAGGC CGGCCGCGCG CAAAGGCGAC
ATGCTGGTAC CGCACCACCC TGGTATTCGC AAAATTTCCG AAGGCTCTAG CAGCGTGCTT
ATTAATGGCA AGCCTGCTGC GCGCATGCTC GATGGGGTAA ACTGCGGCGG TAAAATTATT
ATTGGTTCTG GCAATGTATT TATTGGTGAT AACCCTAAAA CCGGTGGTGG TGGCGGGGTA
ACCAGCAATA TAAAAGTAGA GCAAGAGTTT GATGAATATA TCGACTCGAA ATTCAAACCA
GAAAACCAAA AGCTCACCGA CTACCAGTGG CGCCAACTAG AAGCCGATTT AGCGGTAAAA
TACCGCGGCA GCGCAGGTGC CGTAGCTACC TGGGCAGAAT ATTATCAAGA GGCAATACCC
GAAGGCCCAC CCCAATCTGC CGCCGAAGCC GAAGGCTTAA AAATAGCCCA AGCACTTAAT
AGCGCAAGCG AGCTAGATGA GCCCGAAGCA GCCGTATACA GCGAAGAAGT GCAAGCCCAA
GTAGACAAAG CCGGCCAAGC GCTGGCCCAA GCCGCCGCCC AACTCCCCGA AGGCGAAATG
ATCACCCCCG AAATGGTACA CGTGGCCGAG CAAGCGCTGG CAGCTACTGG GTATGTAGAG
CAGCCGCACT CGCACAATCA TCAAGCCAGC AAGGTTTCTG ATTTGGCCAG CCGTAAAGGC
GTATCACCTA CATCGCTAGA TGATGCCGCT AACCGCTTAC AAAGCATGGG TCTAGAAATA
AAAGAAAAAG GCTATCAACC CAAATACTCC GACGCAGAGT TAATAGCGCA AGCTAAGGCA
GGAGACGTTG CTAAAGAGCG ATTCCATGTG CGGTTTATGG AGGTGCGCCA TCAGTGGAGC
CGAGAAGATG CCGTTAAGAG CCAAGACAAC TTAACCGGCT TGCTCGGCCT GCCATTGCAA
GGCAAAACGG GCGAGGGCGC AAAATACTGG TCGACAACCT TCGACCAAAT AGAAGATGCC
GATACAGATG CAGAGCTTAT CTGCGGCATT TTGGGTTTAG ATTATAAAAA AGATGCCAAC
TATATGATGG TTGTTGTCGA CACAGAAAAA GCGGCCCCGA TTACTGGCGT AGCAAGCGTA
TCTGCAACAT TTGAAAATGT CAGCGAATTC GCCAATAGGG AACTGCCAGA TGAGTTTCCT
AAGGACTTTA CTGACTTAAC CATGAATGAT GAATATCAGA AAAAATATAA CGAGCTATTT
TCTGCTGCGA TTCAAGAGGG TGTTTTTGAA GACAAGTGGA AACCTAAAGA TGAGGAGCTT
TCTAGCTTTC TCAAAAGTAG AGGTGTCGAT GATGATAACG TGGGTGTATT GGTTAACAGG
CTGAAAATGC ACAGAATTAT AGGCAATAAT CAGTATTATG AGGGTAATGG ATTAACTCAA
AATAAGAATG AAAAAGCTGG TAAAGAGTAT GGTGTGGTAG AAACATTAAA TTTCGAAAGA
AAAAAAATAG ATCTACAAAA ACTAAAAAAT AGTGGTGCAA TTAAAATAAT AGCAATAGGT
TAA
 
Protein sequence
MPAARLGDID TGHPPSPPTP IINGSTNVLI NSRPAARKGD MLVPHHPGIR KISEGSSSVL 
INGKPAARML DGVNCGGKII IGSGNVFIGD NPKTGGGGGV TSNIKVEQEF DEYIDSKFKP
ENQKLTDYQW RQLEADLAVK YRGSAGAVAT WAEYYQEAIP EGPPQSAAEA EGLKIAQALN
SASELDEPEA AVYSEEVQAQ VDKAGQALAQ AAAQLPEGEM ITPEMVHVAE QALAATGYVE
QPHSHNHQAS KVSDLASRKG VSPTSLDDAA NRLQSMGLEI KEKGYQPKYS DAELIAQAKA
GDVAKERFHV RFMEVRHQWS REDAVKSQDN LTGLLGLPLQ GKTGEGAKYW STTFDQIEDA
DTDAELICGI LGLDYKKDAN YMMVVVDTEK AAPITGVASV SATFENVSEF ANRELPDEFP
KDFTDLTMND EYQKKYNELF SAAIQEGVFE DKWKPKDEEL SSFLKSRGVD DDNVGVLVNR
LKMHRIIGNN QYYEGNGLTQ NKNEKAGKEY GVVETLNFER KKIDLQKLKN SGAIKIIAIG