Gene Sde_1569 details


Gene Information

Locus tag: Sde_1569
Symbol:
ID: 3965097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2012760
End bp: 2016101
Gene Length: 3342 bp
Protein Length: 1113 aa
Translation table: 11
GC content: 49%
IMG OID: 637920647
Product: helix-turn-helix, AraC type
Protein accession: YP_527043
Protein GI: 90021216
COG category: [R] General function prediction only
COG ID: [COG3401] Fibronectin type 3 domain-containing protein
TIGRFAM ID:
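
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2012760, 2016101)
print(length_bp)                  # 3342
print(protein_length(length_bp))  # 1113
```

Both values agree with the Gene Length and Protein Length fields of this record.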


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGGC CTAACCCGTT TTATGGCCAC ATGGCCAATA TAGATTGCAC GGCTAACTAC 
CTACCATATG CCAAGAACAT ACTATTGGAC TTGGATATGA CACACGCCAT TCGCCCTATT
TCGTTATTAA GTATTTTGCT TTGCCTGCCG CTATGGCTAA CAGCCTGTGG CGGTGAGCCT
GTTGGCGGGG TAGATAACGC CCCATTAGTA ACAACCGAAA CACCGGAAAA CAACCAGCCG
AGTACTGGTG AGAGTACTGG TGAGCCAGCC ACCGAGACCA ACACACAAGA AGAAGTCGAC
CCGCCTCAAC GACCGCCGGG TGATATTATT TTGGGTAAAA CTACCTATAC GGGCCAATGT
GCCATGTGCC ACGGCGAAGC GGGTAACGGG GCTTTCCCAT TATTTATAAA CGCCGTTGAG
TTTGAAGCAA TAGAAGATGT GACCCATCGC ACCATGCCGT TTGGCGACGC AAGTGCCTGT
GAGGGAGAGT GCGCTACCAA CGTTGCAGCC TATTTGGTTA GCCTAATTAG CATTAGCCTT
CCCGAAACAC CAGCCGTAGA GCCACCAGAT AATTCTACCG AGACCGAGAC CGAGACCGAG
ACCGAGACCG AGACCGAGAC CGAGACCGAG ACCGAGACCG AGACCGAGAC CGAGACCGAG
ACCGAGAATG AAAACAACAG CGGTGACCCC AGTCAACCTG TTACACCAGC TTTACCCGCA
CAATGCGCTA ATCCGCAGTG GGTTAGCGGC ACTGAATATG CTGTAAACAC CCTCGTTGTT
AACCGCGAAG CCGCGTTCCG TTGTGTGATA GCTGGCTGGT GCGCAAGCAC GGCCAGCTGG
GCCTACGAGC CCAACAGCGG CCTGTATTGG GAGGAAGCCT GGGAGAAAGT CGCACTTTGC
AGCGAAAGCA CCAGTAACGG CAACACCTCT GGTGAGCCAA CCACACCCAA TACCGATGGC
GAAGAGCAAA CAGAAAATAA CGGTGCTGAA ACTGGAACTG AAAACGAGAC TGAGACTGAA
ACGGAACCCA ACGAAAACGC CTCTGCCCCT AAAGCAGTAA GCACAGTGAT AGCCGTCGCT
TCAGCCAATC AAGATCGCAT AGGCCTAAGC TGGACAGACA ACGCAAACAA CGAAACAGGT
TTTGCAATTC GTCGCAGAAC CAATAACGGC GCTGTAGTTC AAATACACAA CGCCCCTGCC
AACTCTACAA GCTATACAGA TGCAAATGTA AAGCTAGATA ACCAATATCA ATACGACATT
ATCGCGTTTA ATGCCGTAGG CTCGAGCGCT GCGGTAGAAA GCAATCAAGT ATCGCTAGTA
ACACCCGTTA CGGCACCTAA TGCAATCACC AACCTAAACG CTACATTAAA TAACAATCAA
ATATTTTTAA GCTGGAGCCA AGCAGACGCA ACTGCAGACA CCATCGCTAT TTATCGCACG
CTAGACGATA TTCAATGGCA GCAGCTTACT ACCGTAGCCG CTAACACCAC CCAATATAAC
GATACCAATA TCACAACCAA CACCACCTAT GGTTACCGGC TAGTTGCCAT AAATACTGCG
GGGGAATCGC AAGCCAGCAA TACAGTGCAT GTATCTGTTA CCCCTATTGC AGCTGGGCAA
ACGCTATTTA ACCAGCATTG TGCGGCCTGC CATAGCGCGT CTGGGATTGG CGGTGATTTA
TTTTCAAGCC AAACACAAAC TGCATGGCTG AATAAAACAT TAAGCCAGTT AGAAACAAAA
ATTTCGACCA TGCCTGCGCA GCAATGCGAC GCCAACTGCC AAAAAGTGGT TGCCGATTTT
ATTTGGCAAG ATAAATGGAA CCGCGTGGTA GATATTATTG AAGAGAAAAT TACCAGCTCT
GGCGTACGCG GGGTGCGCTT ACTTACGCCA TACGAATATG CCAACACAAT AAAAGCGACG
CTAAACGTTA CCGTTAACAG CGAAGACCTA CCAAGCGCGC GCTTCGATAG CCACTTTAAA
TACCCAAGCC AATCGTCGCA GGGCCTAGTG CTTACCGATG AAGCCGCAGC CTACCAACAA
CTCGCGCAAG ATATTGCAAG TAAAACCACC ATAACCAAGC ACACCTGCAG CACAGCAAGC
TGCAGACAAA ATGCGGTAAA CAAGTTGGGG TTAACACTTT TTCGCAAGCC ACTTACCGCT
GCACAAACGG CAAGCTACAG TACATTATTT GAAACCAATG GTTTTGAATC TGTGATTACC
AGTATGCTTA TGTCCCCCTA CTTTTTGTAC CTAACAGAGT TAGGCAAGTG GGACGCGCAA
ACCGAAAGCT ACCGTTTAAG TAACTACGAA ATTGCTACGC ATCTATCATT TAGCCTTTGG
GGTATGCCAC CCAACAGCAC ACTGCTAAAT CTTGCAGCCA CAGGCACGTT CAGTAGTGAT
GAGGGTGTAA AAAACCAAGC GCAAACAATG GTGAATGACG CAAGGTTCGC TGCACACGTA
AGCGAATTTA TACGCTACTA CGCCAACACC TATAGTGTTG TGGACGAAAA ACCAGGTTTA
TCGACCGATG TAATAGCCGC TATGCAACAG GAGCAAACAG AGGCGATACA CTATTTAATT
AACGCAGGTT CTGCTTCATT TTACGAGCTA CTCAACCCAA GCTACACCTA CTTAAACAGC
ACACTCGCCA ACCATTACGG TTTAACGTCT ACTGCGCAAA ACGCTGGTAG CAGTATGCAA
AAATATAACG TTAATTCTCT GCGCGGCGGC CTTATGCATC AAGGTATATT CCAGGTTAGC
AATTCAGATT TTAGTGCAAC CTCACTGGTT AAACGCGGCA AGTTTATTCG CGAAAACATG
CTCTGCCATA TGATGGGCAC CCCCTCGGGC GTAGACCCAG ACACCATCAC GCTACCAGAG
CACCCAATAA CAACTCGCGA ACGCTGGGAT GTAATAACCG GCCCCAATGC AAGTGACGGC
CAATGCTGGC AGTGCCACCA ATTAATGAAC GAGCCGGGTA GCGCGCTAGA AAATTACGAT
CACGCTGGCC GCTACCGCAC AGAAGAATCC GCAGCGAATG ACAGCAGGGT TGCACTAACC
ATAGATGCTA GCGGTATTCT GCGCGATAAC TCAGGCTTTA ATACGCTAAC CCAATACGCA
GATGCACGAG CACTTAGCGA ATACCTTGCA GTTTCAGAGC AGGCACTAAG CTGTTTTGTT
GATAACGCCT ACCGGTTTAC CACCGGCCAA CAAACGGATG CACAAAGCGA AAATGCAATT
AACGCACTGC AACAAGATTT TATTATTGAC GGCGATATAA AAACACTATT TATAGAGCTC
GCAAGTAGCC CTGCCGCCCT GTACCGATCT GATAGAGATT AA
 
Protein sequence
MPRPNPFYGH MANIDCTANY LPYAKNILLD LDMTHAIRPI SLLSILLCLP LWLTACGGEP 
VGGVDNAPLV TTETPENNQP STGESTGEPA TETNTQEEVD PPQRPPGDII LGKTTYTGQC
AMCHGEAGNG AFPLFINAVE FEAIEDVTHR TMPFGDASAC EGECATNVAA YLVSLISISL
PETPAVEPPD NSTETETETE TETETETETE TETETETETE TENENNSGDP SQPVTPALPA
QCANPQWVSG TEYAVNTLVV NREAAFRCVI AGWCASTASW AYEPNSGLYW EEAWEKVALC
SESTSNGNTS GEPTTPNTDG EEQTENNGAE TGTENETETE TEPNENASAP KAVSTVIAVA
SANQDRIGLS WTDNANNETG FAIRRRTNNG AVVQIHNAPA NSTSYTDANV KLDNQYQYDI
IAFNAVGSSA AVESNQVSLV TPVTAPNAIT NLNATLNNNQ IFLSWSQADA TADTIAIYRT
LDDIQWQQLT TVAANTTQYN DTNITTNTTY GYRLVAINTA GESQASNTVH VSVTPIAAGQ
TLFNQHCAAC HSASGIGGDL FSSQTQTAWL NKTLSQLETK ISTMPAQQCD ANCQKVVADF
IWQDKWNRVV DIIEEKITSS GVRGVRLLTP YEYANTIKAT LNVTVNSEDL PSARFDSHFK
YPSQSSQGLV LTDEAAAYQQ LAQDIASKTT ITKHTCSTAS CRQNAVNKLG LTLFRKPLTA
AQTASYSTLF ETNGFESVIT SMLMSPYFLY LTELGKWDAQ TESYRLSNYE IATHLSFSLW
GMPPNSTLLN LAATGTFSSD EGVKNQAQTM VNDARFAAHV SEFIRYYANT YSVVDEKPGL
STDVIAAMQQ EQTEAIHYLI NAGSASFYEL LNPSYTYLNS TLANHYGLTS TAQNAGSSMQ
KYNVNSLRGG LMHQGIFQVS NSDFSATSLV KRGKFIRENM LCHMMGTPSG VDPDTITLPE
HPITTRERWD VITGPNASDG QCWQCHQLMN EPGSALENYD HAGRYRTEES AANDSRVALT
IDASGILRDN SGFNTLTQYA DARALSEYLA VSEQALSCFV DNAYRFTTGQ QTDAQSENAI
NALQQDFIID GDIKTLFIEL ASSPAALYRS DRD
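
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration, not the tool used to build this record: it assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. It translates the first 60 bases and the last 12 bases of the gene sequence as printed.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_60 = "ATGCCACGGC CTAACCCGTT TTATGGCCAC ATGGCCAATA TAGATTGCAC GGCTAACTAC"
last_12 = "GATAGAGATT AA"

print(translate(first_60))  # MPRPNPFYGHMANIDCTANY
print(translate(last_12))   # DRD*
```

The two outputs match the start (MPRPNPFYGH MANIDCTANY) and the end (DRD) of the protein sequence, with `*` marking the terminal stop codon. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.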