Gene Sde_1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1542 
Symbol 
ID3965070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1987808 
End bp1989745 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content48% 
IMG OID637920620 
ProductDNA topoisomerase III 
Protein accessionYP_527016 
Protein GI90021189 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTAT TTATTGCCGA AAAACCCAGC CTAGGGCGCG CTATCGCCGA CGTTTTACCA 
AAACCCCACT CGCGGGGTGA TGGTTTTATT CGGTGCGGCA ATGGCGACGT TGTTAGTTGG
TGTATAGGGC ATTTGTTAGA GCAAGCCGAG CCCGAAGCCT ATAACCCAGT GTTCAAGCAG
TGGCGTTTAG AGCATTTGCC TATCATTCCC GATCAATGGC AGCTAAAGCC AAAATCTAAA
ACCCGTAAGC AGCTAACGGC TTTGCGCAAA TTGGTGAAGG AAGCCACCTG CATTGTGCAC
GCAGGCGACC CAGATCGTGA AGGGCAGTTA TTAGTCGACC AAGTAATTGA TTTTCTTGGG
GTGAAAGGTG CAAAGCGCGA ATCCGTTCAG CGCTGTTTAA TTAACGATCT CAACCCCGCT
GCCGTTAAAC GCTCCCTCGA TACATTGCGG TCAAATAAAG AATTTATTCC ATTAGCCACG
TCGGCGCTTG CTCGTTCGCG TGCAGATTGG TTATACGGTA TTAACATGAC TCGTGCCTAT
ACCTTGCAAG GGCGTAAAGT GGGGTACGAA GGGCTGCTTT CTGTTGGGCG TGTGCAAACG
CCCATACTGG GTATGGTTGT AGAGCGCGAT AGGGCAATTG AAGAGTTTGT TTCAAAACCG
TTTTACGAAG TATTAGCACA TTTAACCACT TTAACTGGTG AGCCGACTGC ATTTACCGCC
AAATGGTTGC CAAGCGAGGC ATGCGAGCCC TATATGGATG AAGATGGGCG TGTACTTAGC
AAGAAGTTAG CCCAAAACGT TATTGCGCGC ATAACGGGTG AAAATGGCGT TGTCGAAAAA
GTAGATAAAA AAGCCAAAGC ACAAGCCGCG CCGCTTCCTT ATAATTTGTC GTCGTTACAA
ATAGATGCCG CAAAACGGTT TGGCTTTGCC GCACAAAAAG TGCTGGATAT TTGCCAAAAC
CTATACGAAA AACACAAGCT TATTACTTAC CCGCGTTCAG ATAGCCGCTA CTTGCCGGCG
GATCATTGGC ATCAAGCAAA TGACGTTTTG CAAGCAATAT CTGCCAATGT TGCCGAGCTA
GTGCAGTCGG TGAGCGGTGC AAACAGCAAG TTAAAAAGTA AAGCATGGAA TGATGCTAAA
GTGGATGCGC ACCATGCGAT TATTCCAACT GCTAAGAAAG TATCGTTTTC ATCGATAACC
AGCGAAGAGC AAAAGGTATA CAGCCTAATT GCTCGGCAAT ATGTTTGTCA GTTCTACCCC
AAATGGGAAT ATTCTGACAC GGTAATTCAC CTAAAAATAG CTGGCGGACA ATTTGTATCT
AAAGCTCGTT TAACACATAA ACAGGGTTGG AAGGTACTGT TTGAACGAAA GGGAGGTGAG
CCTAACCCTA AAGATGACGA CGAGTTTCCC TCACTTACGC TACCGGATTT ACACGTGGGG
CAGAGTGTTG CCTGCAGGCA GGGCGAGTTA CTAGAAAAGC AAACCCAACC CCCCAAATAT
TTTACCGATG CCAGCCTGCT TGCCGCTATG ACGGGCATCG CGCGTTACGT GACCGACCCC
GATATTAAGA AAATACTTAA AGATACCGAT GGTTTAGGTA CCGAAGCGAC ACGCGCAGGC
ATTATCGAAT TGCTATTTAA GCGCAATTTT CTTGTGCGAA AAGGCAAGCA AATACACTCC
ACCCCCGCAG CCAGGGGCTT GGTAGCGGCC TTGCCTAGCT CCGCCACAAC ACCAGATATG
ACCGCCCAGT GGGAATTGGT ATTAAACGCA ATTTCTATAC GCGAAGCCCA ATACACAACT
TTTATGAAGC CATTGGTTGC TAGTTTGCAA ACCCTTATTG CCGAGAGCCA GCGCAATTTG
CCAATAGGCC TTAAAGGTGT TGCCGCTAAA CCCAAGGCCT TTAAACGCAG GCGCAAGGTA
AAAGCCAAGT CTGTTTAG
 
Protein sequence
MRLFIAEKPS LGRAIADVLP KPHSRGDGFI RCGNGDVVSW CIGHLLEQAE PEAYNPVFKQ 
WRLEHLPIIP DQWQLKPKSK TRKQLTALRK LVKEATCIVH AGDPDREGQL LVDQVIDFLG
VKGAKRESVQ RCLINDLNPA AVKRSLDTLR SNKEFIPLAT SALARSRADW LYGINMTRAY
TLQGRKVGYE GLLSVGRVQT PILGMVVERD RAIEEFVSKP FYEVLAHLTT LTGEPTAFTA
KWLPSEACEP YMDEDGRVLS KKLAQNVIAR ITGENGVVEK VDKKAKAQAA PLPYNLSSLQ
IDAAKRFGFA AQKVLDICQN LYEKHKLITY PRSDSRYLPA DHWHQANDVL QAISANVAEL
VQSVSGANSK LKSKAWNDAK VDAHHAIIPT AKKVSFSSIT SEEQKVYSLI ARQYVCQFYP
KWEYSDTVIH LKIAGGQFVS KARLTHKQGW KVLFERKGGE PNPKDDDEFP SLTLPDLHVG
QSVACRQGEL LEKQTQPPKY FTDASLLAAM TGIARYVTDP DIKKILKDTD GLGTEATRAG
IIELLFKRNF LVRKGKQIHS TPAARGLVAA LPSSATTPDM TAQWELVLNA ISIREAQYTT
FMKPLVASLQ TLIAESQRNL PIGLKGVAAK PKAFKRRRKV KAKSV