Gene Sde_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1970 
Symbol 
ID3967135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2477202 
End bp2480264 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content43% 
IMG OID637921058 
ProductTnpA family transposase 
Protein accessionYP_527442 
Protein GI90021615 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACACCA AAAAGATCAG TTCCACGGAT AAAGAATTCC ACGATACACC GCCGCAATTC 
AAAGCTTTAG ACCGCAAGCG TTTTTTCTAC GTCGAATCTC ATTTCAAACA GCTCATCACA
CAAAACGTTC GCGGCCATGC CAATATCGTT TACATCATCG TCGCCTACGG GTACTTCAAA
GCCACGGGTC AATTTTTTAA TTCTGCGATC CAGGAAGATG TTGACTATGT AGCAGGCCGA
CTGAAGCTCA GGCAAAGTTT TCGCTGGGAA AGCTACAATA CAAGCACCAG AAATCGCCAT
AGAGAAATGA TTCTTGATGC GCTGGGTTTT AAGCCATTTA ATCAAACCAC CATTCAACCA
CTGTTGCCAC TGATAAGAAC GGACGCACGC TCTCAGAAAA ACCCCGATAA ATGCTTTATT
TCCGTTTGCG AATGGCTCTT TGAACACAAA GTCGAGACAC CCGATTATAA AACAATTTGT
GACACGATCG AGTCGGTCTA TAAAACGCAC ATCGACCAAC AGATTGCCAT TGTTAAAAAT
AAGCTCTCGA ACGATGGAGC TCAAAAACTC GATGCGTTAT TTTCAAAAGG AAAAAATACT
TATGATGAAA TGGAAACGTA CAGACTTACG CTGCTGAAGA AACTTAACCA GTCAACCAAG
ACGTCTAAAA TACGCGAGAA TGTCGCAATA TACGATACGC TCAAGCCACT CTACGATATT
GCACTTCCAA TCTTAAGCGA GCTGGATTTT ACAAAAGATG GACTTAAACG CTACGCACTA
TCCGTTCATC GGCGGCAGGT TTTTCAGGTT AATCGTTTGA AAGATCATGA TCGGCATCTT
CACCTTGCCA TTTTTATCAC TCACCAATTT CAACAGCTGG TCGATATTCT GGTCGAAACG
TTTCTGGCCT CGGTGAAAAC AGCAGTGAAC AAGGCCGAGA ATCTTGCCAA AGAAGAATAC
TACAAACAAC GCCAGGAGCA ATCTGGCCAT ACGCAAGCGT TAGTAAAGGA TACGGTTGAT
TTAGTGTCGT TGGTGGAAAA ACTTAAAGAG ACGCTGAACA ACCCTATTCT GAGTGATTCA
GAGAAAGTCG AAAAAGCGAT TCGCTTGATA AGCCCGTCAA AGATATCCAC GGAAAACGTA
AACGCTCATG TAAAAGATGT CCAGGAGGAC TTAGATAAAA TCAGCGGACA GGCTCTTGAA
ATGCTCTTTT TGGAAGAAGG CGCCACGTCG CTTCAGCTCA AGTGTAGCGA TATTCTTCGG
CGGCTCAATT TCGATTGCGA CGTGAAAAGC AACAAGCTGC TAAAAGCAAT ATCGCAGTAC
CAAGCGAAAA ACGGAAAGGT TGATGATACA TTCCCAGTCG GGTTTCTTAC GAATTCAGAA
ACCGGCTACG TTGACACGAA AGACTCCTTC CGCGCGAAAC TGTATCGCAT TCTGCTTTAT
AAGCACACCG CCGCCGCCAT TAAAGATGGC ACACTGAGCA TTGTTAACTC AAATCGTTAC
CGACAGCTAG AGCAGTATTT GATTTCGCGG GAATACTTTC AAAAAAACCG AGATCACATA
CTAAATTTAG CGGACATGGC TGATTTTAAA GATGTTAAAA AATTGGTTGA GCACTGGGAA
CTTGAATTGG ATGCGCAATA CAAGGAGACG AACGAGCACA TACTGGAAAA TTTAAACGAA
TATATTGAGA CAGACGGTCT TGGCGGATTT AAGCTAACGT ACGAACGAAA TACTCGGGCA
GAAGTCCTGC TAGAAATGGA AAGTGACGTC GCTCTGTTTC CGGATGATCA ATACATTCGA
ATAGCAGAAG CGCTAAGCAC AGTGAATTCC GCTTCAGGGT TTCTTGACGA GTTTGAACAC
AACAGTATCC GATACCGTAA GCCCAGACCT GAAGATAAAA ATTTCCTGGC GGGAATAATT
GCGCTAGGCG AACACTTGAG CGTGCCAAAA TTATCTAAAT TGGCGAGAGA GATTGAACTG
GCAACTTTGG AATCCACGAC CAATGGTTTT TTTACGCTTG AGAATTTGAG ACGAGCAAAT
GATGCCGTTA TTCGTTTCGT CAATCAACTT CCACTGGCCA AAGTATTCAT TGGAGACTAC
GGCTTGCAGA CGTCGAGCGA TGGTCAGAAA TGGACGATCG CATATGAATC ACTCAATGCA
AATAAATCTT TTAAATACGG CGGACGTGAC CCCGTTGTAT CGGCCTATAC GTTCATAGAT
ATACGAGGGA TGTTCCCTTA CTCAATGGTG ATCAGTGGTG CCGAATTTGA AGCGCACTAT
ATGGTGGATG GGTTATTAAA AAATGATGTA GTGAAATCAG ACCTTCACTC CACTGATTCT
CACGGATACA CGGAAGCCAT ATTTGGGCTT ACACATTTAT TGAAATTTTC TTTTGGGCCA
CGCATAAAAA ACCCCGGCAA ACGAGTTTTA TACTCCTTCA AAACTCCCAC TTTCTATAAG
AAAAAAAGCT ATCCGATCTT TCCAAAAGAG CGGATTAATA AAGATAAAAT TATTGATAAT
TGGGAAGACG TTCTTCGTTT GTGTGCATCA ATAAAGCTAG GCGAAGTTAC GGCGTCTCAA
ATCTTCAAGC GATTGAACTC ATACTCAAAA AACAATCCAC TTTACGAAGC ACTAAAAGCA
TTTGGAGGTA TTCCGAAAAC TTTGTTCTTA CTCCGGTATG CCGACGATGT GGGTATGCGA
AAAGCCATCC ACCGCCAGTT AAATAAAGGC GAGGCAGGGA ATAAGCTGGA TCGAGCTCTG
GCCATTGGAC GGGCAGATTA TGTGCAGACA ATTAAGGAAG ACCAGGAGAT CGCAGAGACC
TGCAAACGAC TACTTAAAAA TGTCATCGTT TGCTGGAATT TTATGTATCT CTCCAAACGT
CTGTCAGAAG CCAAAACAGA TGTTGAGTAT TCCCTGCTAC TAAAAAAGAT TAAGGCGTCG
TCTATGCTGG CTTGGGAACA TTTTGTCTTT CATGGCGAAT TTGATTTTTC ACAGAACTCC
CTAAAAGACT CTCAACAGAT AGATTTACAA AAAATCCTTG ATCCAGACCT GATCAAAGAG
TGA
 
Protein sequence
MYTKKISSTD KEFHDTPPQF KALDRKRFFY VESHFKQLIT QNVRGHANIV YIIVAYGYFK 
ATGQFFNSAI QEDVDYVAGR LKLRQSFRWE SYNTSTRNRH REMILDALGF KPFNQTTIQP
LLPLIRTDAR SQKNPDKCFI SVCEWLFEHK VETPDYKTIC DTIESVYKTH IDQQIAIVKN
KLSNDGAQKL DALFSKGKNT YDEMETYRLT LLKKLNQSTK TSKIRENVAI YDTLKPLYDI
ALPILSELDF TKDGLKRYAL SVHRRQVFQV NRLKDHDRHL HLAIFITHQF QQLVDILVET
FLASVKTAVN KAENLAKEEY YKQRQEQSGH TQALVKDTVD LVSLVEKLKE TLNNPILSDS
EKVEKAIRLI SPSKISTENV NAHVKDVQED LDKISGQALE MLFLEEGATS LQLKCSDILR
RLNFDCDVKS NKLLKAISQY QAKNGKVDDT FPVGFLTNSE TGYVDTKDSF RAKLYRILLY
KHTAAAIKDG TLSIVNSNRY RQLEQYLISR EYFQKNRDHI LNLADMADFK DVKKLVEHWE
LELDAQYKET NEHILENLNE YIETDGLGGF KLTYERNTRA EVLLEMESDV ALFPDDQYIR
IAEALSTVNS ASGFLDEFEH NSIRYRKPRP EDKNFLAGII ALGEHLSVPK LSKLAREIEL
ATLESTTNGF FTLENLRRAN DAVIRFVNQL PLAKVFIGDY GLQTSSDGQK WTIAYESLNA
NKSFKYGGRD PVVSAYTFID IRGMFPYSMV ISGAEFEAHY MVDGLLKNDV VKSDLHSTDS
HGYTEAIFGL THLLKFSFGP RIKNPGKRVL YSFKTPTFYK KKSYPIFPKE RINKDKIIDN
WEDVLRLCAS IKLGEVTASQ IFKRLNSYSK NNPLYEALKA FGGIPKTLFL LRYADDVGMR
KAIHRQLNKG EAGNKLDRAL AIGRADYVQT IKEDQEIAET CKRLLKNVIV CWNFMYLSKR
LSEAKTDVEY SLLLKKIKAS SMLAWEHFVF HGEFDFSQNS LKDSQQIDLQ KILDPDLIKE