Gene Sde_3548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3548 
Symbol 
ID3966410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4509412 
End bp4512354 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content53% 
IMG OID637922645 
Producttype II secretion system protein C 
Protein accessionYP_529015 
Protein GI90023188 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT ACTGGCTCTA CACTTTCACC TCTGCCTGCG TGCTAGCTGC ACTTGGCGGC 
TGTGATGCGA GCAACGGCGA TTTACAGGGT GGCAGCTCCA GCGGCCAAGA CCCCGACCCA
GTGGTAGTGG ATTTCCCTAT TGCTTATGTC GAACGCCCCC TGCCGCGCGA CGATGAGTCT
GGCATGCTAC TAGCCGACAA CGTATTGGAC CCAGCCGCCT TTAAGCCAGG CGCAAAACTT
ATTATTAAAG ACCGCGCCGC AGTGGCAGCC AGTGAGATAG TACTTACCGC AGGCGTTTTT
GGCAGCGCCA ATACAGATGG CGACGAACCA CTCTACGATG TAAAAGATCT AGAAACGTCG
GACGATGGCT TAAAACTTAT ATTTGCCATG CGCGCACCCG CCGATGAAGA TGCCGACGAA
GATGAACAAC CCACTTGGAA TATTTGGGAA TACGACCGCG AAACCGACGT TTTGCGTCGT
ATTATTCAAT CGGATATCGC CGCAGAAGCG GGCGAAGATG TGGCACCTTA CTATTTACCC
GATGGCCGCA TAGTTTTTTC CTCCACCCGC CAACGCCGCA GCAAAGCTAT ATTGCTGGAT
GAAAACAAAC CTCAGTTTGC AGCGCTTACC GAAGACAGAG AGCAAGAAGC ATTTCTACTG
CACGTAATGG ACGACGACGG CGCTAATATC GAACAGCTTA CATTTAACCA AAGCCACGAC
TTGCACCCAA CCGTGCTCGA TAACGGCCAA ATTCTATTTT TGCGCTGGGA TAATTTCGAT
AATCGCGGCG GTATGGATCG ACTCAGCCTG TACACCATTA ACCCCGACGG CACCAACCCC
GCCCTTGCGT ACGGCTACCA TTCACAACTT ACTGGCACCA ACAGTACTGA AGGCGTTTTT
AACCAACCTC GCCAGCTACA AAACGGCAAG GTGCTAGTGA ATTTACGTCC GCGCGAATTC
GATCAACTAG GCGGCGACTT AGTTGAAATA GACATAGACG GCTATACAGA CCTCACCCAA
GCTGCTGGTA GCAACATGGG TGGCGTCGGC CCTGCGCAGG AAGTGGCCAC CAACGCCCCT
GTGCACACCG ACGGCACACC CTCGCCCCAC GGCTTTTTTA GTAGCGCCTA CCCCATGGCC
GACAACACCG GCCGCTACTT AGTAAGCTGG AGCCCATGCG TGGTGCGCGG CTACCGTTTT
GGCACCTATG TAAATGCTTC GCTGCAACTC ATCGACGTCG CCGGCGACTT TGTTAATCGC
GATGGCGAAC TGCTTGCAGA AGGCCAAACG GCCATCACAA TCGAAGAAGA CGAAGTAGGC
ACCTTCCCCT GTAACGATCG CACACTTGCC AGTACCGCTG TCGTTTTAGC CGAGCCCATG
TACGGCTTAT GGGTGTACGA CCCCGTTATC GCCACCCAAT CGCCCGTAGA TTTAGCTACC
CCAAATCGCA TGGTTACCGA AGCGGTTATT ATGGAGCCGG TGACACCCGC CACCCACCTA
GAAACCACAT TGCGCGACCA AGATAGAGCC GCACTGGTAG AGCAAGATGT AGGCGTTGTG
CATATTCACA GCGTTTACGA CTTAGACGGC GAAGACACCT CCCCCGCAGG CATTATCGCC
ACTGCCGACC CCATGCAAGT CGCCCCCGAC GAGCGCCCTG CACGCTTTCT GCGTATATTA
AAAGCGGTAT CGCTGCCCGA TGACGACGTA TTCGACTTTA ACCTAGGCGT TGCAGACGGG
GCGGGTAACT TCCGCATGAA AGATATTCTC GGCTATGTAC CGGTGGAGCC AGATGGCTCG
GCCATGTTTA AAGTGCCCGC CGATGTGGCC GTTACTTTTT CGGTGGTCAA TGCCGAGGGC
AAGCGTATCT CCGAGCGTCA CCAAAACTGG TTTACTGTGC GCGCAGGCGA GGTGCGCCAA
TGTACAGGTT GCCACGTGCG CGACTCCGAA GTGCCCCACG GCAACGCAGC CAAAGGCTTA
GACTCTATCA ATATGGGGGC GCTCTCTAGC AGCCACTTCC CCAACACCGT ATTGCTAGAC
ACCCTCGACC CCATGAACCC GCTGCCGCAA TTGCCGCCCA ATGTGGGCGA AACCATGGCG
GCCTACTACG CGCGTATAAA CAGCAGCAGC TCCTCCACGG TAGACGGCGC ACGCACACCA
TCGGTAAATA TTTACTTTGA AGATGAATGG ACAGATGAAA CCACGGGGTT AACCAAGGCA
GATATAATCG ATTTCAGCTA CAACGATTTA AACTCGCCAG CGCCAACCAC CAATTCCGCT
TGCATCAATA ACTGGAACAG CCTCTGCCGC GTAGTGATTA ACTACGAAGA TCATATTCAC
CCCATCTGGG AAGCTTCGCG AATGATAGAC AGAGGAATGG GCGAGGAGAA CTTTACCTGT
ACCACATGCC ACAGCGCAGT AGACGAAGCC GGGCTACAAA AATTGCCCGC TGGTGAGCGC
CAGTTAGAGT TAACCGGCGA ACTATCACAG GTTAACAATA ATTATATGGT GTCCTACGCT
GAACTATTCT TAGCTAGCCC AGTGTACGAA CTCAACCCAG AAACAGGCAT ACTGCAACCT
GAAACAGAAC ATAAACAAGT TGACGGCGTA TTCGTGTACT ATGCGAGACA AACCATCGTA
GACGAAGACG GCAATGAAGA AGTTATCGAT ATTGAAGCGC CGTTAGACAC CGAGTTTGAA
TTAGTTTTAG ATGAAGAAGG CCAACCCATA CCGGTTATAG TAAATACCGA TCGCACCTTT
GGACGCATCA TGGTGCCGGG CCGAGCCCTA TCAAGCCAAG TGTTTTTCAA CACGTTTTCG
CAAGCAGGGG CAACGGTTGA TCACCGCGGG CTATTGAGTG GCGCCGAACT AAAACTTATA
TCAGAATGGC TGGATATAGG TGCACAGTAT TACAGCAACC CATTTGATGC ACCGCCCGAT
TGA
 
Protein sequence
MNKYWLYTFT SACVLAALGG CDASNGDLQG GSSSGQDPDP VVVDFPIAYV ERPLPRDDES 
GMLLADNVLD PAAFKPGAKL IIKDRAAVAA SEIVLTAGVF GSANTDGDEP LYDVKDLETS
DDGLKLIFAM RAPADEDADE DEQPTWNIWE YDRETDVLRR IIQSDIAAEA GEDVAPYYLP
DGRIVFSSTR QRRSKAILLD ENKPQFAALT EDREQEAFLL HVMDDDGANI EQLTFNQSHD
LHPTVLDNGQ ILFLRWDNFD NRGGMDRLSL YTINPDGTNP ALAYGYHSQL TGTNSTEGVF
NQPRQLQNGK VLVNLRPREF DQLGGDLVEI DIDGYTDLTQ AAGSNMGGVG PAQEVATNAP
VHTDGTPSPH GFFSSAYPMA DNTGRYLVSW SPCVVRGYRF GTYVNASLQL IDVAGDFVNR
DGELLAEGQT AITIEEDEVG TFPCNDRTLA STAVVLAEPM YGLWVYDPVI ATQSPVDLAT
PNRMVTEAVI MEPVTPATHL ETTLRDQDRA ALVEQDVGVV HIHSVYDLDG EDTSPAGIIA
TADPMQVAPD ERPARFLRIL KAVSLPDDDV FDFNLGVADG AGNFRMKDIL GYVPVEPDGS
AMFKVPADVA VTFSVVNAEG KRISERHQNW FTVRAGEVRQ CTGCHVRDSE VPHGNAAKGL
DSINMGALSS SHFPNTVLLD TLDPMNPLPQ LPPNVGETMA AYYARINSSS SSTVDGARTP
SVNIYFEDEW TDETTGLTKA DIIDFSYNDL NSPAPTTNSA CINNWNSLCR VVINYEDHIH
PIWEASRMID RGMGEENFTC TTCHSAVDEA GLQKLPAGER QLELTGELSQ VNNNYMVSYA
ELFLASPVYE LNPETGILQP ETEHKQVDGV FVYYARQTIV DEDGNEEVID IEAPLDTEFE
LVLDEEGQPI PVIVNTDRTF GRIMVPGRAL SSQVFFNTFS QAGATVDHRG LLSGAELKLI
SEWLDIGAQY YSNPFDAPPD