Gene Sde_2992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2992 
Symbol 
ID3967753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3805391 
End bp3808339 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content46% 
IMG OID637922089 
Producthelix-turn-helix, AraC type 
Protein accessionYP_528461 
Protein GI90022634 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0557584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000421127 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAACA TATTTCATTT AAAGAAAGCC GCGTTTATTT TTTGCGCTTT AACCAGCTGT 
TATAATTCGA CAGCGTCATC TCAACAAGAC GATTACAGTG TTACAGCTAC TGTATCTACC
GAATTTAATC CCATGAGCTC GAGTTGGTAT ACCAACCCGT GGCCCGAATC GGATATTCCT
CGCAGATTAG AACAACTAAC ACCTAGCGTT ATTACTCAGC TTGGGCAAAC GTCTGGTACG
TTATTAGAGG TGGACCCTTC CACAACGTAT CAAACGTTGC TTGGTTTAGG TGCGTCGCTA
GAGCATACAA CGGTTTACGC TATTCGAAAA AACAAAACAG CAGAACAACA AAAAGAAGTA
TTGCGTTCAC TTATCGACCC TGTGCAGGGC ATGGGAATGA ATTTTTTTCG TGTATCAATA
GGCACGTCAG ATTTTGCAGA TGGAACACGC GCAATACCAG CGCCGGATAA TGCGAAGGGG
TGGTATTCCT ATCAAGATAC ACCCACGTCG CCGTTTTCTA TTGCTCGCGA CGAAAGCTTA
GGCATTATTG AAACTATTCG TATGGCGGTA GAGGTTGGCG TAGAAACAAA TAATGAATTA
AAAATTCTCG CTTCCCCATG GAGCCCGCCG CGCTGGATGC GTGAAGGCGA TAACATGGTA
GATGGCGGCC CGCTTAAAGC GGATATGCTC GATGACTACG CGGCCTATTT GCGTAAATTT
GTAGAAGCTT ATCAAGCAGA GGGTATTCCC ATTTATGCCT TAAGCATGCA AAACGAACGT
CAGTTCGAAC CAGGGGCTTA CCCCGGCATG GTTATAACAT GGCAAATGGA GCGCGACCTA
CTTATAGAGG TATACGAAAA CTTTCACAAT ATTGATGGCA ACTATGGCCC AGAGCTTGAT
GTAAAACTGT GGACACTAGA CCATAATTTT GATTATTGGC AGCAAGCGAA ATTGCAGTTG
GATTCGTTCA AAGCAATGGG CAAAGACCAT TACGTAGATG CCACTGCATT CCATCACTAC
GGTGGTGTGT CTGAAAATAT GGGGCAGCTG CACGATGCTC ACCCAGATAA AGACGTGGTG
TTTACCGAAG GTACTATTTG GGGGTTAAGT TCAGATGGTA ATAAGCGCAG CTACGAAGCA
CTTATACGTC ATTTTCGCAA TTGGGCTACT GGCTATCTTT CGTGGGTAAC AATGACAACC
CAAACTCTAA ACGAAGCAAA CCAAGGGCCA TATAACGGCT TAGGTGCATT CGATCCCACG
CTATTGGTTA AATATGATGG CGACAACGCC AATTGGTATA AAACGCCAGA ATATTGGTTG
ATGAGTCAAT TCAGTAAATA CCTAAAGCCC GGCGCCCTGC GTATAGAAAG TAATTACGGT
TCGTTGCAAA CCGTTACCAA TGTTGCGTTT TTGAACCCTG ATGGCTATGT CGTTTTAATT
GTGGCTAATT CCACCAACGG CGTGCAGCAA TTTGATGTGA TTAGTGAGGG GAATCAATTT
AATGCATCGG TACCTGCGCG TTCGATTGCA ACCTATCGCT GGAAAGCGGG GTTGGGGCAA
AGCCCGCACT CGTGGCAAGC ACCGCCCGAA TTGCCTCAGT TCCCATACGC AGAAAACTCG
ATTGAAATTC CTGGGTTGGT AGAGGCTGAG CATTACGACC TTGGTGGCGC GGGTGCAGCG
TACGCAGATG TAAGCACTGG TAACAACGGC GGCGTATTGC GCGCAGATGA TGTGGATATA
GAGGCAAACG CAAACGGATA TCACATTGGT TGGTTAGATG CCGGCGAGTG GCTTGAGTAT
TCCGTTAATG TAAACCAAGG GCAAAGCTTT GATGTGTTAA TTGCTAGTGC GTCTGCAAGT
AGCGGCGGGC AGTTTCATTT TGAAGTAAAC GGCGAGTCGG TTTCTCCAAT ATTAGTAACA
CCTGCAACTG GTGGTTGGAA AACGTTTGTA AGCACGCTTC ACCGTGGGCT ACAGCTGAAT
GCAGGTGAGC AAGTGTTGCG CTTGGTAATA GATGGTGGCG AATTTAATAT TGATTCATTT
CACATTGTGC CAGCAGGGAG CATGGAGCCA CCTGTGCAAG AGGATATTTG CGAAGAGGCG
ACGGTAAGTA TCCTAGCAGG AAAAATCCAA GCGGAAAGTT ATTGTCTAGC AAGCGGTATT
CAAACAGAAA ATACTAGCGA CCAAGGCGGT GGCGAAAATA TTGGTTGGAT AGACGCTGGT
GATTCGGTGG ACTACGGCGT GAGCGTGGCG AATGCGGGTA GTTACACTCT TAATTTACGC
GTAGCAAGTC AAAATGGAGG TGGAGAAATT GCGCTAAGTG TTGGTGATAC TGTTTTGGCA
AATGTGCAAA TACCAGGCAC AGGCGGTTGG CAGAATTGGC AAACTATTAG CGTGCCTGTA
CAGCTCTCTG CTGGGTATCA ACAGCTACAT TTTGTTTTTA TTAATGGTGG CTTAAATATT
AATTGGTTTG AATTTGTAGA AACTGATAAT CCAACAGATC CAACAGATCC AACAGATCCC
GCAGCAGAGC TAGAAGAGGG TGCCTACTAC ATAATTAATG AGGCTTCTGG CAAGGCGCTA
GATGTATCTG GTGTATCTAC CAGTAACGGT ACCAATGTTC AGCAGTGGTC ATACAGCGGC
GGTTTGAATC AGCAGTGGAT TGCCCAGCAC GTAAGTGGTA ATACATTTGA GCTGGTTAGT
TTAAACAGTG GCTCTTGTTT AGATGCAGAT AATGGAAGTG ATAATGCACA CCAGTGGGCT
TGCGAAGGCA ACACCAACCA GCAGTGGGTT ATTGAAGGGC AATCGGACGG CACTTATTTA
ATTCGTACCA AAGCCGGTAA CGAAGTATTG GAGGTGCAAG GTGGCAGCGC TAACAACGGT
GCAAATGTGC GTACTGCCAG CTCAGTAAAT AATAATCGTC AGAAGTGGCG GTTTAATGAT
GTTGAGTAG
 
Protein sequence
MENIFHLKKA AFIFCALTSC YNSTASSQQD DYSVTATVST EFNPMSSSWY TNPWPESDIP 
RRLEQLTPSV ITQLGQTSGT LLEVDPSTTY QTLLGLGASL EHTTVYAIRK NKTAEQQKEV
LRSLIDPVQG MGMNFFRVSI GTSDFADGTR AIPAPDNAKG WYSYQDTPTS PFSIARDESL
GIIETIRMAV EVGVETNNEL KILASPWSPP RWMREGDNMV DGGPLKADML DDYAAYLRKF
VEAYQAEGIP IYALSMQNER QFEPGAYPGM VITWQMERDL LIEVYENFHN IDGNYGPELD
VKLWTLDHNF DYWQQAKLQL DSFKAMGKDH YVDATAFHHY GGVSENMGQL HDAHPDKDVV
FTEGTIWGLS SDGNKRSYEA LIRHFRNWAT GYLSWVTMTT QTLNEANQGP YNGLGAFDPT
LLVKYDGDNA NWYKTPEYWL MSQFSKYLKP GALRIESNYG SLQTVTNVAF LNPDGYVVLI
VANSTNGVQQ FDVISEGNQF NASVPARSIA TYRWKAGLGQ SPHSWQAPPE LPQFPYAENS
IEIPGLVEAE HYDLGGAGAA YADVSTGNNG GVLRADDVDI EANANGYHIG WLDAGEWLEY
SVNVNQGQSF DVLIASASAS SGGQFHFEVN GESVSPILVT PATGGWKTFV STLHRGLQLN
AGEQVLRLVI DGGEFNIDSF HIVPAGSMEP PVQEDICEEA TVSILAGKIQ AESYCLASGI
QTENTSDQGG GENIGWIDAG DSVDYGVSVA NAGSYTLNLR VASQNGGGEI ALSVGDTVLA
NVQIPGTGGW QNWQTISVPV QLSAGYQQLH FVFINGGLNI NWFEFVETDN PTDPTDPTDP
AAELEEGAYY IINEASGKAL DVSGVSTSNG TNVQQWSYSG GLNQQWIAQH VSGNTFELVS
LNSGSCLDAD NGSDNAHQWA CEGNTNQQWV IEGQSDGTYL IRTKAGNEVL EVQGGSANNG
ANVRTASSVN NNRQKWRFND VE