Gene Sde_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2953 
Symbol 
ID3967831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3753453 
End bp3756467 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content49% 
IMG OID637922050 
Productperiplasmic protein TonB links inner and outer membranes-like 
Protein accessionYP_528422 
Protein GI90022595 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR02148] fibro-slime domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.5475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AAACCCCAAA ACTTCAAAGT GCAATTTTGG CCTGCAGCCT GCTTACTCTA 
GGCGCGTGTT CGGGTGGGGA TAACGCCGGT ACCGCGGGTG ATACAGCCCC TGGTAATAGC
CAATCCCAAA CGCCGCTTGA GTTCACTGGT GGTACGTCGT TTGTTTATGT TGAGCGCAGT
ATTGCGCAAA TGCAGCAAAA CAAGACGAAT CACTTTAATA ACAAGCTCAA TAACGACCTA
GGCACACCAA CAGATTTAAG CTCGCCTTAC GAATTTAACC CAGGCGCTAA GCTGTATCAC
CGCACTGGCT TAGAAATTGG TGCAATTGAG CTAGATGTAT TAACCGATTA TTTTGGCGGT
ACGGGCTACG ACGTTAAAGA CGTAAGCGTT TCGCAAGATG GCAATACTAT TGTATTTGCG
GCACACGGGC AGGCGGGCAA TAGCGCGCAT TCAAGCTGGA GCATTTACAC CTACGACCTA
ACCAGTCAAA CCATCAAGCG TGTTATAGCC GACGATACTC TGGCAAATGC TGGTGAAGAT
ACCAACCCAA CGTTCACACT AACTGGCGAT ATTGTATTCT CATCCGATCG CTCGGCCGGC
AACCCCAATA ACCCTACTGA TATTTTCGTT GAAGAAAATG AAAACTGCTA CAAAGTAGGG
CCAAGCGAGA AGCCTTCGCT GCTACACATA ATGAACGCCT ACGGCGAAAA TATTGTGCAG
CTAACCTATG GCAACAATCA CGATACTACA CCTACCACGC TTAAAGATGG TCGAGTAGCT
TTTGTACGCT GGTCGCGCAG CTACAAAGAA GTGCCCCAGT GTGCAACTGA AAGCAATGCT
GGGTCCGGTA AATCCAATAA CGACATATTC ACATCGTTCT TTAATACTCA GCCAGATACA
CAAGGTTTAG ATGCACCTAC CAGCTGGTTG GATTCGCAAA TGTGCGCCTA TGCCATCGAT
ACACCGGTTG GCCCAGTGTT GGCAAGCAAT CACTATTCAT TGTTGCGTAT AGACCCAGTA
CGAGGCGACC TAGAGCAACT TTATAAAACC CACACCATTA ATACTTCAGA CGAAGAGTTC
TTAACAATCG ACAATATTGT TCAGTCTGAA GATGGCCGCT TAATGGCAAT TATTAAGCAC
CATTACAACA ATGTAATGGG GGGCAGTGCG GTTGCGCTTA CCGATCCGCA TACTGAAAAT
CAAACATCTG TATTTGCTTC ATTTGCTCCG CAGCCAATTA CATCCGAAGC GTCTAATTTA
TATCCTAACC AGCTTTCTGT AGCGGGTTGG TTTAGTTCTA TTGCACCTTA TCGCGATGGT
ACCGAGCGCT TATTGTTAGC GTGGTCGCAG TGTGTGACTG TATCTGAGGG GGGGGTTTCG
TCGTTTTGTT CGAGCACAAA CGAAGACGGT GAATTGAACA GCAAGTACGG TATTTGGATG
TACGATCCAA GCACCAATAG CCGTTTACCT ATTGTGCGCG CTAAAGAAGA TAAGGTGTTA
AGTGAATTAG GCCTTGGCCG TCCGCATGTT GGTTTCGACT TTCCGTTCGA ACCCTACAGC
GATGACTTTA CCGATGATTT AGACTCAAGC CGTATTATTT GTACCGACCC AGGTATCGAC
CCGAATCCAG AGCCGGAACC TGAGCCACAG CCGGAACCTG AGCCACAGCC GGAACCTGAG
CCACAGCCAG AACCTGAGCC ACAGCCAGAA CCTGAGCCAC AGCCAGAACC TGAGCCACAG
CCGGAACCTG AGCCACAGCC AGAACCTGAG CCACAGCCAG AACCTGAGCC ACAGCCAGAA
CCTGAGCCAC AGCCGGAACC TGAGCCACAG CCAGAACCTG AGCCACAGCC AGAACCTGAG
CCACAGCCAG AACCTGAGCC ACAGCCGGAA CCTGAGCCAC AGCCAGAACC TGAGCCACAG
CCAGAACCTG AGCCACAGCC AGAACCTGAG CCACAGCCAG AACCTGAGCC TCAACCAGAA
CCTGAGCCGC AGCCAGAACC TGAGCCTCAA CCAGAACCTG AGCCGCAGCC AGAACCTGAA
CCACAGCCAG AACCTGAGCC GCAGCCAGAA CCTGAACCAC AGAACTTACC ACCTATAGCG
AATGCGGGTG CCGATCAGGT GGTTTATCAG GGTGACTTGG TTATGCTTAA CGGTAGTGCC
AGTACCGACC CTGAAGGTGC TGCGTTAACC TATCAATGGA CTATGGTGTC TGCGCCAGAG
GGAAGTGAAA GCGAGCTGGT GGATTCGTAT TTGGTGTCCC CCAATATTGT GGCGGATGTA
GCAGGTACTT ATGTATTTGA TTTAGTTGTT AACGATACCG TACACAACAG CGACGTAGAC
ACTGTAATTG TGACTACACA GCCGCAAGTG TGTGATACGT CGAATATTAC TAGCAGGTAT
ATTCCCGTTA CTTTGCGCGA CTTCCATCAA AGTCACCCAG ATTTTGAATA CAAGGTTGGG
CAAGATTACG GCATCGTTGC GCCGTACTTA GGTGAAGATC GCTTACCCGT ATATGCAAAC
GAGCATGGCA GCACGCCAAC TACTAACGGT AAAGCTACTT TTGACCAGTG GTACCGCAAC
GTTGAGGGCG TAAACATCGC TTTCGATACT AGCTTAGAAA TTACCCGCGA AGGAGAAAAC
TCCGTTTGGC GTTACAGTAA CGGAAACTTC TTCCCGCTGG ATAACCAAGG CTGGGGCAAT
ACAGAAGGTC AAGACCACAA CTTCTACTTC ACTTTGGAAA CGCACCTAGA GTTCCTATAT
GAAGGTGGCG AAGTGTTTAC CTTCCGCGGC GATGATGATT TGTGGCTCTA TATCAACGGC
AAGTTGGTTA TTGATATAGG CGGTGTGCAC TCCATGATTG AGCGCAGTAT CGATCTTGAT
AAAGCAGCTG CAGAGTTGGG CATTGAAGTA GGGCAAACCT ACAGCTTCGA CTTATTCTTT
GCGGAGCGCC ATACCACGCA GTCCAACTTC CAGATTGAAA CTAACATCAA CTTGGAATGT
ACAGATAATC GATAA
 
Protein sequence
MKKKTPKLQS AILACSLLTL GACSGGDNAG TAGDTAPGNS QSQTPLEFTG GTSFVYVERS 
IAQMQQNKTN HFNNKLNNDL GTPTDLSSPY EFNPGAKLYH RTGLEIGAIE LDVLTDYFGG
TGYDVKDVSV SQDGNTIVFA AHGQAGNSAH SSWSIYTYDL TSQTIKRVIA DDTLANAGED
TNPTFTLTGD IVFSSDRSAG NPNNPTDIFV EENENCYKVG PSEKPSLLHI MNAYGENIVQ
LTYGNNHDTT PTTLKDGRVA FVRWSRSYKE VPQCATESNA GSGKSNNDIF TSFFNTQPDT
QGLDAPTSWL DSQMCAYAID TPVGPVLASN HYSLLRIDPV RGDLEQLYKT HTINTSDEEF
LTIDNIVQSE DGRLMAIIKH HYNNVMGGSA VALTDPHTEN QTSVFASFAP QPITSEASNL
YPNQLSVAGW FSSIAPYRDG TERLLLAWSQ CVTVSEGGVS SFCSSTNEDG ELNSKYGIWM
YDPSTNSRLP IVRAKEDKVL SELGLGRPHV GFDFPFEPYS DDFTDDLDSS RIICTDPGID
PNPEPEPEPQ PEPEPQPEPE PQPEPEPQPE PEPQPEPEPQ PEPEPQPEPE PQPEPEPQPE
PEPQPEPEPQ PEPEPQPEPE PQPEPEPQPE PEPQPEPEPQ PEPEPQPEPE PQPEPEPQPE
PEPQPEPEPQ PEPEPQPEPE PQPEPEPQPE PEPQNLPPIA NAGADQVVYQ GDLVMLNGSA
STDPEGAALT YQWTMVSAPE GSESELVDSY LVSPNIVADV AGTYVFDLVV NDTVHNSDVD
TVIVTTQPQV CDTSNITSRY IPVTLRDFHQ SHPDFEYKVG QDYGIVAPYL GEDRLPVYAN
EHGSTPTTNG KATFDQWYRN VEGVNIAFDT SLEITREGEN SVWRYSNGNF FPLDNQGWGN
TEGQDHNFYF TLETHLEFLY EGGEVFTFRG DDDLWLYING KLVIDIGGVH SMIERSIDLD
KAAAELGIEV GQTYSFDLFF AERHTTQSNF QIETNINLEC TDNR