Gene Information |
Locus tag | Sde_3862 |
Symbol | |
ID | 3967029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 4866601 |
End bp | 4868349 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637922959 |
Product | hypothetical protein |
Protein accession | YP_529329 |
Protein GI | 90023502 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
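
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the 1749 bp CDS includes its stop codon. Both readings are assumptions rather than something the record states. A minimal Python sketch of that consistency check, using the hypothetical helper names `span_length` and `expected_protein_length`:

```python
# Values copied from the Gene Information fields above.
START_BP = 4866601
END_BP = 4868349
GENE_LENGTH_BP = 1749
PROTEIN_LENGTH_AA = 582


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values from this record, 4868349 - 4866601 + 1 gives 1749 bp, and 1749 / 3 - 1 gives 582 aa, matching the listed gene and protein lengths.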
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00590592 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.508788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTAC TAAATAACAA AACCCTTCGT GCTATTTCTG TATTGTGTAG TTTAACGTTT GTGCCCTGTG CTATTGCCGA GGTGGCGCCG CTTTGGTTGC AGTTTGAGGC CGCAAAGGCG AATGGCGATG AGCCAACCTT GCCGGACTTT TCTTATGCGG GTTACGACTA TTCCGAGAGC GAGCTGCCAG ACATTTCTAG TTGGGCCGTA TTTAATGTGA CGGACTACGG GGCAATAGCA AACGACGAAA ACTACGATGA CCAAGCCATA CAGCTGGCAA TAGATGCCGC ACAAAACGCT GGTGGCGGTG TGGTTATGTT CCCCGCTGGC AGGTTTTTAG TTAGCCCTAA CGAAACGGTG GGCGAGAATA TTTTTATTCG CGCTAGCAAC ATAGTGCTAA AAGGCGCGGG CGCTGGTGAT AACGGAACAG AAATATTTAA AGTAAATAAA AAAGTAAATA ACGGCGAGTA TATTTTTGAG GTTTCGCCAA CAAGCACTGG CGAATCGGTG ATTACAACTG TAGTAGCAAA TGCACAGCGC GAAAGTTTCG AAATAGTAGT TGCCGATGCA TCACAACTGA GTGTGGGCCA GCGTATTTTA TTGCGAGCAG ATAGCGTGGA ATTAGCGCAA TCGTATTATT CCCCGCTTAC CATTCGCAGC GAATGGACGC GATTACTAAA CGACGGCTTT AACCTGCGAG AAATTCACTC TATTGCCGCA ATTAACGGCA ACACCGTGCG CCTTAGAGAA CCACTACACA TAAGCCTAAC TATAGGTTCT ACGCCCATTC AGGTGCGTAG TTACAACATG ATAAATAACA TAGGTATAGA AGATATTCGC TTTAAAGGTA ATTGGGATTC CTACCCCGAA GACTTCGATC ACCATAAAGA CGATATTCAC GATTACGCAT GGAACGCACT AAGGTTAGAT AACGTAGAAA ACGGCTGGAT ACAAAACGTA GAATTTAAAG ATTGGAACCA AGGTATTTAT ATAGACGGCT CTGCAGCACT TACCCTGCGT AATATTACCT TTACCGGTAA GAAGGGGCAT ATGTCTATTC ATACGCGGCG CTCTTACGGT GTATTAATAA AAGATAGTGC CGACCACGCA GGCCATCACC ACGGGCCGGG TGTTGGCTAC TGGGGTTGCG GCACTGTGTA CCTGCGTTAC CAAATGTTAG CTGGCCAAAA TATAGATAGC CATTCGGGCA GCCCCTACGC AACGCTGTTT GATAATGTAA CTAACGGTCA CCTTTCTAAC AACGGCGGCC CACACGAAAG TTACCCGCAC CACGGTAAAC ATTTAGTCGC ATGGAATATG ACTTTAGAAG GCGGGCCAGA TAGCTATAAT TTTTGGTCGG CGTCTCGCAA CGGCCACACC TTCGCCATGC CGTATTTTAT TGGCTTGCAG GGTAAAAGCG TTACCTTTAC CGAAGGCACC TATAGCGCTA ATGAATTGCT AGGGCAAATG GCCGAGCCTG CTTCTTTATT TGAAGCGCAA CTTGCCTTGC GCCTTGGTAG CACTGCGCCC GAGCAACCAG AAGAAACAGA AGAACCAGAA CAAGAGCCAG AGGGTGAAAC CGGTGGCGAG CAACAACCAA CAGAAGAAAC CCCTACACCA CAACCCAATC AGCCAAACGC CAGCAACACA TCGGGTGGTG GAGCCATAGG TGGGCTATTG CTCACATTGT TGATGGTGCT GGCAGCTACT ACCAAAATTA ATTACATAAC GCGCCGCACA GGTAATTAA
|
Protein sequence | MRLLNNKTLR AISVLCSLTF VPCAIAEVAP LWLQFEAAKA NGDEPTLPDF SYAGYDYSES ELPDISSWAV FNVTDYGAIA NDENYDDQAI QLAIDAAQNA GGGVVMFPAG RFLVSPNETV GENIFIRASN IVLKGAGAGD NGTEIFKVNK KVNNGEYIFE VSPTSTGESV ITTVVANAQR ESFEIVVADA SQLSVGQRIL LRADSVELAQ SYYSPLTIRS EWTRLLNDGF NLREIHSIAA INGNTVRLRE PLHISLTIGS TPIQVRSYNM INNIGIEDIR FKGNWDSYPE DFDHHKDDIH DYAWNALRLD NVENGWIQNV EFKDWNQGIY IDGSAALTLR NITFTGKKGH MSIHTRRSYG VLIKDSADHA GHHHGPGVGY WGCGTVYLRY QMLAGQNIDS HSGSPYATLF DNVTNGHLSN NGGPHESYPH HGKHLVAWNM TLEGGPDSYN FWSASRNGHT FAMPYFIGLQ GKSVTFTEGT YSANELLGQM AEPASLFEAQ LALRLGSTAP EQPEETEEPE QEPEGETGGE QQPTEETPTP QPNQPNASNT SGGGAIGGLL LTLLMVLAAT TKINYITRRT GN
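
The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that blocking and compute simple statistics such as the GC fraction. The function names are hypothetical, and the database may have derived its 48% GC figure differently (for example, with different rounding), so the sketch is not guaranteed to reproduce it. Because the gene lies on the minus strand and the displayed sequence begins with ATG and ends with TAA, it appears to be shown in coding orientation. On that assumption, `reverse_complement` would give the reference-strand orientation.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Complement map for unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # First three display blocks of the gene sequence, used only as a short example.
    example = clean_sequence("ATGCGCCTAC TAAATAACAA AACCCTTCGT")
    print(len(example))
    print(round(gc_fraction(example), 3))
    print(reverse_complement(example))
```

To check the whole record, pass the full "Gene sequence" field through `clean_sequence` and compare its length with the Gene Length field.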