Gene Information |
Locus tag | Sde_2801 |
Symbol | |
ID | 3968280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3534296 |
End bp | 3536311 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637921898 |
Product | dystroglycan-type cadherin-like |
Protein accession | YP_528270 |
Protein GI | 90022443 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
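The reported lengths can be cross-checked against the coordinates in the table above. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3534296, 3536311)
print(length, protein_length(length))  # 2016 671
```

Under these assumptions the result agrees with the Gene Length (2016 bp) and Protein Length (671 aa) fields.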
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000577586 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000325565 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGATGA AGACACAGGC ATTGGCGCTC TCAGCGCTAT GTATGGCTAT GGTCGGCTGC GGTGGCAGCA GCAGTAGTGA CAATGGCGGT GAAGCCGATA TTCGCTTTCC TTTTGATGGC GAGATAACTT TATCTGGCGA CACCGATCCA GGTGCCACCC TAACCGCAGT CGTAACAGAT GCTAATGGCA TCTCAGGCTC TATTTCCTAC GTTTGGACTT CCGGCGATAC TGTTATTACC AGTGCGACAG GCTCTACTTA TGTAATTGCC GATTCCGACC AAGGTAATAG CATAAGCGTA ACGGCTACTT ATACCGATGA CGATAACTTT GATGAGTCTG TTTTTGCTTC CGTTACAATT GAAGCAGCAG CTACTCCGGC GACATTTGCA GGTTTAACTG CCACTGTTAG CAATGAAGCA ACAGAAGCGC TTACTGGTAC TGTTACTGTT ACCGACCCCA ATACTGGTGA AGCGACAATT GTAGCGCTAA CCGACGCTAT GACTACATAC GGCACTTTTT CTATTACTGC GGAAGGCAGT TGGTCATACA CTCTAGACAC TTCAGCAGAC GCCGTTGCGA ACCTAACCTC TACTGACGAT CCGTTATTAG ACTCTATCGA GCTTGAGTCG GCTGACGGTA CAACTGCAAA CTTGGTTATC ACAATCACTG GTGCAGAGGT TTCCGGCCCA GTTACAAGCC AAGTGGCTCG TATCACTGAT AACTCAACCG ACGATACTGG TGAGCTACGA TATGCACTAC CTTCTGCGCA GCTAGCGGGT AAAATCACTG TATCTTTCTT GAAAGATCTA GATACTTTAG GCTCTGATGA CACCATTAAG GATGCTTACA TCACCTTGTA TAATACTGAT ACAAGCACAA GTGGTGGCAA AGCGATTCTT GATTTACGTA TTCAAGATGA CAATTTTGCA ATTCGTGATC AAGATGGCAT TGATGTGATG AATGCTTTCA CTCCAGGTCA ATGGCAGGAT GTAGAGATTA CTTGGGAAGC CGCTGATGAT GCCTCTGCTC CTGTGCTGAA CATCCTTATC GATGGCGTAG CCGTTACTTC CGTACCTTAC ACTGGTTCTT CAACTGCAAT AGGTGGTGTT ACACACGTTG CCTTTAGATT CGCAGATAAC TCAAGAACAG TAACGGGTAC TTACAATATA GATAACCTAT TCATCTATTC AGATACAGCT GGCACTGCTT TGGTGTTCTC TGATGACTTT GAAGGTTATA GTGTTGATGA TTCGCTTGAT ACTGACAACG CGAACTCGCC TTATAACTCA AGTACTTTCG AAGCTGTTGT AGCGGTTATG GAAGTTCCTG GGGATGACTC CGGTTCGGGT GGCTCTGGCG ATACATCTGG CCCTGGCACC GCTGGCAATA AGTATGCAGA AATTATTGAT ACAAGCACCG ATGACACTGG TGAATTGAGA TATGCTCTGC CTGCAGCGCA GTTGGCGGGT AAATTAAATG TATCTTTCCT CAAAGACCTT GATACTTTAG GGTCTGACGA TACGATTAAA GATGCTTATA TCACTCTGTA CAACACTGAT ACTAGTACTA GTGGTGGTAA AGCAATTCTT GATTTACGTA TTCAAGATGA CAACTTTGCA ATACGTGACC AAGACGGCAT CGATGTGATG AATGCCTTTA CACCAGGTCA GTGGCAGGAT GTTGAAGTAA CATGGGAAGC TGCTGATGCT TCGTCTGCTC CTGTGTTGAA TATTCTTATC GATGGTGTTG CGGTTACTTC GGTTCCTTAT ACCGGTTCAG CTACGGCTGT TGGTGGTGTT 
ACCCATATCG CATTCCGATT TGCGGATAAC TCCAGAACAG TAACTGGTAC TTTTAATGTT GATGATATTA AAATCTACTC TGATACTGCT GGTACCGCGT TAGTATTTGA AGATAGCTTT GAGAGTGGTT ACAACACTGG TGATTCGCTA GATACTGATA ATGGTTCTTC ACCTTATCAC TCAGCTACTT CTGAAGCTGT TGTTGCCGAG GAATAA
|
Protein sequence | MKMKTQALAL SALCMAMVGC GGSSSSDNGG EADIRFPFDG EITLSGDTDP GATLTAVVTD ANGISGSISY VWTSGDTVIT SATGSTYVIA DSDQGNSISV TATYTDDDNF DESVFASVTI EAAATPATFA GLTATVSNEA TEALTGTVTV TDPNTGEATI VALTDAMTTY GTFSITAEGS WSYTLDTSAD AVANLTSTDD PLLDSIELES ADGTTANLVI TITGAEVSGP VTSQVARITD NSTDDTGELR YALPSAQLAG KITVSFLKDL DTLGSDDTIK DAYITLYNTD TSTSGGKAIL DLRIQDDNFA IRDQDGIDVM NAFTPGQWQD VEITWEAADD ASAPVLNILI DGVAVTSVPY TGSSTAIGGV THVAFRFADN SRTVTGTYNI DNLFIYSDTA GTALVFSDDF EGYSVDDSLD TDNANSPYNS STFEAVVAVM EVPGDDSGSG GSGDTSGPGT AGNKYAEIID TSTDDTGELR YALPAAQLAG KLNVSFLKDL DTLGSDDTIK DAYITLYNTD TSTSGGKAIL DLRIQDDNFA IRDQDGIDVM NAFTPGQWQD VEVTWEAADA SSAPVLNILI DGVAVTSVPY TGSATAVGGV THIAFRFADN SRTVTGTFNV DDIKIYSDTA GTALVFEDSF ESGYNTGDSL DTDNGSSPYH SATSEAVVAE E
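The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), it strips the spaces that separate the 10-base groups, and it uses the elongation codon assignments of translation table 11, which are the same as the standard code. Alternative start codons, which table 11 allows, are not handled because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGAAGATGA AGACACAGGC ATTGGCGCTC"
print(translate(prefix))    # MKMKTQALAL
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence would give the value to compare with the GC content field.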