Gene Sde_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2801 
Symbol 
ID3968280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3534296 
End bp3536311 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content45% 
IMG OID637921898 
Productdystroglycan-type cadherin-like 
Protein accessionYP_528270 
Protein GI90022443 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000577586 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000325565 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGATGA AGACACAGGC ATTGGCGCTC TCAGCGCTAT GTATGGCTAT GGTCGGCTGC 
GGTGGCAGCA GCAGTAGTGA CAATGGCGGT GAAGCCGATA TTCGCTTTCC TTTTGATGGC
GAGATAACTT TATCTGGCGA CACCGATCCA GGTGCCACCC TAACCGCAGT CGTAACAGAT
GCTAATGGCA TCTCAGGCTC TATTTCCTAC GTTTGGACTT CCGGCGATAC TGTTATTACC
AGTGCGACAG GCTCTACTTA TGTAATTGCC GATTCCGACC AAGGTAATAG CATAAGCGTA
ACGGCTACTT ATACCGATGA CGATAACTTT GATGAGTCTG TTTTTGCTTC CGTTACAATT
GAAGCAGCAG CTACTCCGGC GACATTTGCA GGTTTAACTG CCACTGTTAG CAATGAAGCA
ACAGAAGCGC TTACTGGTAC TGTTACTGTT ACCGACCCCA ATACTGGTGA AGCGACAATT
GTAGCGCTAA CCGACGCTAT GACTACATAC GGCACTTTTT CTATTACTGC GGAAGGCAGT
TGGTCATACA CTCTAGACAC TTCAGCAGAC GCCGTTGCGA ACCTAACCTC TACTGACGAT
CCGTTATTAG ACTCTATCGA GCTTGAGTCG GCTGACGGTA CAACTGCAAA CTTGGTTATC
ACAATCACTG GTGCAGAGGT TTCCGGCCCA GTTACAAGCC AAGTGGCTCG TATCACTGAT
AACTCAACCG ACGATACTGG TGAGCTACGA TATGCACTAC CTTCTGCGCA GCTAGCGGGT
AAAATCACTG TATCTTTCTT GAAAGATCTA GATACTTTAG GCTCTGATGA CACCATTAAG
GATGCTTACA TCACCTTGTA TAATACTGAT ACAAGCACAA GTGGTGGCAA AGCGATTCTT
GATTTACGTA TTCAAGATGA CAATTTTGCA ATTCGTGATC AAGATGGCAT TGATGTGATG
AATGCTTTCA CTCCAGGTCA ATGGCAGGAT GTAGAGATTA CTTGGGAAGC CGCTGATGAT
GCCTCTGCTC CTGTGCTGAA CATCCTTATC GATGGCGTAG CCGTTACTTC CGTACCTTAC
ACTGGTTCTT CAACTGCAAT AGGTGGTGTT ACACACGTTG CCTTTAGATT CGCAGATAAC
TCAAGAACAG TAACGGGTAC TTACAATATA GATAACCTAT TCATCTATTC AGATACAGCT
GGCACTGCTT TGGTGTTCTC TGATGACTTT GAAGGTTATA GTGTTGATGA TTCGCTTGAT
ACTGACAACG CGAACTCGCC TTATAACTCA AGTACTTTCG AAGCTGTTGT AGCGGTTATG
GAAGTTCCTG GGGATGACTC CGGTTCGGGT GGCTCTGGCG ATACATCTGG CCCTGGCACC
GCTGGCAATA AGTATGCAGA AATTATTGAT ACAAGCACCG ATGACACTGG TGAATTGAGA
TATGCTCTGC CTGCAGCGCA GTTGGCGGGT AAATTAAATG TATCTTTCCT CAAAGACCTT
GATACTTTAG GGTCTGACGA TACGATTAAA GATGCTTATA TCACTCTGTA CAACACTGAT
ACTAGTACTA GTGGTGGTAA AGCAATTCTT GATTTACGTA TTCAAGATGA CAACTTTGCA
ATACGTGACC AAGACGGCAT CGATGTGATG AATGCCTTTA CACCAGGTCA GTGGCAGGAT
GTTGAAGTAA CATGGGAAGC TGCTGATGCT TCGTCTGCTC CTGTGTTGAA TATTCTTATC
GATGGTGTTG CGGTTACTTC GGTTCCTTAT ACCGGTTCAG CTACGGCTGT TGGTGGTGTT
ACCCATATCG CATTCCGATT TGCGGATAAC TCCAGAACAG TAACTGGTAC TTTTAATGTT
GATGATATTA AAATCTACTC TGATACTGCT GGTACCGCGT TAGTATTTGA AGATAGCTTT
GAGAGTGGTT ACAACACTGG TGATTCGCTA GATACTGATA ATGGTTCTTC ACCTTATCAC
TCAGCTACTT CTGAAGCTGT TGTTGCCGAG GAATAA
 
Protein sequence
MKMKTQALAL SALCMAMVGC GGSSSSDNGG EADIRFPFDG EITLSGDTDP GATLTAVVTD 
ANGISGSISY VWTSGDTVIT SATGSTYVIA DSDQGNSISV TATYTDDDNF DESVFASVTI
EAAATPATFA GLTATVSNEA TEALTGTVTV TDPNTGEATI VALTDAMTTY GTFSITAEGS
WSYTLDTSAD AVANLTSTDD PLLDSIELES ADGTTANLVI TITGAEVSGP VTSQVARITD
NSTDDTGELR YALPSAQLAG KITVSFLKDL DTLGSDDTIK DAYITLYNTD TSTSGGKAIL
DLRIQDDNFA IRDQDGIDVM NAFTPGQWQD VEITWEAADD ASAPVLNILI DGVAVTSVPY
TGSSTAIGGV THVAFRFADN SRTVTGTYNI DNLFIYSDTA GTALVFSDDF EGYSVDDSLD
TDNANSPYNS STFEAVVAVM EVPGDDSGSG GSGDTSGPGT AGNKYAEIID TSTDDTGELR
YALPAAQLAG KLNVSFLKDL DTLGSDDTIK DAYITLYNTD TSTSGGKAIL DLRIQDDNFA
IRDQDGIDVM NAFTPGQWQD VEVTWEAADA SSAPVLNILI DGVAVTSVPY TGSATAVGGV
THIAFRFADN SRTVTGTFNV DDIKIYSDTA GTALVFEDSF ESGYNTGDSL DTDNGSSPYH
SATSEAVVAE E