Gene Information |
Locus tag | Sde_2762 |
Symbol | |
ID | 3968358 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 3497771 |
End bp | 3499120 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637921862 |
Product | cellulose binding, type IV |
Protein accession | YP_528234 |
Protein GI | 90022407 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
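
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. Both assumptions are consistent with the reported 1350 bp and 449 aa, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3497771, 3499120)
print(length)                     # 1350
print(protein_length_aa(length))  # 449
```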
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000724406 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTCCG AAATCAAAAA GCGCCTGCGG TGTGCTGTCT ACACACGTAA ATCAACCGAT GAAGGGCTCG ACCAGGAATA CAACTCGATT GACGCCCAGC GAGATGCGGG CCACGCCTAT ATCGCCAGTC AACGCGCTGA AGGCTGGATC CCAGTCGAAA ACGACTATGA CGCCCCGGCA TACTCCGGCA GCAATATGGA CCGCCCTGCG ATGCAACGTA TGCTGGCCGA CATTAAGGAA GGCAGAGTCG ATGTTGTGGT GGTCTACAAG ATAGACCGAC TGACACGCAG TCTGATGGAC TTCTCGAAAA TGATCGAAGT CTTTGAGCGA CATGGCGTGT CCTTTGTATC AGTGACCCAG CAATTTAACA CCACCAATTC GATGGGGCGG CTGATGCTGA ATATCTTGCT GTCCTTTGCG CAGTTCGAAC GAGAAGTCAC CGGTGAACGC ATCCGCGATA AAATCACCGC CAGCAAGAAA AAAGGGCTCT GGATGGGTGG CATTCCGCCG CTTGGTTACG ATGTCGTCGA TCGCTGCCTG GTGATCAATC CGCAGGAAGC CAAACTCATC AAGCATATTT TTAAGCGGTT CACGGAGATC GCCTCCACCA CCCTTCTCTA CAAAAAGCTC AGATTGGAGA ATGTCATGAG CAAGTCGTGG ACAACACAAG ACGGTCGTCA TCGCCCAGGC AAGCCGATTG ATCGGGGGCT GATATACAAG CTGTTAAACA ACCGGACCTA CCTTGGTGAG CTGCGACACA AGGACCAATG GTACGAAGGA AAACACGAAC CGATTATCGA CAAAAAGCTC TGGGGCGATG TGCATTCGAT TTTGGCTGTT AACTTCCGGA CGCGAGGCAA CTACACCAAA GGCAAAATCC CATTTCTGTT AAAGGGTATG ATTTTCGGCG AGGATGGCCG CGCTTTGACA TGCTGGACCT CGGCCAAGAA GAAAAGCGGG CGCCGATACC GGTATTACAT CAGCACCCGG GACACTAAGG AGTTTTCAGG GGCTTCCGGC CTGCCGAGAA TACCGGCGGC CGAACTTGAA TCGGTGGTGG TCGATCAGAT TCGTGGCCTG CTACAAACGC CACCGGTCAG AGAGCGAATC GCGGCCGTTA CCGGCCAGCG GGAAGATGCC CTGGATGAGG CTCAAGTAGC GGTGGCCCTC AACCAAATAG ACAAAGTGTG GGATCAGCTT TTCCCAGAAG AGCAGGCTCG GATCATCCGG CTGATGGTCG AGAAGGTGGT GGTCAGCCCC GATCGCGTGG ATGTGCGGCT GAGGGACAAC GGGGTTGAGC GGCTGGCCCT GGAAATCACC GACTCCTACA AGCAGGAGGA TGTGGCATGA
|
Protein sequence | MSSEIKKRLR CAVYTRKSTD EGLDQEYNSI DAQRDAGHAY IASQRAEGWI PVENDYDAPA YSGSNMDRPA MQRMLADIKE GRVDVVVVYK IDRLTRSLMD FSKMIEVFER HGVSFVSVTQ QFNTTNSMGR LMLNILLSFA QFEREVTGER IRDKITASKK KGLWMGGIPP LGYDVVDRCL VINPQEAKLI KHIFKRFTEI ASTTLLYKKL RLENVMSKSW TTQDGRHRPG KPIDRGLIYK LLNNRTYLGE LRHKDQWYEG KHEPIIDKKL WGDVHSILAV NFRTRGNYTK GKIPFLLKGM IFGEDGRALT CWTSAKKKSG RRYRYYISTR DTKEFSGASG LPRIPAAELE SVVVDQIRGL LQTPPVRERI AAVTGQREDA LDEAQVAVAL NQIDKVWDQL FPEEQARIIR LMVEKVVVSP DRVDVRLRDN GVERLALEIT DSYKQEDVA
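
The protein sequence can be re-derived from the gene sequence with translation table 11, and the GC content recomputed from the same string. This is a minimal standard-library sketch, not the database's own pipeline. The amino-acid assignments of table 11 match the standard code, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It ignores alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGAGTTCCG AAATCAAAAA GCGCCTGCGG"))  # MSSEIKKRLR
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the Protein sequence and GC content fields.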