Gene Information |
Locus tag | Ccur_05530 |
Symbol | |
ID | 8374761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | + |
Start bp | 654438 |
End bp | 655673 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644993477 |
Product | peptidase family protein |
Protein accession | YP_003150954 |
Protein GI | 256826995 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
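The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record itself does not state either convention. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 654438
end_bp = 655673
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is the stop,
# which does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)  # prints: 1236 0 411
```

Under those assumptions the computed values match the reported gene length (1236 bp) and protein length (411 aa).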
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.892794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 111 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAG AGCAGCTTTC ATTCCTCGAA ACTATGGTAG CAACACCTTC GGTAACGGGA TGCGAGCACC TGGTTGCAAG TCTTGTTCGA GAGCGCTTAG CCGATGTCGC TGATGAAATC GATACCGATG TCATGGGCTC GGTGCATGCA CGCGTGCGGG GGACAGGTGT TCCCAACGAC GCAGCATTAT CTGCACAGAC AGATGGCGGA GCTTCTGGTG AAGAAGCGGC AGGCGTGACG AATGAGGCCG AGACAGCAGG CGTGACGAAT GAGACCGCTG CGGCAACAGC ATCAGATGAA GTCACCGCGG CAACAGCATC AGATGAAGCA GCACCCCGTG TTCCTTCGGT TTTAATTGCT GCTCACATGG ATGAGATTGG CCTTATGGTC AATTACATTT CTGACGAGGG TTTTCTCAGT GTAAAACCGG TTGGTGGTGT TGACGCTGGT CTGCTTCCTG GATTGCGGGT GGATGTGCAC ACTGACGAAG GAGCCCTGCG CGGAGTTGTT GGCCGCCAAC CAATTCATTT GGTACCTGCC GATGAACGCA AGCAGGTGGC TCCTTTCGAA AAGCTTGTTA TTGATCTTGG TATGCCGGCT GATATGGTTA AACAGCGCGT ACAGATAGGC GATACCGTAA CGTACGGAGT TGGCTTTGAG CGCTTCGGCG ATGGTATGGC CGTGTCGCGT GCCTTTGACG ATAAAGCAGG CGTTTTTATT GGGATTGGCG TAGCGTGCGA ACTGGCACGC GGTCCACGTG CAGCGGCTGA TTTCATTCTT GCTGCTACCG TACAGGAAGA AATCGGTTTG CGCGGTGGAC TCACGAGCGC TTACAGCGTG GATGCCGATG TGTGCCTGGC TTTCGATGTA ACGCATGCCA CTGATTATCC CGGTATTGAT AAGGCAAAAC ACGGCAACAT TGTCTGTGGC GGTGGACCGG TTATTGCGCG GGGTCCGAAT ATCAATCCTG TTGTCTTTAA TCGTCTTGTC GCAGCAGCGC AGGCTGAAGG AATTGATTAT CAGATAGAAG CTGAACCGGG TGTAACCGGC ACCGATGCAG GGGAAATGCA GATTCAACGT GGAGGTAAAG CATGCGGATT GGTGTCGTTG CCACTGCGCT ATATGCATAC CCCGACGGAA GTCATCAATC TGTCCGATGT GGATGGCTGT ATTAAGCTTC TTGCGCGGTT CATTCGCGAT TTAGATGCTG ACGTTTCGTT TGTGCCGGGG ATGTAA
|
Protein sequence | MNEEQLSFLE TMVATPSVTG CEHLVASLVR ERLADVADEI DTDVMGSVHA RVRGTGVPND AALSAQTDGG ASGEEAAGVT NEAETAGVTN ETAAATASDE VTAATASDEA APRVPSVLIA AHMDEIGLMV NYISDEGFLS VKPVGGVDAG LLPGLRVDVH TDEGALRGVV GRQPIHLVPA DERKQVAPFE KLVIDLGMPA DMVKQRVQIG DTVTYGVGFE RFGDGMAVSR AFDDKAGVFI GIGVACELAR GPRAAADFIL AATVQEEIGL RGGLTSAYSV DADVCLAFDV THATDYPGID KAKHGNIVCG GGPVIARGPN INPVVFNRLV AAAQAEGIDY QIEAEPGVTG TDAGEMQIQR GGKACGLVSL PLRYMHTPTE VINLSDVDGC IKLLARFIRD LDADVSFVPG M
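The relationship between the two sequence fields can be sketched with a small translator. This is an illustration, not the database's own pipeline: the function names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so the standard assignment string is used here. Only the first 30 and last 15 nucleotides of the gene sequence are included to keep the example short.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record (blocks of 10) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last 15 nucleotides of the gene sequence, copied from the record.
gene_start = "ATGAACGAAG AGCAGCTTTC ATTCCTCGAA"
gene_end = "GTGCC GGGG ATGTAA"

print(translate(gene_start))  # prints: MNEEQLSFLE
print(translate(gene_end))    # prints: VPGM
```

The two outputs agree with the first ten and last four residues of the protein sequence. Applying `gc_fraction` to the complete gene sequence gives the value to compare against the reported 53% GC content; on a short prefix the figure will differ.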