Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1727 |
Symbol | cel |
ID | 4183908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 2029574 |
End bp | 2031340 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638071726 |
Product | endoglucanase |
Protein accession | YP_678336 |
Protein GI | 110638127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.296134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC TACACCTGAT ACCTGTATTT TTTTTATATG TTGTTCACGC AGCTGCGCAA TCTCCTACAC TCCTGATTGA AGATTTTGAA GATGGCAATA CGCAAAATAA TTTAGGCGGC TATTGGTATA GCTTTAATGA TAATCCTAAT GGTGGCAAAA GCCGGTTGAA ACAGACCGCC TGGCAGAAAG AAGCATTTGT TCAGACAGGT GGCTATCAGT CTGCAGGTAT GTTTCAGGTA GACGTTATCC TTGAAAAAGG GAACTATCAA TGGAATCCCT ATTTTGCATT TGCAACCAGT GTTGCTAAAT CTGTTCCCAA TAATATAAAC CCTGCTTCCT TTGCAGGTAT CTCTTACTGG CATAAAGGCG TGGCACACAA AGTGCGGGTT GAAACCGCTG AAGTTACCGA TTATGATTTC TATTCAATGC GGGTGCCGGC AAGCGATGTA TGGACCTTTG TTACCATTGA TTTTTCCATG TTAAATCAGG AAGGCTGGGG TAAGAAAGTT CCACTGAATC TGGACAACTC CATCAAACTA ATCTGGAACC TGGATGAGAC ATCCGGCAAT TTTCAGCTGG ATGATATACG TTTTGTGAAA CAAATCACCT ATGTTAAACA GCATAATATG GAAATCCTTC CGGCTGAAAT ACCTTCACCC ATTGCGGTAA AAGGAAATGT CTCAAATCCG TTAAACGATT TGTCGAAAAA GTATTTAACC AAAGGATTGA ATCTTGCCAG CTGGGCCGAA GCAAATAAAA TTACTTCTGC CAATCCAAAA GACTGGAAAT ACAATGAAGC AATTATTAAG CTGCAGGCCG AACAGGGGCT GCTGGGTATC CGTTTCCCGA TTGATCTGGA CCTCTATGTA GTAGATCGCT TAAATGTATT GAATGGCACC AAAAAGAAAA TAGAAATAGA ACCGATGCTC TATACCTTGA TGGACTCCAT GAATATCTGG ACCAAACGCT ATGGACTTTC CTTAACAATC GATTATCATG CCTATGATGG AAGCTACAAC AGAGCATCTT CCAAGGATCC AAAATTCAGA GAAGCGGTTT CTTCTTTGTG GCGTGTGGTG GCACAGCATT TTGTTAAAGA GAAACGTGAA GACTTGTTTT TTGAATTAAC CAACGAACCT TGTTTAAGTT TACCGGAAGG CGAATACATT GATCAGACAG ACTGGACATT ACTTGCTCAA ATGATGATTG ACTCTATACG CCGGGTGGAT AAAACACGTC CGATTATTTT CGGCGATACA AAATGGTATA GCTTAGACGA ATTAATTAAA AATAAGCCGT TAAAAGATCC GTATGTAATC TACTGTTTCC ATATGTACGA TCCGTTCTTG TTTACACATC AGGGAGCTTC ATGGGCCAAT ATGGGCACCA TGAAAAACAT TCCGTTCCCG TATTCACCTG AACGCTGGTC TACAGAGTTC CGCGACTTTG GTATTGTAGA CGGTACACCG GCCTGGGTAA AAGACCTGGC AAAAAGGTAT TATCAGGAAG GAAATAAACA GTTTATCAAA AACAGGCTGG CGAAAGTAAA AAACTGGGCA TATGAATATA ATGTTCCGCT GATCTGCAAT GAATGGGGAG CTTTACCGAA CACAGCTAAA ATCGAAGACC TGAACGCGTA CTTTAAAACC ATGGGAGAAA TTTTCGAAGA GATGGATATT TCGTGGCAGG TATGGTTTGG TATTATGGAT TCAGACTATA AATTACTGCC GGGAATGGCT GAAGCACTGC ATTTGAAAAA GAAATAG
|
Protein sequence | MKILHLIPVF FLYVVHAAAQ SPTLLIEDFE DGNTQNNLGG YWYSFNDNPN GGKSRLKQTA WQKEAFVQTG GYQSAGMFQV DVILEKGNYQ WNPYFAFATS VAKSVPNNIN PASFAGISYW HKGVAHKVRV ETAEVTDYDF YSMRVPASDV WTFVTIDFSM LNQEGWGKKV PLNLDNSIKL IWNLDETSGN FQLDDIRFVK QITYVKQHNM EILPAEIPSP IAVKGNVSNP LNDLSKKYLT KGLNLASWAE ANKITSANPK DWKYNEAIIK LQAEQGLLGI RFPIDLDLYV VDRLNVLNGT KKKIEIEPML YTLMDSMNIW TKRYGLSLTI DYHAYDGSYN RASSKDPKFR EAVSSLWRVV AQHFVKEKRE DLFFELTNEP CLSLPEGEYI DQTDWTLLAQ MMIDSIRRVD KTRPIIFGDT KWYSLDELIK NKPLKDPYVI YCFHMYDPFL FTHQGASWAN MGTMKNIPFP YSPERWSTEF RDFGIVDGTP AWVKDLAKRY YQEGNKQFIK NRLAKVKNWA YEYNVPLICN EWGALPNTAK IEDLNAYFKT MGEIFEEMDI SWQVWFGIMD SDYKLLPGMA EALHLKKK
|
| |