Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1336 |
Symbol | |
ID | 4185841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 1557622 |
End bp | 1560558 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638071330 |
Product | endoglucanase-like protein |
Protein accession | YP_677948 |
Protein GI | 110637741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.557457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0374623 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAA AAACCAGTCT CCTGCTTTTT GCTTACTTAA TCGTATTTAC CCATCAATTA TTCAGTCAAA CCCCTTACTT TACCAGTACA GAGTATAAAA AAGCACTCTG GATGACTACA CGTTTTTACG GCGGTCAACG TTCCGGCGAT AATAACTGGC TATTATATAA CCATTTACCA TCAGGTGTGG ATGCTTCGTT GAGAGGTAAA GCATTTATAG CAGATAAAGA CGGCACCTAC GATCTTTCCG GCGGATGGCA TGATTGCGGT GATCACGTAA AATTCGGGCA AACAGAATTC TATTCAGCTT ACATGTTGTT AAAAGGATAT GCAGAATTTC CGGCTGGTTA TGGTGACTAT TATGCATACG ACTATCAAGG ATATAAAACA TCCGGCAGCT GGTCTTTTGA AGGAACAGGA CACGCACCTA ATGGGATTCC GGACATTTTA GATGAAGTCA AACATGCAAC CGATTTTTTC ATCAAATGCG CCAAAGATGC AACAACTTTT TATTACCAGG TTGGCCAGGG TGATCCGGAT CACAAACAAT GGGTAACAGC AGTAAAAATG CAGACGTTAT CTGTTGCAAA CGGCGGACAG ACACGTGCTA CCTATAAAAA CCCGAACGAT GCTTCCATGC CTTCTTTTTG CGGTGCTACC TTAGCATTAA TGTCGCGTTT ATACAGACCA TATGATGCCG CTTATGCCGA TTTATGTCTT GTGCATGCAA AATATGCTTA TGACTATGCA AAAACAAAAA CTTCCACTGT TGGTTCACCT GATGGAAGCT TTTATGCTGC AAACGATAAT TACAAGGATG ATTATTCAAC AATGTGTGCA GAATTATTCT GGGCTACAAA CACCGCCTCC TATAAAACAG AAGCGCTAAG CTTTTCAATA AGTACCGCTC CGGGCCAAGG AGCTGACATT TATGGAAAAA ATTACGGGTT TGATTATTCA AACAATGGTG ATATTGCAAT TTATAACCAG GCATTACTAG GCAATACATA TGCAAAAGAT GTACTAAATT CAATTGTAAA TTCTTTTTAT TTAAATAATA CCCGAAGTGA TGGTCAGTTT AATGATGGCA ATACCAGCTG GGGGCCGCTG CGTTATAATG CCAACACCGC TTTCATTGTG GCTTTATGGC AAAAACTATA TGGTACAAGC GCTACTCCTA ATAAATACAT TTATGACAAC ATCGATTATA TTTTAGGAAA AAACTCCAGC AACCAATCAT TTGTTGTTGG TTTCGGAACA AAATCTCCTG CGCACCCACA CCATAGAAAT GTTTATTTAA GTGATGCAAA CACGCCGGTA AACAGCCTGG CAATACCTGC AAAAAATCAG CAATTTGGCT TAATGGTAGG CGGTACAAGA AATGCCGGTT CCTTTAATGA CAATCTTGAA ACCTATACAC ATACAGAAGG CGGTATTGAT TACAATGCTT GCCTGGTTGG TGTATTAGCC TACATCAATT CCGTACTCAG CCCTGTCGTT ACACACTCTA CGCCAAGCCT TGGCGCCGAC CAGTCTCTAT GCGGCAAAAC AAGTATTGTG TTAAATTCAA ATGTAAACGT TGACAATATC AAAACATATA CCTGGAAAAA CGGCACTACG ATTGTACAGG CAGCTTCAAA AACAGCTAAA ACACTTACCG TAACTTCAGC CGGTACATAT ACCTGTATTC TTGACTCACT AGGCATCTGG TCTACTCAGG ACGAAATTGT TATTACCTCT ACTCTTCCGG CAGTTAATCT TGGAGCAACA AAACAATTGT GCAACCCTGC AACGGCAACA TTAGATATCG GCGCTTCAGG AACAGGCTAT ACGTATCAAT GGAAAAAGAA TAACGTTGTG TTATTGGGAG AAACGTCTAA AACGTTGGTG GTACATAAAT CCGGAACGTA TATTGGTATA TTAAGCGCAA CAGGCTGTCC TTCTACTTCA GGAAGTGTAA CTATTACATC GCTTCTGCCT AATGCAGGCA ACGATACCCT TTGCAGTGCA GGAATAGCAA ACCTTAGCAT CAGCGGAAGC GGCGGTCCAT ATCAATGGTA TACTGCAGCA ACCAACGGCA CATTACTTAC AACAGGTACT ACGTACAAAC CAGCCGTATC TGCTCCTGTT ACAACCTATT ATGTACAGGA TGGCGGTTCC GTAAATGTTA CAGCAGTACC AAGCAATACA GGCTTCACAG GTCCTCAAAA TGCAGGCTCA ATCGGCATTA CCTTTACAGC TGCAAAAGCG TTTACGATTA CACAGTTAAA AGTATTACCT TATGTGTACA GCTGTAACGG AGAAAATGTA TTTGTTACCT TCGATCTTTT CCAGGGAACA ACGAAACTTG CTTCATACAC CTCTTCATCT GTTGTTTGTA CAGGTACACA ATCCGGAACA ACTTTTACTC CTTATACATT AAACTTCCCT ACAGCAATAA ATATTCCTGC TGCCGGTTCT TATACATTAA CACCAAGTGC GGGTAATCAA TTAGTATGGT ATAGTGGCGG TGCCGATTAT ACAACGATGG ATGTTAGTGG AGTCATTGAT GTTACAGGAC CTACAAGTAC CTATCAGCCA AATTCATTCC CTGGAATTTA CGACATAAAA ATTCTATCCG GATCTGGTTG TGCAAGAACT CCTGTATACG CTGTAGTAAA TTCCAATTAT CCGTCATGTA AAATAACTAC GGATATTAAT ACATCTGCAC AGGCAAAAGG ATACAACATT TATCCAAACC CGTCAAACGA AACATTCAGG TTGACAAGTC CATCTACAGT TGATATAACC ATTATGAATG CACTGGGCCA GGCTGTTGAA TCCTTCTCAA ATATCACGGA ATTGAATTTT GGACAGCATT TAAAACCAGG CATGTACTAT GTGATAATGA GTCAGGATGG ATTAACCATT CAAACGATGA ATATCATTAA ATACTAA
|
Protein sequence | MIKKTSLLLF AYLIVFTHQL FSQTPYFTST EYKKALWMTT RFYGGQRSGD NNWLLYNHLP SGVDASLRGK AFIADKDGTY DLSGGWHDCG DHVKFGQTEF YSAYMLLKGY AEFPAGYGDY YAYDYQGYKT SGSWSFEGTG HAPNGIPDIL DEVKHATDFF IKCAKDATTF YYQVGQGDPD HKQWVTAVKM QTLSVANGGQ TRATYKNPND ASMPSFCGAT LALMSRLYRP YDAAYADLCL VHAKYAYDYA KTKTSTVGSP DGSFYAANDN YKDDYSTMCA ELFWATNTAS YKTEALSFSI STAPGQGADI YGKNYGFDYS NNGDIAIYNQ ALLGNTYAKD VLNSIVNSFY LNNTRSDGQF NDGNTSWGPL RYNANTAFIV ALWQKLYGTS ATPNKYIYDN IDYILGKNSS NQSFVVGFGT KSPAHPHHRN VYLSDANTPV NSLAIPAKNQ QFGLMVGGTR NAGSFNDNLE TYTHTEGGID YNACLVGVLA YINSVLSPVV THSTPSLGAD QSLCGKTSIV LNSNVNVDNI KTYTWKNGTT IVQAASKTAK TLTVTSAGTY TCILDSLGIW STQDEIVITS TLPAVNLGAT KQLCNPATAT LDIGASGTGY TYQWKKNNVV LLGETSKTLV VHKSGTYIGI LSATGCPSTS GSVTITSLLP NAGNDTLCSA GIANLSISGS GGPYQWYTAA TNGTLLTTGT TYKPAVSAPV TTYYVQDGGS VNVTAVPSNT GFTGPQNAGS IGITFTAAKA FTITQLKVLP YVYSCNGENV FVTFDLFQGT TKLASYTSSS VVCTGTQSGT TFTPYTLNFP TAINIPAAGS YTLTPSAGNQ LVWYSGGADY TTMDVSGVID VTGPTSTYQP NSFPGIYDIK ILSGSGCART PVYAVVNSNY PSCKITTDIN TSAQAKGYNI YPNPSNETFR LTSPSTVDIT IMNALGQAVE SFSNITELNF GQHLKPGMYY VIMSQDGLTI QTMNIIKY
|
| |