Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_2149 |
Symbol | |
ID | 4187073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 2497450 |
End bp | 2500776 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 638072149 |
Product | glycoside hydrolase family 5 protein |
Protein accession | YP_678754 |
Protein GI | 110638545 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000532282 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA GTTTACTGCT CTTTGCGCTT ATTTTTACAT CTGTGATTGC CTCTGTTGGG CAGCAGGTAA CCATTATTAA CAAAAAATTT GTCGTGAACG GCAATGCTTC CTGTCCGATT TACTTCAACG GAGCTAATAC ACCCTGGGAC AACTGGAACG ACTTCGGTGG AAATTACGAT GCAGCATTCT GGTCTGCACA TTTCGCAACC CTAAAAGCAA ACGGGATCAA TGCTACCCGC GTATGGATCA GCTGTAACGG AGATGTGCAG CCCAATATCA ATACCGATGG AACAGTAACC GGCGTCAGCA CACAATTCTG GGCCAACGTC GATGACTTTT TTCAATCTGC AAAAAATAAC GGAATCTATG TAATGGCAAC TATGATGTCG TTTGACCATA CAAAAAACAC GTATACAAAA TATCAGAGCT GGAGAAACAT GCTGAACGAT CAGGCAAAAG TTCAATCCTA TTGCGATAAC TACCTTGTAC CTTTTGTGAA TCGCTACAAA ACGAATCCGT ATCTGATGTC TATTGATATT TCAAATGAAA TTGAATGGGT TGCAGAAGAT GCAAACAATA TGAAATGTTC GTATGCCGTA CTGCAGCGCT TTGTTGCCAT GTGTGCTTCA GCCATTCATA ATAATCCAAG AACAGATGGT ACATCGGTAT TGGTTACGAT GGGTTCGGCA GCTACTAAAT GGAATGCTAC TAAAATGCGT ATCGGTCAAA ATGGTGCCTG GTCACAGAAT AATTCAGATG GAAATAAGTG GAGCGATGCT GCTTTAAAGG CACAATACAA CCAGGCTAAC GCGGTGCTGG ATTTTTATTC CCCGCATTAT TATGCCTGGA TCGATGGATA TTATTCTAAC CCGTATGTGC GTACACCGAG TGATTTCGGT ATGGATGAAA AGGCTGTTCT TATTGGTGAA ACACCAGCGG GCAATCCCGG CACGCCAAAC CTTACGCCAT TGGCATCTTA TGAAGCGTTA AAAAATAATG GCTATCAGGG GCATTTTCCA TGGACATCAA ATAGCGTAGA CAGTAACGGA GGGATTGAAA AATTCGGTAC AGATGCAAAA ACATTTTCAA CAACATATAG CGCACTTGTA AAACCAACCT GTGCAGTAGC TTGTACAACA CCGGCTCCAA CGGTAACAAC ACCTGTCGTT TATTGTAAAA ATGCGTCAGC TGTTGCGTTA ACAGCAACAG GCACAGCCCT GAAATGGTAT ACAGATAATA CAACCACAAC GGCATTTTCC ACAACACCGA TACCTTCAAC AACAGTAGCT GGTACAACAA GTTATTATGT TTCACAGACG CTAAATGGTT GTGAAGGAAC AAGAGCAGCA GTACAAGTAA CAGTAAAAGA ACTACCGGCA GCAACGATCA CCACAACAAC GGCAACCACA TTCTGTGCCG GTGGAAGTGT GAGCTTAGCC GCGAATACAG GCACAGGCTT AACGTATGTC TGGAAGAAAG ATAATACTAC GATCACCGGT GCGACAGCAT CAACCTATCC GGCAGCAACA GCAGGCAGCT ACACGGTAAC GGTTACGTCA AATAACTGTT CAGAAACTTC GGCAGCAAAG GTTGTAACGG TAAATGCCTT GCCGGCAGCA ACGATCACCA CAACAACGGC AACCACATTC TGTGCCGGTG GAAGTGTGAG CTTAGCCGCG AATACAGGCA CAGGCTTAAC GTATGTCTGG AAGAAAGATA ATACTACGAT CACCGGTGCG ACAGCATCAA CCTATCCGGC AGCAACAGCA GGCAGCTACA CGGTAACGGT TACGTCAAAT AACTGTTCAG AAACTTCGGC AGCAAAGGTT GTAACGGTAA ATGCCTTGCC GGCAGCAACG ATCACCACAA CAACGGCAAC CACATTCTGT GCCGGTGGAA GTGTGAGCTT AGCCGCGAAT GCAGGTGCAG GCTTAACGTA TGTATGGAAG AAAGATAATA CTACGATCAC CGGTGCAACA GCATCCACCT ATCCGGCAGC AACAGCAGGA AGCTATACAG TAACGGTTAC GTCAAATAAC TGTTCAGAAA TTTCCGCAGC AAAGGTTGTA ACGGTAAATG CCTTGCCGGC AGCAACGATC ACCACAACAA CGGCAACCAC ATTCTGTGCC GGTGGAAGTG TGAGCTTAGC CGCGAATACA GGCACAGGCT TAACGTATGT CTGGAAGAAA GATAATACTA CGATCACCGG TGCAACAGCA TCCACCTATC CGGCAGCAAC AGCAGGCAGC TACACGGTAA CGGTTACGTC AAATAACTGT TCAGAAACTT CGGCAGCAAA GGTTGTAACG GTAAATGCCT TGCCGGCAGC AACGATCACC ACAACAACGG CAACCACATT CTGTGCCGGT GGAAGTGTGA GCTTAGCCGC GAATACAGGT GCAGGCTTAA CGTATGTATG GAAGAAAGAT AATACTACGA TCACCGGTGC AACAGCATCC ACCTATCCGG CAGCAACAGC AGGAAGCTAT ACAGTAACGG TTACGTCAAA TAACTGTTCA GAAATTTCCG CAGCAAAGGT TGTAACGGTA AATGCCTTGC CGGCAGCAAC GATCACCACA ACAACGCCAA CCACATTCTG TGCGGGCGGA AGTGTGAACT TAGCCGCGAA TACAGGTGCA GGTTTAACGT ATGTATGGAA GAAAGATAAT ACCACCATTA CCGGTGCGAC AGCATCCACC TATCCGGCAG CAATAGCAGG CAGCTACACG GTAACGGTTA CGTCAAATAA CTGTTCAGAA ACTTCGGCAG CAAAGGTTGT AACGGTTACA GCTGCAACAA CCTGGTATCA GGATCTCGAT GGTGATGGAA AAGGGAATGC GGCTGTTACA CAGACAGCAT GCACGCAGCC TGCAGGCTAT GTATCGGTAG CAGGCGATGC CTGTCCGTCT GACCCGGATA AACTGATTGC CGGAGACTGT GGCTGCGGTA TAGCAGAAGG AACATGTACC GATTGTGCCG GTGTAATTAA CGGAAAAGCA GCACGTGATG TTTGTAATGT TTGTTCCGGA GGTACAACAG GTATTAATCC GATTACGGAT ATTTCTCAAT GCGGTCCGGT AACAGCTATT GAAAATAGTC TGTCGGCTGA TCTGCATCTG TATCCGAATC CATATGAAAC TGAACTGTAC ATAGAAGCTG GTACCGGAGA ATTTATGATT GTGGTATACA ACAATTCCGG ACTGGAAGTT CTTAGAGGTA CCTATGAATC ACAGGCGCTT ATCGGTGCCG GATTAGCGCC GGGCATATAT TTAATCCGTA TTGAAAAAAA CGGTCTTACA GAGACCCGGA AAATAATAAA AAAATAA
|
Protein sequence | MKKSLLLFAL IFTSVIASVG QQVTIINKKF VVNGNASCPI YFNGANTPWD NWNDFGGNYD AAFWSAHFAT LKANGINATR VWISCNGDVQ PNINTDGTVT GVSTQFWANV DDFFQSAKNN GIYVMATMMS FDHTKNTYTK YQSWRNMLND QAKVQSYCDN YLVPFVNRYK TNPYLMSIDI SNEIEWVAED ANNMKCSYAV LQRFVAMCAS AIHNNPRTDG TSVLVTMGSA ATKWNATKMR IGQNGAWSQN NSDGNKWSDA ALKAQYNQAN AVLDFYSPHY YAWIDGYYSN PYVRTPSDFG MDEKAVLIGE TPAGNPGTPN LTPLASYEAL KNNGYQGHFP WTSNSVDSNG GIEKFGTDAK TFSTTYSALV KPTCAVACTT PAPTVTTPVV YCKNASAVAL TATGTALKWY TDNTTTTAFS TTPIPSTTVA GTTSYYVSQT LNGCEGTRAA VQVTVKELPA ATITTTTATT FCAGGSVSLA ANTGTGLTYV WKKDNTTITG ATASTYPAAT AGSYTVTVTS NNCSETSAAK VVTVNALPAA TITTTTATTF CAGGSVSLAA NTGTGLTYVW KKDNTTITGA TASTYPAATA GSYTVTVTSN NCSETSAAKV VTVNALPAAT ITTTTATTFC AGGSVSLAAN AGAGLTYVWK KDNTTITGAT ASTYPAATAG SYTVTVTSNN CSEISAAKVV TVNALPAATI TTTTATTFCA GGSVSLAANT GTGLTYVWKK DNTTITGATA STYPAATAGS YTVTVTSNNC SETSAAKVVT VNALPAATIT TTTATTFCAG GSVSLAANTG AGLTYVWKKD NTTITGATAS TYPAATAGSY TVTVTSNNCS EISAAKVVTV NALPAATITT TTPTTFCAGG SVNLAANTGA GLTYVWKKDN TTITGATAST YPAAIAGSYT VTVTSNNCSE TSAAKVVTVT AATTWYQDLD GDGKGNAAVT QTACTQPAGY VSVAGDACPS DPDKLIAGDC GCGIAEGTCT DCAGVINGKA ARDVCNVCSG GTTGINPITD ISQCGPVTAI ENSLSADLHL YPNPYETELY IEAGTGEFMI VVYNNSGLEV LRGTYESQAL IGAGLAPGIY LIRIEKNGLT ETRKIIKK
|
| |