Gene CHU_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2149 
Symbol 
ID4187073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2497450 
End bp2500776 
Gene Length3327 bp 
Protein Length1108 aa 
Translation table11 
GC content46% 
IMG OID638072149 
Productglycoside hydrolase family 5 protein 
Protein accessionYP_678754 
Protein GI110638545 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000532282 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GTTTACTGCT CTTTGCGCTT ATTTTTACAT CTGTGATTGC CTCTGTTGGG 
CAGCAGGTAA CCATTATTAA CAAAAAATTT GTCGTGAACG GCAATGCTTC CTGTCCGATT
TACTTCAACG GAGCTAATAC ACCCTGGGAC AACTGGAACG ACTTCGGTGG AAATTACGAT
GCAGCATTCT GGTCTGCACA TTTCGCAACC CTAAAAGCAA ACGGGATCAA TGCTACCCGC
GTATGGATCA GCTGTAACGG AGATGTGCAG CCCAATATCA ATACCGATGG AACAGTAACC
GGCGTCAGCA CACAATTCTG GGCCAACGTC GATGACTTTT TTCAATCTGC AAAAAATAAC
GGAATCTATG TAATGGCAAC TATGATGTCG TTTGACCATA CAAAAAACAC GTATACAAAA
TATCAGAGCT GGAGAAACAT GCTGAACGAT CAGGCAAAAG TTCAATCCTA TTGCGATAAC
TACCTTGTAC CTTTTGTGAA TCGCTACAAA ACGAATCCGT ATCTGATGTC TATTGATATT
TCAAATGAAA TTGAATGGGT TGCAGAAGAT GCAAACAATA TGAAATGTTC GTATGCCGTA
CTGCAGCGCT TTGTTGCCAT GTGTGCTTCA GCCATTCATA ATAATCCAAG AACAGATGGT
ACATCGGTAT TGGTTACGAT GGGTTCGGCA GCTACTAAAT GGAATGCTAC TAAAATGCGT
ATCGGTCAAA ATGGTGCCTG GTCACAGAAT AATTCAGATG GAAATAAGTG GAGCGATGCT
GCTTTAAAGG CACAATACAA CCAGGCTAAC GCGGTGCTGG ATTTTTATTC CCCGCATTAT
TATGCCTGGA TCGATGGATA TTATTCTAAC CCGTATGTGC GTACACCGAG TGATTTCGGT
ATGGATGAAA AGGCTGTTCT TATTGGTGAA ACACCAGCGG GCAATCCCGG CACGCCAAAC
CTTACGCCAT TGGCATCTTA TGAAGCGTTA AAAAATAATG GCTATCAGGG GCATTTTCCA
TGGACATCAA ATAGCGTAGA CAGTAACGGA GGGATTGAAA AATTCGGTAC AGATGCAAAA
ACATTTTCAA CAACATATAG CGCACTTGTA AAACCAACCT GTGCAGTAGC TTGTACAACA
CCGGCTCCAA CGGTAACAAC ACCTGTCGTT TATTGTAAAA ATGCGTCAGC TGTTGCGTTA
ACAGCAACAG GCACAGCCCT GAAATGGTAT ACAGATAATA CAACCACAAC GGCATTTTCC
ACAACACCGA TACCTTCAAC AACAGTAGCT GGTACAACAA GTTATTATGT TTCACAGACG
CTAAATGGTT GTGAAGGAAC AAGAGCAGCA GTACAAGTAA CAGTAAAAGA ACTACCGGCA
GCAACGATCA CCACAACAAC GGCAACCACA TTCTGTGCCG GTGGAAGTGT GAGCTTAGCC
GCGAATACAG GCACAGGCTT AACGTATGTC TGGAAGAAAG ATAATACTAC GATCACCGGT
GCGACAGCAT CAACCTATCC GGCAGCAACA GCAGGCAGCT ACACGGTAAC GGTTACGTCA
AATAACTGTT CAGAAACTTC GGCAGCAAAG GTTGTAACGG TAAATGCCTT GCCGGCAGCA
ACGATCACCA CAACAACGGC AACCACATTC TGTGCCGGTG GAAGTGTGAG CTTAGCCGCG
AATACAGGCA CAGGCTTAAC GTATGTCTGG AAGAAAGATA ATACTACGAT CACCGGTGCG
ACAGCATCAA CCTATCCGGC AGCAACAGCA GGCAGCTACA CGGTAACGGT TACGTCAAAT
AACTGTTCAG AAACTTCGGC AGCAAAGGTT GTAACGGTAA ATGCCTTGCC GGCAGCAACG
ATCACCACAA CAACGGCAAC CACATTCTGT GCCGGTGGAA GTGTGAGCTT AGCCGCGAAT
GCAGGTGCAG GCTTAACGTA TGTATGGAAG AAAGATAATA CTACGATCAC CGGTGCAACA
GCATCCACCT ATCCGGCAGC AACAGCAGGA AGCTATACAG TAACGGTTAC GTCAAATAAC
TGTTCAGAAA TTTCCGCAGC AAAGGTTGTA ACGGTAAATG CCTTGCCGGC AGCAACGATC
ACCACAACAA CGGCAACCAC ATTCTGTGCC GGTGGAAGTG TGAGCTTAGC CGCGAATACA
GGCACAGGCT TAACGTATGT CTGGAAGAAA GATAATACTA CGATCACCGG TGCAACAGCA
TCCACCTATC CGGCAGCAAC AGCAGGCAGC TACACGGTAA CGGTTACGTC AAATAACTGT
TCAGAAACTT CGGCAGCAAA GGTTGTAACG GTAAATGCCT TGCCGGCAGC AACGATCACC
ACAACAACGG CAACCACATT CTGTGCCGGT GGAAGTGTGA GCTTAGCCGC GAATACAGGT
GCAGGCTTAA CGTATGTATG GAAGAAAGAT AATACTACGA TCACCGGTGC AACAGCATCC
ACCTATCCGG CAGCAACAGC AGGAAGCTAT ACAGTAACGG TTACGTCAAA TAACTGTTCA
GAAATTTCCG CAGCAAAGGT TGTAACGGTA AATGCCTTGC CGGCAGCAAC GATCACCACA
ACAACGCCAA CCACATTCTG TGCGGGCGGA AGTGTGAACT TAGCCGCGAA TACAGGTGCA
GGTTTAACGT ATGTATGGAA GAAAGATAAT ACCACCATTA CCGGTGCGAC AGCATCCACC
TATCCGGCAG CAATAGCAGG CAGCTACACG GTAACGGTTA CGTCAAATAA CTGTTCAGAA
ACTTCGGCAG CAAAGGTTGT AACGGTTACA GCTGCAACAA CCTGGTATCA GGATCTCGAT
GGTGATGGAA AAGGGAATGC GGCTGTTACA CAGACAGCAT GCACGCAGCC TGCAGGCTAT
GTATCGGTAG CAGGCGATGC CTGTCCGTCT GACCCGGATA AACTGATTGC CGGAGACTGT
GGCTGCGGTA TAGCAGAAGG AACATGTACC GATTGTGCCG GTGTAATTAA CGGAAAAGCA
GCACGTGATG TTTGTAATGT TTGTTCCGGA GGTACAACAG GTATTAATCC GATTACGGAT
ATTTCTCAAT GCGGTCCGGT AACAGCTATT GAAAATAGTC TGTCGGCTGA TCTGCATCTG
TATCCGAATC CATATGAAAC TGAACTGTAC ATAGAAGCTG GTACCGGAGA ATTTATGATT
GTGGTATACA ACAATTCCGG ACTGGAAGTT CTTAGAGGTA CCTATGAATC ACAGGCGCTT
ATCGGTGCCG GATTAGCGCC GGGCATATAT TTAATCCGTA TTGAAAAAAA CGGTCTTACA
GAGACCCGGA AAATAATAAA AAAATAA
 
Protein sequence
MKKSLLLFAL IFTSVIASVG QQVTIINKKF VVNGNASCPI YFNGANTPWD NWNDFGGNYD 
AAFWSAHFAT LKANGINATR VWISCNGDVQ PNINTDGTVT GVSTQFWANV DDFFQSAKNN
GIYVMATMMS FDHTKNTYTK YQSWRNMLND QAKVQSYCDN YLVPFVNRYK TNPYLMSIDI
SNEIEWVAED ANNMKCSYAV LQRFVAMCAS AIHNNPRTDG TSVLVTMGSA ATKWNATKMR
IGQNGAWSQN NSDGNKWSDA ALKAQYNQAN AVLDFYSPHY YAWIDGYYSN PYVRTPSDFG
MDEKAVLIGE TPAGNPGTPN LTPLASYEAL KNNGYQGHFP WTSNSVDSNG GIEKFGTDAK
TFSTTYSALV KPTCAVACTT PAPTVTTPVV YCKNASAVAL TATGTALKWY TDNTTTTAFS
TTPIPSTTVA GTTSYYVSQT LNGCEGTRAA VQVTVKELPA ATITTTTATT FCAGGSVSLA
ANTGTGLTYV WKKDNTTITG ATASTYPAAT AGSYTVTVTS NNCSETSAAK VVTVNALPAA
TITTTTATTF CAGGSVSLAA NTGTGLTYVW KKDNTTITGA TASTYPAATA GSYTVTVTSN
NCSETSAAKV VTVNALPAAT ITTTTATTFC AGGSVSLAAN AGAGLTYVWK KDNTTITGAT
ASTYPAATAG SYTVTVTSNN CSEISAAKVV TVNALPAATI TTTTATTFCA GGSVSLAANT
GTGLTYVWKK DNTTITGATA STYPAATAGS YTVTVTSNNC SETSAAKVVT VNALPAATIT
TTTATTFCAG GSVSLAANTG AGLTYVWKKD NTTITGATAS TYPAATAGSY TVTVTSNNCS
EISAAKVVTV NALPAATITT TTPTTFCAGG SVNLAANTGA GLTYVWKKDN TTITGATAST
YPAAIAGSYT VTVTSNNCSE TSAAKVVTVT AATTWYQDLD GDGKGNAAVT QTACTQPAGY
VSVAGDACPS DPDKLIAGDC GCGIAEGTCT DCAGVINGKA ARDVCNVCSG GTTGINPITD
ISQCGPVTAI ENSLSADLHL YPNPYETELY IEAGTGEFMI VVYNNSGLEV LRGTYESQAL
IGAGLAPGIY LIRIEKNGLT ETRKIIKK