Gene CHU_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_0854 
Symbol 
ID4184213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp978072 
End bp979172 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content35% 
IMG OID638070856 
Productcapsular polysaccharide biosynthesis protein 
Protein accessionYP_677477 
Protein GI110637270 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4421] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0933972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.113997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGTCTTC AACGGATCTG TTACGAAAAG ATTTTTTTGA AAAAGAAAAT AATAGCAACT 
GGAACAATCT GGATGAGTAA CATGCAGCAA ATAAATAATG CTTCTGTAAG GCCTGTTAGT
GCTTACAATA CAAAGCATCT GCAAATACCT GCAATTGCAT TGCTTCAATC TTCTCCATTG
CCTGAACAGG CCATACGTAC ATTTAGCAGG GCGCTTCTGA CACCAACAAA CCATTTATTC
GTAAATGGCA GATACATTCG CACCGGATTA ATTTCCGGTT TCGATTCAAG AAAGCTTTCT
TTTTTTCAAA GAATATATTT ATTCTTAAAA TCACGATTTT TTACAGAAAA TAAATCAGTT
AAAATTTCAG CAGTATGGGC GCATGATAGC TGGAGTAATA ATTATTTCCA CTGGTTTAAT
GACACGTTGC CTCGATTATT TTTATTGAGT AAACAAATTG AAGACTCGGT TGCGGTATTG
CCTGTTGAAT TAAGTAAGAT CACATTCATT GTTGAATCAT TGGAGTTACT TAAAATTGAA
CATCAATGGA TTGATCAGAA AAAGTCTCAT CGGTTTGAAT CGTTAAGTGT ATTACATACG
GCAACACTTC AGCCTGACAT TAATCCGTTG CTTCAAAAAC AGATGCGCGA CGCTGTTTTT
TCAGCAATGA AAATTGACCC GCAAGAAAGA CCTTTCAGGA AAATATATAT TTCGAGAGCA
CATGCGAGGT ATAGGAAAAT TATAAATGAA CAGGAATTAT TGCCTGTACT GAAAAAATAT
GGATATGATA TTATTTATCC TGAAACATAT TCTTTTAAAG AACAGGTAAA ACTTTTTGCT
GAGTCAAATG CGTTAATTTC TATTCATGGA GCAGGGCATA CAAACTGCAT GTTTATGAAG
CAAGATGCTA AAGTGATGGA AATACGAAAT ACTGAATGGG AGTCGCAGCC ACTTTGCTTC
TGGGGGTTGG CAAATATTTT TGAATTAAAG TGGGAATATA TTACAGCCAC ACGGGTAAGT
GAAGTTTCGA ATTTTAATGA TGTTTTTATA GCTCCACATA TATTTGAAGA ATCGTTACGG
ACATTTGAAA ACATTAAATA A
 
Protein sequence
MCLQRICYEK IFLKKKIIAT GTIWMSNMQQ INNASVRPVS AYNTKHLQIP AIALLQSSPL 
PEQAIRTFSR ALLTPTNHLF VNGRYIRTGL ISGFDSRKLS FFQRIYLFLK SRFFTENKSV
KISAVWAHDS WSNNYFHWFN DTLPRLFLLS KQIEDSVAVL PVELSKITFI VESLELLKIE
HQWIDQKKSH RFESLSVLHT ATLQPDINPL LQKQMRDAVF SAMKIDPQER PFRKIYISRA
HARYRKIINE QELLPVLKKY GYDIIYPETY SFKEQVKLFA ESNALISIHG AGHTNCMFMK
QDAKVMEIRN TEWESQPLCF WGLANIFELK WEYITATRVS EVSNFNDVFI APHIFEESLR
TFENIK