Gene CHU_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_0228 
Symbol 
ID4185529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp277701 
End bp278954 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content43% 
IMG OID638070238 
Producthypothetical protein 
Protein accessionYP_676860 
Protein GI110636653 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.008996 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCAGACA GCATCCAGGA AAAACTTCAG ATACTTGCAG ATGCAGCGAA ATACGACGTA 
TCCTGCTCTT CAAGTGGCAG CAAACGCAAA AATCACAACA AAGGCTTAGG CGATACGGGC
AATGGTATCT GCCATACATA TACCGAAGAC GGACGCTGCG TATCGTTGTT AAAGATCCTG
CTTACAAATG TATGTATCTA CGATTGTGCT TATTGTGTTA CCCGCAAAAG CAACGATATA
CAGCGCGCAG CTTTTACCGT GCAGGAAGTC GTTGATCTCA CCATCAATTT TTACCGGCGC
AATTATATTG AAGGATTATT TTTAAGTTCA GGTATTTTTA AAAATGCCGA TTATACCATG
GAACGCCTGG TACTGATCGC AAAGAAACTA CGCACCGAAC ATAGGTTCAA CGGATACATT
CACCTTAAAT CCATTCCCGG GGCCAGCGAC GAAATCATGC ATGAAGCAGG CCTCTACGCG
GACCGCTTAA GCATTAACAT TGAGATCCCT ACTGAAACGG GCTTGAAATT ACTGGCTCCC
GACAAGAACA GAACCGATAT GATTCAGCCG ATGACGTATC TGAAAAATGA AATCATCCTG
AAGCAAGATG AAAAAAAACT ATTTAAGAAA GCGCCTGTGT TTGCTCCTGC CGGACAAAGT
ACGCAAATGA TTATCGGTGC TGCGAAAGAA TCAGATAAAG ATATTATGCA GCTTTCTGCA
AGCTTTTATA AAAACTTTAA TCTGAAAAGG GTGTACTATT CCGGCTATGT ACCGATCAGT
AACGACGGAC GATTACCGGG CATTGGCAGT GCCGTGCCTA TGGTACGTGA AAACAGACTA
TACCAGACGG ATTGGCTGCT GCGCTTCTAT GGCTTTAAGG TAGATGAAAT TGTAAACGAT
CAGCATCCGA ATCTGGATCT GGATATTGAT CCGAAATTAA GCTGGGCACT GCGCAACCTA
AATGTTTTTC CTATTGATAT AAACAAGGCC GACATTCAGC TAATCCTTCG TGTACCGGGC
ATAGGCCTTC AATCTGCACA AAAAATTACT GCAGCGCGAA AGTTTCAGAA ATTAAATTGG
GAACATCTGA AGAAGATCGG TATTGCGGTA AACCGTGCAA AGTATTTTAT TACCTGCAGC
AGCAGCGAGT TTGAGCGCAG GGATTTAACG GAGGCACGCA TCAAACAGTT TATATTGTCG
GGTTCAAGTT CCAAATATTT AAAAACTGCC AGCCAGCAAT TAGTCCTTTT CTGA
 
Protein sequence
MSDSIQEKLQ ILADAAKYDV SCSSSGSKRK NHNKGLGDTG NGICHTYTED GRCVSLLKIL 
LTNVCIYDCA YCVTRKSNDI QRAAFTVQEV VDLTINFYRR NYIEGLFLSS GIFKNADYTM
ERLVLIAKKL RTEHRFNGYI HLKSIPGASD EIMHEAGLYA DRLSINIEIP TETGLKLLAP
DKNRTDMIQP MTYLKNEIIL KQDEKKLFKK APVFAPAGQS TQMIIGAAKE SDKDIMQLSA
SFYKNFNLKR VYYSGYVPIS NDGRLPGIGS AVPMVRENRL YQTDWLLRFY GFKVDEIVND
QHPNLDLDID PKLSWALRNL NVFPIDINKA DIQLILRVPG IGLQSAQKIT AARKFQKLNW
EHLKKIGIAV NRAKYFITCS SSEFERRDLT EARIKQFILS GSSSKYLKTA SQQLVLF