Gene CHU_3353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_3353 
Symbol 
ID4185048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3832413 
End bp3833423 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content37% 
IMG OID638073342 
Productvirulence protein 
Protein accessionYP_679932 
Protein GI110639722 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0319779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0105196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA AGCAAAACAT CATTATTTAT AACACCCAAG ATGGGAAAGC TGCTGTTTCC 
TTATATGCAA AAGATGGCTC TGTATGGATG AATCAGCAGC AACTGGCAGA GCTTTTTGAC
ACAACCAAGC AAAATATAAG TCTGCACATT CTTAATATAC TTGAAGAGAA CGAGTTAAAT
GAAGCGGCAG TTGTCAAGGA TTACTTGACA ACTGCCGCGG ATGGAAAAAA CTATAATGTA
ACTTTTTACA GCCTGGATAT GATCCTTGCA ATCGGGTTCA GGGTTAGAAG TAAAAGAGGG
ACACAATTCA GGCAGTGGGC AAATCGTAAC TTAAAAGAGT ACATGGTAAA AGGATTTATC
ATGGACGATG AGCGATTGAA AAATCCGGAT GGCAGACCTG ATTATTTTGA TGAATTGCTG
GCTCGTATCA GAGATATACG TGCTTCTGAG AAAAGATTCT ATCAAAAAGT ACGTGACCTG
TTTGCGCTTA GTAATGATTA CGATAGCACC GACAAAACCA CACAATTGTT TTTTGCCGAA
ACACAAAATA AGCTGTTGTT TGCAATTACA GGAAAAACAG CAGCGGAAAT AATTGTAAGC
AGAGCTAAAG CCGATGAACC CAATATGGCT TTGACCAGTT GGGAAGGAAG TATTGTACGA
AAGCAAGACA CCTTTATTGC TAAAAACTAT TTAACAGATG ATGAAGTTGA TAGTCTTAAT
CGTTTTGTAG TTGTGTTTCT GGAAACCGCT GAATTGAGAG CGAAAAACAG ACAGGATATC
ACAATGAATT TTTGGAGGGA AAACGTTGAT AAAATTATAG CGCTTAACGA TAAACCTATA
CTGAAAGGTA AGGGAAGTAT TAGCCATACA CAGATGGAAA AAATGATAGA GCACGTATAT
AAAACATTTG ATGCGAAACG AAAACTTGAA GATGCTCTGA ATGCGGATGC GGAAGATCTG
AAAGAGATAA AATCATTAGA AGATAAAATT AAGAACAGAA AAAATAAATA G
 
Protein sequence
MQEKQNIIIY NTQDGKAAVS LYAKDGSVWM NQQQLAELFD TTKQNISLHI LNILEENELN 
EAAVVKDYLT TAADGKNYNV TFYSLDMILA IGFRVRSKRG TQFRQWANRN LKEYMVKGFI
MDDERLKNPD GRPDYFDELL ARIRDIRASE KRFYQKVRDL FALSNDYDST DKTTQLFFAE
TQNKLLFAIT GKTAAEIIVS RAKADEPNMA LTSWEGSIVR KQDTFIAKNY LTDDEVDSLN
RFVVVFLETA ELRAKNRQDI TMNFWRENVD KIIALNDKPI LKGKGSISHT QMEKMIEHVY
KTFDAKRKLE DALNADAEDL KEIKSLEDKI KNRKNK