Gene CHU_3233 details

Gene Information

Locus tag: CHU_3233
Symbol:
ID: 4184329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 3686594
End bp: 3689740
Gene Length: 3147 bp
Protein Length: 1048 aa
Translation table: 11
GC content: 38%
IMG OID: 638073226
Product: WD-40-like repeat-containing protein
Protein accession: YP_679816
Protein GI: 110639606
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
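
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3686594, 3689740)
print(gene_len, protein_len)  # prints: 3147 1048
```

Both values agree with the Gene Length and Protein Length fields of the record.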


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCTA GCTATAGAGG TATACAAATC GTTACCCTTT TATTTTTTCT TCTATCAACA 
TTTACTTCTG TTGCTCAGAC GTCGCAGGAT CAGTTTGGGA AAAATAGAAT TCAATACAAA
AAATTTGATT GGAAACTTAT TTCCACAGAA AATTTTGACA TCTATTATTA TTACGAAGGT
GAATCCTTTG CGGGCATAGC TGCAACACAC GCGGAATCTG TATTTAAAAA ACTTGAAAAT
TCATTAGGCT ATTCAAATTA CAATAAAACT AAAATACTCC TGTATAATTC GGTTGCTGAT
CTGCAGCAAA GCAATATAGG CTTGCAGCAG GGTACACAGG TAGGTGGATC CACAAATCTT
GTGAAAGCAA CTGTGGAAGT TGCTTTTGAA GGCGATTTAA TGACATTCAG AAACGACATT
ACACAAGGGA TTGCACGCAC ATGGATAAAT ATTTTACTAT ATGGCGGTAG TTTCAAGGAA
GTAGTTCAAA GCTCGTATTT CTTATCCCTG CCAGATTGGT ATGTAAAAGG GGCATCGGCA
TACATCGCAC GAGGATGGGA TAGAGAGATG GATAATTATA TCCGCGATCT GGTAATAAAC
AATAAATTAA GAAACCCCTC TTCGTATTCC GGGGAAGAAG CAGAATGGAT CGGACAGTCA
ATCTGGCATT TTATATCCGA TCGATATGGC CCGGATATGT TTGCAAATAT TTTACAGATC
ACCCGCATAT ACAGAAGTGA TAAGGCTTCC ATAGAGGCAG CTACGGGGTC ATACTTTCAG
ACATTCATTG ATAACTGGAA AGCGTATTAT AAAGAAAATG CCGTTGCTCT GGATGGAGCT
ATCCTATTTG ATACAACGTA TAAATCGTTA AACGGAAAAA ATTTTAAAGG ACGCGATATT
ACTACACCAG TGTTAAGTAC AGATGGTAAA TACCTGGCTT ACTCTGTTAA CAACAAAGGG
AGATACAGCG TTTTACTAAG GGAAGTTGAA AAAAGAAAAA AGAAAAAAAT TGCAGGCGGT
GGTTTTAAAT TAAACGACCA GCAGCCGATT ACCCGTAATC CATTACTGGC GTGGCAAACA
GAATCCATCT TAGCAAGCAT TACCTACAAA AAGGGTAAGA TAATTTACGA AGTGGTTGAT
GTAACAAAAA AGAAGCGCAT TGAAAAACGT GAAATAGAAG TGTTTCATCA GATCAACAGT
TATGCCTTTA GTACAGATGG AAAATATGTG GTGTATAGTG CCGATGAAAA AGGGCAATCA
GATATATGGC TGTACGATCT TGACCGGTCT AAGTATTTAA AAGTTACCAA CGATGAATAC
GATGATCATT CGCCGCGGTT TGGGCCGGGC AATGCGGTGA TCTTTTCATC CAACAGAAAC
GACGATACAC TGTTTACATT AACACGTTAT CAGTCGGCAC CTAATTATTC GCTTTACATA
TATAAGCCCG GCGATAAAGT TTTAAAACGT ATTCCGGCAC AAGCATCTAA TTTTACAAAA
CCTGCATTTA TCAATGCCAC AACCATCCTT TATTTAAATG ATGAAAATGG CGTGCAGAAT
TTGTATACAC AGGATCTGAA GACAGGCGTA TCAAAGCCCT TAACAAATAA TACAACAGAT
ATTTTAGAAT ACGATTTTAA TGCAGCAAAC AATTATCTTG CTTTCATTGC ACGATCAAGA
GGCAAAGAAA AGATTTTTAT TGATACAGCC TTTTCGTTTA CCAATGCGGT TACCATAACC
GGCAAAACAG ATCAGTCAGC GGTGATGCGT AAAAAAATAC TGCCTCCTGA AAAACCAAAA
CAAACAAACA ACCTGATTGT TGTATCACCT AAGAAAAATG ACGATGATAT TGATCTGGAT
AAAGTATATT TTGAATCCGA CACGACACTG AAACTGCCTA AAAAAGTTAC ACCGGCAGAG
GCAAGTAATA CACCGCCTAA GCGTACTATT ATTCCTTTTT ATGGCCCGTA CCCGTATAAG
AATTTATTCG GAACGGATAT GGTTGCTACT ACCTTACAGG TTGATCCGTT AAGAGGATTT
GGTTTGCTGC TGGAATCGCA GATGTCTGAT CTGATGTCTA ATCACAGAGT TAATATGGGT
TTCTTTGGTG TGGCTGATTT TAAAAGCAGC AGTTTATACG GCGAATATGC GTATTTGAAA
AACCGTGTTG ATCTAAGCAC ACGTTTCGAC CGCATCACAT ATTTTCCTTC AAACGGAAGT
GTAAGTCACC GCTATGTGGT AAACAAAGTA GAAATTAAAG CATCCTATCC GTTATCTGTT
TACAGCAGAG TAAGTGTTTC GCCGTTTGCT CAACAAGTAC GGTTTTCAGA TGTAACTGAT
TTTAATACGG CCGTTACATA TCAGGATGAA AAGAAAGCCT ATGTTGGTGC AAAATTCGAA
TTCGTGTACG ACAATACAAT CAGTTCAGGT ATGAATATGA TTGAAGGAAC ACGAGCAAAA
ATTTTATTTG AAACAAATAA GAATACCGTT TCAAAAGCAG AAAACTTCAA CCGTTTTAAT
ATTGATTTAC GCCACTATCA AAAAATTCAC AGAAGTTCTG TGCTTGCATT GAGAGCATCG
TACGGAAGAT TTTTCGGAGC AGCACCAAAA ACATTTATCT TCGGTGGTAT GGATAACTGG
TTTTTTAATT CTGTCGGAAC GGGTGGGGTA ACAAACCCGT TATCGTATGA AACAGGTGTA
AACAATTCAG ACATTATGTT TGATAAATAT GTTACTAATG TGCGCGGATT TAAATACAAC
GATCAATCCG GACAAAGTTA TTTCCTGTTA AATGCAGAAT TCAGACTGCC GATTATTAAA
TATTTATATA GCGGATCGGT ATCATCTGCA TTCTTACGCA ATTTCCAATT GGTCGCATTT
ACAGATGTAG GTGCCGCATG GGATGGGGCA AATCCTTTTT CAACAAACAA TTCGTTGAAC
AGAAAAATTA TTGCAGCCGG TGCCAACAAT TCATCTCCAT TTGAGATAGA AGTAAATAAT
TACAGAAATC CATTTTTATA TGGTTACGGT TTGGGAGCCC GTACATTTTT GTTTGGCTAT
TATATGAAGG GCGATTTAGC CTGGGGTATA AAAGATGGCG AACAACTTGC GCCAAGATTT
TACTTTACAT TCGGATACGA TTTTTAG
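
The GC content field of the record can be recomputed from a sequence laid out in blocks of ten, as above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the short test string is just the first ten-base block of the gene sequence.

```python
def gc_content(dna):
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First ten-base block of the gene sequence: 2 G and 3 C out of 10 bases.
print(gc_content("ATGCAGCCTA"))  # prints: 0.5
```

Passing the full gene sequence text to the same function gives the whole-gene value for comparison with the GC content field.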
 
Protein sequence
MQPSYRGIQI VTLLFFLLST FTSVAQTSQD QFGKNRIQYK KFDWKLISTE NFDIYYYYEG 
ESFAGIAATH AESVFKKLEN SLGYSNYNKT KILLYNSVAD LQQSNIGLQQ GTQVGGSTNL
VKATVEVAFE GDLMTFRNDI TQGIARTWIN ILLYGGSFKE VVQSSYFLSL PDWYVKGASA
YIARGWDREM DNYIRDLVIN NKLRNPSSYS GEEAEWIGQS IWHFISDRYG PDMFANILQI
TRIYRSDKAS IEAATGSYFQ TFIDNWKAYY KENAVALDGA ILFDTTYKSL NGKNFKGRDI
TTPVLSTDGK YLAYSVNNKG RYSVLLREVE KRKKKKIAGG GFKLNDQQPI TRNPLLAWQT
ESILASITYK KGKIIYEVVD VTKKKRIEKR EIEVFHQINS YAFSTDGKYV VYSADEKGQS
DIWLYDLDRS KYLKVTNDEY DDHSPRFGPG NAVIFSSNRN DDTLFTLTRY QSAPNYSLYI
YKPGDKVLKR IPAQASNFTK PAFINATTIL YLNDENGVQN LYTQDLKTGV SKPLTNNTTD
ILEYDFNAAN NYLAFIARSR GKEKIFIDTA FSFTNAVTIT GKTDQSAVMR KKILPPEKPK
QTNNLIVVSP KKNDDDIDLD KVYFESDTTL KLPKKVTPAE ASNTPPKRTI IPFYGPYPYK
NLFGTDMVAT TLQVDPLRGF GLLLESQMSD LMSNHRVNMG FFGVADFKSS SLYGEYAYLK
NRVDLSTRFD RITYFPSNGS VSHRYVVNKV EIKASYPLSV YSRVSVSPFA QQVRFSDVTD
FNTAVTYQDE KKAYVGAKFE FVYDNTISSG MNMIEGTRAK ILFETNKNTV SKAENFNRFN
IDLRHYQKIH RSSVLALRAS YGRFFGAAPK TFIFGGMDNW FFNSVGTGGV TNPLSYETGV
NNSDIMFDKY VTNVRGFKYN DQSGQSYFLL NAEFRLPIIK YLYSGSVSSA FLRNFQLVAF
TDVGAAWDGA NPFSTNNSLN RKIIAAGANN SSPFEIEVNN YRNPFLYGYG LGARTFLFGY
YMKGDLAWGI KDGEQLAPRF YFTFGYDF
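
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below assumes the NCBI table 11 codon assignments, which match the standard code for internal codons (the table differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCAGCCTA GCTATAGAGG TATACAAATC"))  # prints: MQPSYRGIQI
```

The result matches the first ten residues of the protein sequence above.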