Gene CHU_1024 details


Gene Information

Locus tag: CHU_1024
Symbol: yqiK
ID: 4183761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 1178109
End bp: 1181189
Gene Length: 3081 bp
Protein Length: 1026 aa
Translation table: 11
GC content: 45%
IMG OID: 638071022
Product: hypothetical protein
Protein accession: YP_677641
Protein GI: 110637434
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
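
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 1178109
end_bp = 1181189
reported_gene_length = 3081      # bp
reported_protein_length = 1026   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches record:", gene_length == reported_gene_length)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches record:", protein_length == reported_protein_length)
```

Under these assumptions the three reported numbers agree with each other.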


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000341432
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAATC CATTTCCCGG ACTGCGTGCA TTCAACGTTG ATGAAAGTCA TTTATTCTTC 
GGCAGGGAAG GACAAAGTGA TGAAGTATTG GATAAACTTT CAAAAAATAA ATTTGTAGGT
ATTATCGGTG CATCCGGCAG CGGTAAATCT TCATTTATGT TCTGCGGAGT GATTCCGATC
TTATACGGAG GTTTTCTTTC TCATGTGGGC CCTAACTGGC ACGTCATCAC CACACGTCCG
GGCGGATGTC CCATTGACAA TCTTTCTGAA GCCTTGCTGC AGAAAGATGC AGAATACCTG
GTAGCAGATG CAGAAGATAA GCGATTGAAA AAAACGATTA CTTCTACGTT ATTAAAAAGT
TCGTCGCTGG GTTTAATTGA AGCGGTAAGA CAACTGCATC TGGATAATGG GAATAACGTT
TTGATTGTTG CCGATCAGTT TGAAGAACTG TTCCGTTTTA AACGCACAGA AGATACCAAT
ACCACCAATG AATCCTTGGC ATACATTAAT TTGCTGATGG AAGCGGTTAA AGATCTGAAA
TCGAATATTT ACGTGGTAAT CACCATGCGT TCGGATTTTA TTGGCGACTG TGCACAGTTT
CCTGAATTGA CAAAATACAT CAACGAGAGT CATTACCTGA TTCCTCAGAT GGTTCGAGAG
CAGAAGCGTC TGGCCATTGA AGGACCGGTA GCGGTAGGTG GTGCGGAGAT CTCTTCCCGC
CTTGTACAGC AGCTGTTAAA TGATCTGGGC GATAATCCCG ACCAGCTGCC GATCATTCAG
CACGCGTTAA TGCGTACCTG GACATTCTGG GCAGAGAATC ACGAGCCCGA AGAAATTCTG
GACCTGAGAC ATTACGAAGC AATCGGTTCC ATGTCGGGTG CTTTATCTCA ACACGCGGAT
GAAGCATACG ATGAACTTAC CGAACAGCAG AAATTTTACT GCGAGATTTT ATTTAAAACA
TTAACAGAGA AAGGTTCTGA TTCGGCTGGT ATACGCAGGC CCACAAAACT TTCCACTATT
GCGAATGTTG CCGGCTGCAG TGAAGACGAT ATGGCGCATA TCATTGATCA CTTCCGTATA
GAAGGCCGTT CGTTATTAAT GCCTCCGGTG CATGTTGCAT TGGCTTCAGA TAGCATCATT
GATATTTCAC ATGAAAGTCT CATGCGTATC TGGACCCGGT TGAAAAAATG GGTAGAAGAA
GAAGGTGAAT CTGCTCAGAT GTATATTCGT TTGTCTGATG CGGCATCCAG TTATCAGATC
GGAAAGGCAG GTTTATGGAG GCCACCCGAT TTGCAGCTTG CATTAAACTG GAAACAGAAA
AACAAACCTA CATTGGTTTG GGCGCAGCGC TACGACTCTG CCTTTGAGCG TGCAATGGTA
TTTTTGGATA CAAGTAATGA GGCACATGAA GAAGAGCAGC GTTTAAAAGA ACTTGCACAA
AAGCAAGCTT TAAAACGTGC CCGTATCATA GCGTTGGTTC TTGGTATCTT TACCATCATG
GCGCTTGGTT CACTGGTATT CTCTATTCTG AAAACCGTTG AAGCGAACAA TCAGCGGATC
GAGGCAGAAA AGCAAAAAGG GATCGCGTTG ACGCAGGAAG CTGAAGCTAA AAAGCAGACC
GGTATTGCTG AGCAGCAAAC ACTGGAAGCG CAGAGGCAGA AAGATTTTGC GGTACTTGCT
GCTGAAGAAG CAAAACATCA GCAAGGTATA GCGGAGAAAA ACTTTAACGA AGCACAGAAG
CAGCGGAACA TCGCTGTAAC CTATGCTACA GAAGCAACCA AACAGCAGAA GATCGCCGAG
ATGCAGCGCG AGCTTGCAGT GAAAAATGAG AAGCAGGCGA AACAGGAAAA GCAACGTGCC
GATAAACTTC GTTTGCTGTC TATTGCCCAG TCTATGGCAG TGAAATCTGT GCAGATGGAA
GAGGATACGA TGCGCAAAGC ATTACTGGCT TTTCAGGCGT ATGAATTCTA TAAAGAAAAT
GGCGGAAATG TAAACCAGCA TGAAATCTAT GACGGTTTGT ATTATGCGCT GAAAAATCTG
AAAGTAAAAG ATTACAATGC CCTGCATGCA CATAAAGATG CCGTGCGGTC AATTGCATAT
ACAGCAGATG GCAAAGGCAT GTACACAGCC GGAAGTGATG GTAAGATCTT CAGCTGGGAT
ATGACTGCTG CAAATCCGAA ACCAAAGACT GTATACAATA CCAATTATGC ACTCGGAGCA
TTGTCTTTAA GTTCGAACGG TACCATGCTT GCCAGCGGCG GAGCATCACC GAATATTCGT
ATGGTTAATC TTACGAAAGA AGGTGACGCA CCTTTACTAT TGAAAGGCCA TAAAAAAACG
GTATTGTACA CAGCATTTTC ACCGGACAAT CAAACGCTGG TATCTGCCAG TGCAGACAGT
ACGGTAATGA TCTGGAATCT TTCTTCCGGT GTACCGATAA CCGTTTACAG AGACAGACAT
AATATCAAAG CAGTATCGCT GCACCCGAAA GGAAGAGTAA TTGCGGTCGC CAATGATAAA
GGCGAAACCA TGATTATCTC CCTGTATAAT GAATTTACAC CCTATCTGAT TGATAGAGGA
ACGACAGCAG ACTACAGCCT GCAATACAGC CACGATGGGG AGTTTCTTGC GATTGCCAAT
AATTCAGGCC TGATCAAGAT CATGGATGTA GAAGGCAGAA GACTGGTTGT TGCATTACCC
GGACATAAGG CACGCGTGAA CGAAATGAAA TTCAGTAAAG ATGATTCCAA ACTTGCTTCT
GCAAGCTTTG ATGGTACCAT CCGCGTATGG GATCTTTCTG AATTGAGTGA GCAGCCGTTG
ATCTTAAAAG ATCACACGAA CTGGGTATGG TCTATGACCT TTAATGCAGA AGGCGATAAG
CTGATTGCCG GCTGCGGTGA TAACCTGATC CGGATCTGGC CGACAAGCAG CAAGATCATG
GCCGATCAGA TGTGCGACCT GATCAAACGA AACATGACCG GTGCGGAATG GAAACGCTAT
GTAGCAAAAG ATGGTGTACC GTATGAATTA ACCTGTCCGG CTTTACCAAG CCAGGAAACC
AATATGAATA CAATTTACTA A
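
The gene sequence above is printed in blocks of 10 bases, six blocks per line. A minimal sketch of how such a listing could be joined into a plain string and its GC fraction computed follows. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input, so the value it prints describes that line and need not equal the 45% reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGCAAAATC CATTTCCCGG ACTGCGTGCA TTCAACGTTG ATGAAAGTCA TTTATTCTTC"

seq = join_blocks(first_line)
print(len(seq), "bases, GC fraction", gc_fraction(seq))
```

To check the reported GC content, the same two functions can be applied to the full listing pasted in as one multi-line string.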
 
Protein sequence
MQNPFPGLRA FNVDESHLFF GREGQSDEVL DKLSKNKFVG IIGASGSGKS SFMFCGVIPI 
LYGGFLSHVG PNWHVITTRP GGCPIDNLSE ALLQKDAEYL VADAEDKRLK KTITSTLLKS
SSLGLIEAVR QLHLDNGNNV LIVADQFEEL FRFKRTEDTN TTNESLAYIN LLMEAVKDLK
SNIYVVITMR SDFIGDCAQF PELTKYINES HYLIPQMVRE QKRLAIEGPV AVGGAEISSR
LVQQLLNDLG DNPDQLPIIQ HALMRTWTFW AENHEPEEIL DLRHYEAIGS MSGALSQHAD
EAYDELTEQQ KFYCEILFKT LTEKGSDSAG IRRPTKLSTI ANVAGCSEDD MAHIIDHFRI
EGRSLLMPPV HVALASDSII DISHESLMRI WTRLKKWVEE EGESAQMYIR LSDAASSYQI
GKAGLWRPPD LQLALNWKQK NKPTLVWAQR YDSAFERAMV FLDTSNEAHE EEQRLKELAQ
KQALKRARII ALVLGIFTIM ALGSLVFSIL KTVEANNQRI EAEKQKGIAL TQEAEAKKQT
GIAEQQTLEA QRQKDFAVLA AEEAKHQQGI AEKNFNEAQK QRNIAVTYAT EATKQQKIAE
MQRELAVKNE KQAKQEKQRA DKLRLLSIAQ SMAVKSVQME EDTMRKALLA FQAYEFYKEN
GGNVNQHEIY DGLYYALKNL KVKDYNALHA HKDAVRSIAY TADGKGMYTA GSDGKIFSWD
MTAANPKPKT VYNTNYALGA LSLSSNGTML ASGGASPNIR MVNLTKEGDA PLLLKGHKKT
VLYTAFSPDN QTLVSASADS TVMIWNLSSG VPITVYRDRH NIKAVSLHPK GRVIAVANDK
GETMIISLYN EFTPYLIDRG TTADYSLQYS HDGEFLAIAN NSGLIKIMDV EGRRLVVALP
GHKARVNEMK FSKDDSKLAS ASFDGTIRVW DLSELSEQPL ILKDHTNWVW SMTFNAEGDK
LIAGCGDNLI RIWPTSSKIM ADQMCDLIKR NMTGAEWKRY VAKDGVPYEL TCPALPSQET
NMNTIY
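
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is an illustration, not the pipeline that produced this record: `translate` is a hypothetical helper, and the codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). It translates only the first 30 and last 21 bases of the gene sequence.

```python
from itertools import product

# Standard codon table, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Start and end of the gene sequence, copied as printed.
first_30_bases = "ATGCAAAATC CATTTCCCGG ACTGCGTGCA"
last_21_bases = "AATATGAATA CAATTTACTA A"

print(translate(first_30_bases))
print(translate(last_21_bases))
```

The first result can be compared with the first block of the protein sequence and the second with its final six residues; the trailing `*` corresponds to the stop codon that ends the gene.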