Gene Information |
Locus tag | CHU_1024 |
Symbol | yqiK |
ID | 4183761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 1178109 |
End bp | 1181189 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638071022 |
Product | hypothetical protein |
Protein accession | YP_677641 |
Protein GI | 110637434 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
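The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; the record itself does not state either convention.

```python
# Values copied from the record above.
start_bp = 1178109
end_bp = 1181189

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3081 0 1026
```

Under these assumptions the computed values agree with the Gene Length (3081 bp) and Protein Length (1026 aa) fields.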
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000341432 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC CATTTCCCGG ACTGCGTGCA TTCAACGTTG ATGAAAGTCA TTTATTCTTC GGCAGGGAAG GACAAAGTGA TGAAGTATTG GATAAACTTT CAAAAAATAA ATTTGTAGGT ATTATCGGTG CATCCGGCAG CGGTAAATCT TCATTTATGT TCTGCGGAGT GATTCCGATC TTATACGGAG GTTTTCTTTC TCATGTGGGC CCTAACTGGC ACGTCATCAC CACACGTCCG GGCGGATGTC CCATTGACAA TCTTTCTGAA GCCTTGCTGC AGAAAGATGC AGAATACCTG GTAGCAGATG CAGAAGATAA GCGATTGAAA AAAACGATTA CTTCTACGTT ATTAAAAAGT TCGTCGCTGG GTTTAATTGA AGCGGTAAGA CAACTGCATC TGGATAATGG GAATAACGTT TTGATTGTTG CCGATCAGTT TGAAGAACTG TTCCGTTTTA AACGCACAGA AGATACCAAT ACCACCAATG AATCCTTGGC ATACATTAAT TTGCTGATGG AAGCGGTTAA AGATCTGAAA TCGAATATTT ACGTGGTAAT CACCATGCGT TCGGATTTTA TTGGCGACTG TGCACAGTTT CCTGAATTGA CAAAATACAT CAACGAGAGT CATTACCTGA TTCCTCAGAT GGTTCGAGAG CAGAAGCGTC TGGCCATTGA AGGACCGGTA GCGGTAGGTG GTGCGGAGAT CTCTTCCCGC CTTGTACAGC AGCTGTTAAA TGATCTGGGC GATAATCCCG ACCAGCTGCC GATCATTCAG CACGCGTTAA TGCGTACCTG GACATTCTGG GCAGAGAATC ACGAGCCCGA AGAAATTCTG GACCTGAGAC ATTACGAAGC AATCGGTTCC ATGTCGGGTG CTTTATCTCA ACACGCGGAT GAAGCATACG ATGAACTTAC CGAACAGCAG AAATTTTACT GCGAGATTTT ATTTAAAACA TTAACAGAGA AAGGTTCTGA TTCGGCTGGT ATACGCAGGC CCACAAAACT TTCCACTATT GCGAATGTTG CCGGCTGCAG TGAAGACGAT ATGGCGCATA TCATTGATCA CTTCCGTATA GAAGGCCGTT CGTTATTAAT GCCTCCGGTG CATGTTGCAT TGGCTTCAGA TAGCATCATT GATATTTCAC ATGAAAGTCT CATGCGTATC TGGACCCGGT TGAAAAAATG GGTAGAAGAA GAAGGTGAAT CTGCTCAGAT GTATATTCGT TTGTCTGATG CGGCATCCAG TTATCAGATC GGAAAGGCAG GTTTATGGAG GCCACCCGAT TTGCAGCTTG CATTAAACTG GAAACAGAAA AACAAACCTA CATTGGTTTG GGCGCAGCGC TACGACTCTG CCTTTGAGCG TGCAATGGTA TTTTTGGATA CAAGTAATGA GGCACATGAA GAAGAGCAGC GTTTAAAAGA ACTTGCACAA AAGCAAGCTT TAAAACGTGC CCGTATCATA GCGTTGGTTC TTGGTATCTT TACCATCATG GCGCTTGGTT CACTGGTATT CTCTATTCTG AAAACCGTTG AAGCGAACAA TCAGCGGATC GAGGCAGAAA AGCAAAAAGG GATCGCGTTG ACGCAGGAAG CTGAAGCTAA AAAGCAGACC GGTATTGCTG AGCAGCAAAC ACTGGAAGCG CAGAGGCAGA AAGATTTTGC GGTACTTGCT GCTGAAGAAG CAAAACATCA GCAAGGTATA GCGGAGAAAA ACTTTAACGA AGCACAGAAG CAGCGGAACA TCGCTGTAAC CTATGCTACA GAAGCAACCA AACAGCAGAA GATCGCCGAG 
ATGCAGCGCG AGCTTGCAGT GAAAAATGAG AAGCAGGCGA AACAGGAAAA GCAACGTGCC GATAAACTTC GTTTGCTGTC TATTGCCCAG TCTATGGCAG TGAAATCTGT GCAGATGGAA GAGGATACGA TGCGCAAAGC ATTACTGGCT TTTCAGGCGT ATGAATTCTA TAAAGAAAAT GGCGGAAATG TAAACCAGCA TGAAATCTAT GACGGTTTGT ATTATGCGCT GAAAAATCTG AAAGTAAAAG ATTACAATGC CCTGCATGCA CATAAAGATG CCGTGCGGTC AATTGCATAT ACAGCAGATG GCAAAGGCAT GTACACAGCC GGAAGTGATG GTAAGATCTT CAGCTGGGAT ATGACTGCTG CAAATCCGAA ACCAAAGACT GTATACAATA CCAATTATGC ACTCGGAGCA TTGTCTTTAA GTTCGAACGG TACCATGCTT GCCAGCGGCG GAGCATCACC GAATATTCGT ATGGTTAATC TTACGAAAGA AGGTGACGCA CCTTTACTAT TGAAAGGCCA TAAAAAAACG GTATTGTACA CAGCATTTTC ACCGGACAAT CAAACGCTGG TATCTGCCAG TGCAGACAGT ACGGTAATGA TCTGGAATCT TTCTTCCGGT GTACCGATAA CCGTTTACAG AGACAGACAT AATATCAAAG CAGTATCGCT GCACCCGAAA GGAAGAGTAA TTGCGGTCGC CAATGATAAA GGCGAAACCA TGATTATCTC CCTGTATAAT GAATTTACAC CCTATCTGAT TGATAGAGGA ACGACAGCAG ACTACAGCCT GCAATACAGC CACGATGGGG AGTTTCTTGC GATTGCCAAT AATTCAGGCC TGATCAAGAT CATGGATGTA GAAGGCAGAA GACTGGTTGT TGCATTACCC GGACATAAGG CACGCGTGAA CGAAATGAAA TTCAGTAAAG ATGATTCCAA ACTTGCTTCT GCAAGCTTTG ATGGTACCAT CCGCGTATGG GATCTTTCTG AATTGAGTGA GCAGCCGTTG ATCTTAAAAG ATCACACGAA CTGGGTATGG TCTATGACCT TTAATGCAGA AGGCGATAAG CTGATTGCCG GCTGCGGTGA TAACCTGATC CGGATCTGGC CGACAAGCAG CAAGATCATG GCCGATCAGA TGTGCGACCT GATCAAACGA AACATGACCG GTGCGGAATG GAAACGCTAT GTAGCAAAAG ATGGTGTACC GTATGAATTA ACCTGTCCGG CTTTACCAAG CCAGGAAACC AATATGAATA CAATTTACTA A
|
Protein sequence | MQNPFPGLRA FNVDESHLFF GREGQSDEVL DKLSKNKFVG IIGASGSGKS SFMFCGVIPI LYGGFLSHVG PNWHVITTRP GGCPIDNLSE ALLQKDAEYL VADAEDKRLK KTITSTLLKS SSLGLIEAVR QLHLDNGNNV LIVADQFEEL FRFKRTEDTN TTNESLAYIN LLMEAVKDLK SNIYVVITMR SDFIGDCAQF PELTKYINES HYLIPQMVRE QKRLAIEGPV AVGGAEISSR LVQQLLNDLG DNPDQLPIIQ HALMRTWTFW AENHEPEEIL DLRHYEAIGS MSGALSQHAD EAYDELTEQQ KFYCEILFKT LTEKGSDSAG IRRPTKLSTI ANVAGCSEDD MAHIIDHFRI EGRSLLMPPV HVALASDSII DISHESLMRI WTRLKKWVEE EGESAQMYIR LSDAASSYQI GKAGLWRPPD LQLALNWKQK NKPTLVWAQR YDSAFERAMV FLDTSNEAHE EEQRLKELAQ KQALKRARII ALVLGIFTIM ALGSLVFSIL KTVEANNQRI EAEKQKGIAL TQEAEAKKQT GIAEQQTLEA QRQKDFAVLA AEEAKHQQGI AEKNFNEAQK QRNIAVTYAT EATKQQKIAE MQRELAVKNE KQAKQEKQRA DKLRLLSIAQ SMAVKSVQME EDTMRKALLA FQAYEFYKEN GGNVNQHEIY DGLYYALKNL KVKDYNALHA HKDAVRSIAY TADGKGMYTA GSDGKIFSWD MTAANPKPKT VYNTNYALGA LSLSSNGTML ASGGASPNIR MVNLTKEGDA PLLLKGHKKT VLYTAFSPDN QTLVSASADS TVMIWNLSSG VPITVYRDRH NIKAVSLHPK GRVIAVANDK GETMIISLYN EFTPYLIDRG TTADYSLQYS HDGEFLAIAN NSGLIKIMDV EGRRLVVALP GHKARVNEMK FSKDDSKLAS ASFDGTIRVW DLSELSEQPL ILKDHTNWVW SMTFNAEGDK LIAGCGDNLI RIWPTSSKIM ADQMCDLIKR NMTGAEWKRY VAKDGVPYEL TCPALPSQET NMNTIY
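The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is not the database's own pipeline; it builds the standard codon table from the usual TCAG ordering, on the assumption that translation table 11 assigns the same amino acids to internal codons as the standard code (the two differ mainly in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code in NCBI "TCAG" order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses these same amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence, copied from the record.
print(translate("ATGCAAAATC CATTTCCCGG ACTGCGTGCA"))  # prints: MQNPFPGLRA
print(translate("AATATGAATA CAATTTACTA A"))           # prints: NMNTIY
```

Both fragments match the corresponding ends of the protein sequence listed above. The last 21 bases are in frame because the gene length is a multiple of three.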