Gene Information |
Locus tag | CHU_1568 |
Symbol | |
ID | 4185602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 1838473 |
End bp | 1843530 |
Gene Length | 5058 bp |
Protein Length | 1685 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638071562 |
Product | hypothetical protein |
Protein accession | YP_678179 |
Protein GI | 110637972 |
COG category | |
COG ID | |
TIGRFAM ID | |
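
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1838473
END_BP = 1843530


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                           # 5058, matches "Gene Length"
print(expected_protein_length(length))  # 1685, matches "Protein Length"
```

Both results agree with the table, which is consistent with the gene being reported as unspliced and not a pseudogene.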
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.518804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.238693 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTGTAC AAAGCGCTAT AAAGACAACT ATACTAAGCG TTTTTTTTGT TCTCGCAAGC CTGTTAAGCC TTGCTCAGAA TGGTACGGAA TGGATCATAC CTAATCAGAT TTATTATAAG GTTAAAACCT ATCAAAACGG TATTTATGCA ATACCGTATG CAAAACTCCA ATCAGCTGGT ATCAATACCG CGGATCTTAC AAATCTTCAG ATGTGGTATC GTGGTAAAGA GCAATCCATT TTATTGCAGC ATGACAGTTT ATTTTTCCTA GGTAAAAGGA ATGATGGAAA GCTGGATTCT CTGTTATATG CGAACAGTGC GAACCAGCCG CATGCCTATT ACAGCATCTG GTCTGACACA TCAGCATATT TTTTTACCAT CGGTACACAA CCAGGGAAAC GCATTGCAGT GAGTGCTGTT CCTTCCGTGA CAACTACTGA ATCCTGGTAT TACGAAGAAG CATTATCTAT CTATACAGAT CTCTATTATC CGGGAGCCTA TTATTCCATA GAAACATTGA AAAGTGATTA CGATGTTGGA GAAGGATGGT TTGGCGATAA AATAGCCAAA GGCGGAAATC CTGTAACGGG TAGTTATACG ATACCAATTT CTAGTGCCTA TACACAGACA GCTGCCGATG TAAAAATAGA AGTACAGATT GTTGGGGAGT GGAACAATTC AAACCATGTA GCATCCCTAC TGATCGGTAA TGCAAATACT CCGGATTATA CATTTAATTT TTCAGATTTT AGTGCGTACG AATTCCGTAA AATTTCAGCC GTAATTCCCA ATTCTTATTT AACAGGTAAA AATTCCATAG CCTGTACTAT ACGGCTTTCC AGTGCGGGTA CAGATGTTGT AGCGGTATCG TATATTAAAA TTACCTATCC CAGAAACTAT CTGTTAACCA ATTCAAAAGA TCTTTCTGTT ATTCTTCCTG AAAATGGTAA TCAAAACAGA ACCGTTAATT TGTCTAATGT TACAGGCAAT CTATTCATAT TAGATATAAC AGACGAGCTG AATCAGGAAC GCATACCTTA TACGCAGGCC GGAGCTACTG CATCCGTTGT TATTCCAAAT ACAACTAAAA CTTTTTTTGT TTTTACTAAT CGGTATTATT CAATTTCTCA GGTTGAACTT GTAAACATGT CTCTGCCGAA CCTGAGCAAA GACTATTTTA TTATATATCC TGAAGTATTT AATTCATCGG CCTTGAGTTA TGATCAATAC CGAAGTTCAA ATGCCGGTGG TAATTATGAT GTAGAACGCT GTTCGTTTGA AAAGCTATGT AATCTGTATA GCTATGGAGA GTATTCTTCT ATAGCTATAA AAAGATATTG TTCTGAGATT ATTCAAGTCA ATTCAAAAGA AAAATATTTA TTGATACTTG GAAAAGGTGT GGTGCCTTCT ATGAGTAATT TTATTAATAA TGTCGGCAGC GTTGTTTACC GGAAGAACCC TTCTTTTTAC TGGACAAATA CGGATTTTAA AAATCATTTT GTAAATCTTG TTCCGCCATT TGGTGAGCCC GGTTCAGATT TAATGTTTAG CGTAGACAAT AAGTATGCAG CACAGATCCA TACCGGCAGA GTACCTGCGC GCACCAATAG TGAGGTATTG GGATATCTGG AAAAAGTTCA GGCTCACGAA TCCCTGGATT CAACATTGTT GTGGCGGAAG AATTTAATTC ATTTAAGCGG CGGGGAAGAT GCTGCCCAGG TTACGCTATT CAAAAAATAT GTTGATACCT ATAAAGCCTA TGTTGAAGGA CCGCATCTTG GAGGAAAGGT CGTAAAAACA 
TATGTGAAGA ATTTACAGAA CGGTGCCGTA GATGATCAGC TGATCTCAGG TGTTGCGGAT GATCTTAATA AAGGGATTTC GTTAATGACA TTTTTTGGGC ATTCATCGGC ATTGATTAAT GATGTGGATA TAGGATTTGT TTCCAATCCG GTATATGGCT ACAAAAATTT TGGCAAGTAC CCAATGATGC TCGTGAATGG CTGTACATCT GCAAATATTT TTTCCAACTA TTCTTTTGCA GAAGACTGGA TCAATACCCC GGGCGTTGGG GCTATAAATG TATTGGGCCA TACCGATATT GGCTATACAA ATAATCTGTA CCAATTTTCG CTATACTTTT ATAATTTTCA ATTTAATGAT TACCGGTATA TAAATAAGCC TGTTGGTTTT ATTCATAAAA AAGTAATTGA TTCGATCAAT TCGATTAATG CATCAATAGA TGTTACTTCT CAGGCACAGA TTACCCAGAT GAATCTGACA GGTGACCCTG CTTTACGTTT GTATAATCCC GGCCGGCCGG ATTATGCGAT TTATGGCGAT AATCAAACAG CAGAAGCAAA TTGTATTATT ACGCCCTCCA CAACAGCCAG TATAACAGCG AAAGATCCGT TTAGAATAAT TATACCGATT GATAATTATG GAAGTACAAC AACAAAAACG GTTGATATGA TCATTAAGAG ATATGTAAAT AATGTTTTTG TGAAAAACTA TCAGGCTACC TTACCTCCGC TGTATTACAG AGATACGGTA GTCTTTGATA TTTCAAATAA TGACGGGAAT TATGCAGGTG ATAACCGGTT TGAAATAACA ATCGACCCTT CCGATTCGCT GAAAGAAATG CGCAAAGATA ATAATGTTGC TTATATCAAT TATTACATGC GTTCAAGTGC GATTAAATGT ATGTATCCGT TGAACTACAG CGTTGTTTCA AATCAGCCGA CGGTATTGAC CGCTCAGGCA ACAAATCTGT TTATTAATTT AACAGATTAT TATTTTGAAA TAGATACATC TAAGTTTTTT AACAGCATGC ATAAAAAAAC AGCTGTTATT TCTTCCGGAT CGTTGCCTAC CTGGACACCC GGGCTAATCA GTGATTTGAC TCCTTCGGAC AGTATTGTTT ATTTCTGGCG GGTACGTTTT AATACCATTG CACCAACAGA AGATACGATA TGGGATAACA GTTCATTTAT ATATATAAAG AACAGTAATC CGGGCTGGTC GCAAACACAT ATTGATCAGT ATCTGGAAAA TAATCTGGTA GGTTTGTCAT ATAACCGTAA TCAATTTAAA TGGGAGTTTC CATTAACTTC TATTGCATTA ATGGTACAGG CAGCAGGCGG AAGGTATCTG GGAGAAAAAG ATCTGACGCT TTTAACACTG AATAATTTAC CCTTGCTTCA AAATACACCT TATTATAATT GTGTAGGCGG CAGCGGCGGT CTGTTTATGC TTACACTGGA CCGTTCCTCT CTCGAGCCGG TTGTATACAA TCCCAATTCA GAAGGATGGT ATTATTGCGG GCAGAATTTT GATACCCGGC TTGTATTGGA AATACCTTTC CCAACAAATA CAAATACCCC GCCATCAACA ACCTGGCTCT ATGGACGGGA TGCAGACGGT TTGATACGTG CCATACAGCA TACAAATAAA AATGATTACC TGATTATTTT TAATGACGGG AACAGTATGA AAAATGGGTG GCCGGCAAAC CTGCAAACGT ATTTTAAGGA CTCGTTACAT GCCACGCAGA TCAGTGCGTT AACAAGCGGC CAGCAACCGT TTTTATTAAT AACAAAGCGT ATCAATACAA 
GCCCGATAAC TGAAAAGGTA AATGTAAGCA CTGCAACAGA TTCGTTTGTT GCTATTGATA CCACACTGAA CAGTTTCTTT TATAAAGGAA GTATTACGAC AAATTTAATC GGACCCTCAA GCATGTGGGG TAAAATGTAT TTTGCCATTG ATACTTCACT AAATGATGAG ACAAGCCTGA AGCTGATCCG TTTTGACATC ATGGCCAATC CGATTGATAC AATACTATTA CCTAAAGTTG ATTCATTAGA CTTAAACGGA ACGTATTTAA TTGATGGGGT GCATGTGTAT TGCAAGCTGC TGCTGGATTT GCAGGATGAC GGAACATTGA CACCGCCGGC ATTGAAAAAA TGGCAGATTA TCTATAACGG TGTGCCGGAA GGCACATTGA ACCCGTACGC GGTAGGACTG GATACCTATA CCATTCCAAA TCATCCCGAA GGAGATAGTA TTTCCATCAA ATATCAATTC GATAATATTT CTGATTATGA TTTCTCAAAA CCGATACAGG TTGTTTATTC AATTCGTAAT GAATCGGGCT CTCTACGCAT CGATACGATT ACATATTCGG TATTGAATGC CAGACAATCG CTTGTGTTTA CGTACAAGTT TACGACAAAA GGCCTGACAG GAAAAAATTA TATTCAGGCT TATGTGAATC CTCAGATGCA ATCCGAGCAA TACTATTCCA ATAACGTGCT GGAATCGTCG TTTGTGATTG AAGCGGATAA AACACAACCG ATATTAGAAG TCGCCTTTGA TGGTATACGG ATTTTCGATG GTGATTTAGT TTCTGCCAGC CCGCTTATAC ATATTTCATT AAAAGATAAT AACCAGTATC TGTTGCTGAC AGATCCGGCC AGTATTGAAC TGTATCTGTT ATATCCGGGG CAGACAAATG CGGTGCAGAT TACATCAACC AACCCGATGG TGCAAAGCTG GAGTCTGGAA AATGCGCGTA CCAATACTTT TGTTGCAGAG ATTAAACCTG CCAATCTGCC GGATGGAACT TATACTATTA TTGTTCAGGG GAAAGACGCT TCAGGTAATA AAACAGGCGG TCACCAGTAT AAAATTACAT TCAAAGTAGA AAACAAACCA TCGATCTCTT ATTTCTACCC GTATCCCAAT CCGTTTTCTA CAAGCACCCG ATTTGTATTT ACATTAAGCG GCACAACCGT TCCTGATAAT TTAAAAATCC AGATCATGAC AGTGTCCGGA AAAATTGTAA AGGAAATATT TAAAGAACAA TTAGGTCCGT TACATATTGG CAATAATATT TCAGAGTATG CCTGGGATGG CACAGATGAT TTCGGTGATC GCTTAGCGAA CGGCGTATAT TTATACCGTG TGATTATCAA AGATAGCAAT CAATTTTTCG AACACAGAGA AACTGCGGGA GACAAAGCCT TTAAGCAGGA TTGGGGCAAG CTGTATATAT TGAGGTAA
Protein sequence | MSVQSAIKTT ILSVFFVLAS LLSLAQNGTE WIIPNQIYYK VKTYQNGIYA IPYAKLQSAG INTADLTNLQ MWYRGKEQSI LLQHDSLFFL GKRNDGKLDS LLYANSANQP HAYYSIWSDT SAYFFTIGTQ PGKRIAVSAV PSVTTTESWY YEEALSIYTD LYYPGAYYSI ETLKSDYDVG EGWFGDKIAK GGNPVTGSYT IPISSAYTQT AADVKIEVQI VGEWNNSNHV ASLLIGNANT PDYTFNFSDF SAYEFRKISA VIPNSYLTGK NSIACTIRLS SAGTDVVAVS YIKITYPRNY LLTNSKDLSV ILPENGNQNR TVNLSNVTGN LFILDITDEL NQERIPYTQA GATASVVIPN TTKTFFVFTN RYYSISQVEL VNMSLPNLSK DYFIIYPEVF NSSALSYDQY RSSNAGGNYD VERCSFEKLC NLYSYGEYSS IAIKRYCSEI IQVNSKEKYL LILGKGVVPS MSNFINNVGS VVYRKNPSFY WTNTDFKNHF VNLVPPFGEP GSDLMFSVDN KYAAQIHTGR VPARTNSEVL GYLEKVQAHE SLDSTLLWRK NLIHLSGGED AAQVTLFKKY VDTYKAYVEG PHLGGKVVKT YVKNLQNGAV DDQLISGVAD DLNKGISLMT FFGHSSALIN DVDIGFVSNP VYGYKNFGKY PMMLVNGCTS ANIFSNYSFA EDWINTPGVG AINVLGHTDI GYTNNLYQFS LYFYNFQFND YRYINKPVGF IHKKVIDSIN SINASIDVTS QAQITQMNLT GDPALRLYNP GRPDYAIYGD NQTAEANCII TPSTTASITA KDPFRIIIPI DNYGSTTTKT VDMIIKRYVN NVFVKNYQAT LPPLYYRDTV VFDISNNDGN YAGDNRFEIT IDPSDSLKEM RKDNNVAYIN YYMRSSAIKC MYPLNYSVVS NQPTVLTAQA TNLFINLTDY YFEIDTSKFF NSMHKKTAVI SSGSLPTWTP GLISDLTPSD SIVYFWRVRF NTIAPTEDTI WDNSSFIYIK NSNPGWSQTH IDQYLENNLV GLSYNRNQFK WEFPLTSIAL MVQAAGGRYL GEKDLTLLTL NNLPLLQNTP YYNCVGGSGG LFMLTLDRSS LEPVVYNPNS EGWYYCGQNF DTRLVLEIPF PTNTNTPPST TWLYGRDADG LIRAIQHTNK NDYLIIFNDG NSMKNGWPAN LQTYFKDSLH ATQISALTSG QQPFLLITKR INTSPITEKV NVSTATDSFV AIDTTLNSFF YKGSITTNLI GPSSMWGKMY FAIDTSLNDE TSLKLIRFDI MANPIDTILL PKVDSLDLNG TYLIDGVHVY CKLLLDLQDD GTLTPPALKK WQIIYNGVPE GTLNPYAVGL DTYTIPNHPE GDSISIKYQF DNISDYDFSK PIQVVYSIRN ESGSLRIDTI TYSVLNARQS LVFTYKFTTK GLTGKNYIQA YVNPQMQSEQ YYSNNVLESS FVIEADKTQP ILEVAFDGIR IFDGDLVSAS PLIHISLKDN NQYLLLTDPA SIELYLLYPG QTNAVQITST NPMVQSWSLE NARTNTFVAE IKPANLPDGT YTIIVQGKDA SGNKTGGHQY KITFKVENKP SISYFYPYPN PFSTSTRFVF TLSGTTVPDN LKIQIMTVSG KIVKEIFKEQ LGPLHIGNNI SEYAWDGTDD FGDRLANGVY LYRVIIKDSN QFFEHRETAG DKAFKQDWGK LYILR
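
The sequences in this record are displayed in space-separated groups of ten characters. Before computing anything from them, such as the GC content listed in the table, the grouping has to be removed. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first two displayed groups of the gene sequence are used as input, so the printed fraction describes that 20-base fragment and not the whole gene.

```python
def normalize_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two groups of the gene sequence, as displayed in the record.
first_groups = "ATGAGTGTAC AAAGCGCTAT"

fragment = normalize_sequence(first_groups)
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.4
```

Applying the same two functions to the full gene sequence is one way to check the reported GC content value.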