Gene Information |
Locus tag | CHU_2494 |
Symbol | |
ID | 4184638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 2865480 |
End bp | 2871566 |
Gene Length | 6087 bp |
Protein Length | 2028 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638072487 |
Product | hypothetical protein |
Protein accession | YP_679090 |
Protein GI | 110638881 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
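The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for CHU_2494.
length = gene_length_bp(2865480, 2871566)
print(length)                     # 6087
print(protein_length_aa(length))  # 2028
```

Both results agree with the Gene Length (6087 bp) and Protein Length (2028 aa) rows of the record.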
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.415505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0870221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTTT TAACTTTAAC CATTTTATCA TTCTTTACAC TTACTCTTGC AATGGCACAA ACAGATGGTA TGAAACAACG TTGGGACAAA ATTGACGCTT ATTTAAATGA ATCCTTACCT AAGTCTGCGT TAACAGACCT CACGAAACTT TACAGCGAAG CAAAAGAAAA AAAAGATGCA GATACGCAGA TCAAAGCATT GATGTATATC ATGCGCTGCA CGGATATGTC TGAAGAAGAT GCTTTTGAAA AAGACATTGC TTTTATACGC AAAGAAATTA AGACTTCTGC CTTCCCGGTT AACGCTGTTC TTGAATCCAT GCTGGGAGAA ATGTACTGGC AATATTTTCA GCGCAGCAGA TACCGTTCTG ATATCAGTGC TTCTACCAAT GCAGATGATA CGGATATTCA GACATGGGAT CTGAAAACAA TTCTGGAAGC GGCTATCAAC TCCTATCATG CATCGCTCAA AGATAAAGAG CTGTTGCTGA ACTATCCGAC AGAAACATTA AAAGAAGTTA TCTATAAAAG CAGTAATCAT GTTTATACAG CCAATCTGTA CGACTTTCTG GGTAAAAGAG CACTGACATT TTTCCAGTCG TCAGAATCAA GTGTAAGCCG CCCTGCGGAA CAATTCAATC TGAATGATGC ACGGTATTTT TCACTGCCTG CTGTATTCAC CACATTAACC ATACAAAGCG AAGACACGCT GAGCCTGCAT TTATATGCCA TGCGATTGAT GCAGGACCTG GAAAAGATCC ATCTGAATGA TAGCAACCCA ACTGTACTTG TTGATCTGGC TATAAGCCGT TTAAAGTTTA CGTCAGCACA TTCAGGCCTT CCAAATGCTA TTGAGCTTAA ACTTGCTGCA CTTGAAAATC TGGAAGCATC TTCCTTACCC TATCCGATCT CAACGGAGGT AAGTTATGAG ATCGCGGTGA TCTGGCATGA ACGTGCTGCG GATATGAGCG GCTTGAAAAA CAGTCCGAAG TATCCGGATG CAAATATTAA AAGCATACAG ACGTGTGATG CAGCCATTAA ACGTTTTCCG GGCTCTGAGG GTGCCAACCA CTGTGAATCT GTAAAAGAAA CAATCCTTAA ACCATCGCTT GAATTAAAAC TGGAAGAAGT TAATATTCCG CAGGCACCTT TCCGCACGCT GGTTGCTTAT AACAATATCA AAACCCTTTC ACTTCGTGTG ATCCGGTTAA CCGCGGAAGA ATCGGAAAAA TTAAGAAATG ATCTCGACTA CCGGTACGAT GAACGTTTCG AATCATTCAA GCCTTATCTT GACCGCACAC CTGTAAAAAA ATGGTCGGTT GGTCTGCCGC AGGATCCGCA GCTGCGCGGG CATCACACAG AAGTTGTTAT TGACCCGCTT GCAGAAGGTT TTTATCTGAT CATTGCTTCA GATGTTGAAA AGATTGAAAA ACAGAAAGCG CTTTTTGCAT ATACGTTTAC AACCGTTTCA AACATTTCAT ATGTAGCACG TACTACGGAA GGCAAGGTGC AGCTGTATGT ATTGAACAGG CATACAGGCA AACCCATTAA AGATGCACAG GTAAGCGCTT ATACAAACGA ATATAATTAT GAAACACACC GGAACCGGAA AAAAGTAATA GGTATCTTTC CTACCGATGA AACTGGTTAC ACAGAAGTAA AGGCTGCAAA AGATAGCTAT TCGTCTTATG TAAATTTTGA TATTAAAACA AAAACAGATC GCTTGATCAG TGATAACAGA AATTCCTCGG GGTATTATAT CTACAATGAA AAACATCCGA CCGGTAATTC TTTTTATCAT 
ACGTCTATGC TGGTGTTTAC AGATCGTGCC ATCTACAGAC CCGGACAAAC AATCTATTTC AAAGGCATTG TATTTAATAC GGATGTAAAA AATACATATA ACGTTAGTAA AAATACTCCC GTACGCATTT ATTTCTATGA TACGAATAAT AAAGAAACGG CATTTGCAGA TCTTGTCACG AATGAATTTG GATCTGTGCA GGGAACTTTT ACCGCACCAG TCGGATCGCT TACCGGAAGC ATGCATATTG GCAATGACAT GGGGAATGCC TATTTCAATG TAGAAGAATA TAAGCGCCCG AAGTTTGAAG TAAAGATACA GCCGCTTACG GGGCAATATA AACTGAATCA GGAAGTAGAA GTTAACGCTT TGGCAAAAGC CTATGCCGGG AATGCTATAG ATGGTGCAGA AGTTAATTAT CGTGTAGTAC GTCAGGTACA GATCCCTTAT TGGTACGCGC GTTACTGGGG TTACCGCAAC CCGCAGGAAA CGGTTATTAT GACCGGAAAA TCTGTTACAG ATGCAGCTGG TGCATTCTCC ATTAAATTCG CTGCCCTGCC TGATGCCTCC GTTTCTCCGG AAAGCAAATC AACATTCCTG TATACCGTAT ATGCGGATGT GATTGACATC AATGGTGAAA CACACAGCGA CCAGCTCTCT GTTTCAATCG GTTATTCTTC TCTGGTATTA AACATTCAGG CTCTTGCCGT TGTTGAAAAA GGAAAGCCAT CCGCATGCAC TTTTTCTGCT GCCAATCAAT CCGGAGAACC TGAAGCCGCA ACCATACAGG TAAAAGTTTT CAAGCTGGCA GCGCCTGCTA AAGCATTACG TGAACGTTTA TGGGAAGCAC CGGACAAGCC CCTGTTAAGC GAGGAGGCGT TCCGGAAATT ATTCCCGGAA GATGTATATG CAAATGAAAC GGATCAGGAA AATTTTCCTC AGAAATTAAT CTTCGAAACA ACCCTGCAGG CAACCAAAGA AAAAGGCGCG GTATGGAATG TACCTGAAAA CTGGGAAACC GGACAGTATC TTGTAAAAGC AACAGCACTG GATAAAGATA AAACCGAAAG TGAAACACAA ACGACATTCA CTAAAACAGA TCCGTCAGGC AGCAAACTTC CATATGCCAT GCAAAAATGG ACAAACGTTT TACCTGCCAG TACACAACCG CTGCAGAAAG CCATGTTTGA GATCGGCTCA TCTTTCTCTG ACGTACATGT GATCTGCGAA GTGGAACGTG ACGGAGTAAT TATCCGCAAA GAATTTATTA CACTGAAAAA TGATATTAAA AAATTTGAAC AGCTTATTAC TGAAGCAGAC CGTGGCGGCT TAGCTGTGCA TTATGTATTC GTGCATAATA ACCGTTTGTA CGGCGGCACA GAAAACATAT CCATACCGTT TAGCAATAAA GACGTAACGA TAAAATGGGA AACCTTCCGA AGCAAGCTGC AGCCGGGGCA GGCAGAGCAG TGGCGTGTAA TCATCAATAA AAAGGATGGA GAAAAACAGG CTGCAGAATT TTTAGCTACG ATGTATGATG CTTCGCTGGA TGCCTTTGCA TCCCATTCCT GGTTTGTAAA CCTATACCGT GCCAATTACG GCAAACTCAG CTGGATGAAT AATCCCGCTA CTTTCGGTTC CGAAGCGTTT ACCGTATGGG ATTATTATGG ACATGGAGGT TATGGTTCGT ATAAGTATTA TGACCAATTC AACTGGTTTG GTTATTCCTT TGGCTATCAC TATGGCGGCC GTACACGCAA TGGAGAGCTG CGTAAAAAAT CAGCACCTGG CGGCGGCGGG GAACTTCACG 
AAGAGGCGGA AATGGATATG GCAGCCCCAA TGATGTCGGC TGTTGCAAAA GAAGAATCCA GAAATGAACG TGTAAATTTT CTTCCGCCTG TTGTAAAAGC AGATTCAGAA GTGCTTGCTG TGAAAGAACC TGAAGCGGTT GAACAGCCTG CGATCCGGAA AGATTTCCGT GAAACGGCTT TCTTTTTCCC GGATCTGAAA ACAGATGCTG AAGGCAATGT AGTGTTAAAC TTCACCATGC CGGAAGCACT CACACGCTGG AAAATGTTGG GGTTTGCGCA TACCAAAGAT ATGAGCTACG GCTTCACGCA ACAGGAAGTG GTGACGCAGA AAGAGCTGAT GGTAGTGCCG AATGCACCAC GTTTTTTACG TGAAGGAGAT CAGCTTGTTT TATCTGCAAA AGTAACAAAC CTGAGCGGTG CACCAATGAA AGGTACAGTA ACACTACAGC TGTTTGATGC GGCAACCATG CAGCCGATTG ATAAAGAATT AGGAAACACA GCACCATTGA AAGTATTTGG CGATGCCAAC ACACCAAGCG AAGTTAAAAC ATGGTCGATC CAGGTACCGG GGGGCTTTCA GGCGATTACT TACCGCGTAG TTGCCGCGGC AGGAAACTAT AGCGATGGCG AAGAAAACGT AATACCGGTA TTTTCAAATC GCATGCTGGT AACAGAAAGC ATTCCAATGG TGATTACCAA AGGCCAGACC AAAACATATA ATCTGGATAA ACTTGCCACC ACTACATCTA AAACACTTGT TAATCATCGT CTGACCTTAG AGCTGACGCC AAATCCGGTA TGGTACGCCG TGCAGGCCTT GCCGTATCTG GCGGAGTATC CGTATGAATG TGCGGAACAA ACGTTCAGCC ACTATTATGC AAATTCGCTT GCAGGCTTTG TTGCCAACAG CAGTCCGGCC TTGAAACGCA TGTTTGACTT ATGGAAAAAA ATGGAACCGG ATGCTTTGCT TTCAAATCTT GAAAAAAATC AGGAATTGAA AATGCTCTTG CTGGAACAGA CACCTTGGTT ACGCGACGCA GAAAATGAGA CGGAACAGAA AAGACGTATC GGTTTATTGT TTGACCTGAC GCGCATGAAC AGCGAACTTG ACCGTGCTAT CGGCAAGCTT GAAAAAATGC AGCTGGGTAA CGGAGCGTGG CCATGGTTCA CCGGGATGCG CGAAGACCGT TATATCACAC AGCATATTCT TATTGGTTTA GGCCATCTGA AACATTTAGG CGCACAGGGC CGACATGATG AACAGATCGC AGACATGACC CGGAAAGCAC TGGATTACAG CGATGCAAAG CTGCTGATGG ATTATAAAGA ATTAGTAAGA CGTGTACAGG TTGATAAACT TAAAATAGAA GAAGTGCGTC CGTCGTCTCT TGAAGTACAT TATCTGTACG GAAGAAGTTT CTTTTCTGTA AAAGCAGAAG GAGAACTTGC AACAGCCATT CAGTTTTACA AAGACCAGTC GCGCAAATAC TGGACAGAAT ACGCCTTATA CGAGCAGGCT TTGATCGGAC TTACTGCCTA CCGCGATGGC ACTACAACGT TCTCTGATGC GCTGCATAAA TCCTTTACAG AACGTGCAAT CAGTACAGAA GATAAAGGCA TGTACTGGAA AACCAATCCG GGCTATTATT GGTATGAAGC GCCGATCGAA CGCCAGGCAA TACTGATCGA ATTCTTTGAA GAAGCGGCTA AAGACCGTAT TTCTGTAGAT AAAATGCGTT TCTGGCTGCT CACACAAAAA CAAACGACAC ACTGGAAAAC AACCAAAGCT ACTACAGAAG CTTGTTATGC 
CTTATTGCTG AACGGAACTT CCTGGTTAAG CGCAGATGAA AGCCTTACGG TACAGGTTGC CAATACCAAC GTAGTTTTCC CGAAAAGTGA ACTGAATCCT ACGCTGACAA AAGTGTGGAA CACAACAGAG ATTAAACCAC AGATGGCTAA GGTAACATTA AGCAAACAAG GTGAAGGTGT GGCATGGGGC GCCTTACACT GGCAATATTA CGAAGACCTC GATAAGATCA CCACACATAA AAATGATCAG ATTCAATTAA CAAAAGAACT GATGCTTGAA GTACAGACTG CCGGCGGTAA GGTATTGACC CCTATAACGG CTGCAACAAC TCTGAAAGCC GGTGATCTGG TAAAAGTGAA ACTGGTGATC CGCTCAGACC GTGATCTGGA GTATGTACAT GTACAGGACA TGCGTGCAGC AGGTTTCGAG CCCTTAAATG TATTATCCGG AGCAAAATGG AACGGAAACT TTGGCTATTA TGAAAGCACC CGTGATGCTT CTACCGATTT CTTTATCGGG TTCCTGCCAA GAGGTACTTA TGTGTTTGAA TACAGCCTGC GTGTGAATAA TGCAGGTAAC TTCTCGAACG GCATTACCAA TATCCAGTGC ATGTATGCCC CTGAATTTAA TGCGCATTCA CAGGGTATCC GAGTAGAAAT CAAATAG
|
Protein sequence | MRFLTLTILS FFTLTLAMAQ TDGMKQRWDK IDAYLNESLP KSALTDLTKL YSEAKEKKDA DTQIKALMYI MRCTDMSEED AFEKDIAFIR KEIKTSAFPV NAVLESMLGE MYWQYFQRSR YRSDISASTN ADDTDIQTWD LKTILEAAIN SYHASLKDKE LLLNYPTETL KEVIYKSSNH VYTANLYDFL GKRALTFFQS SESSVSRPAE QFNLNDARYF SLPAVFTTLT IQSEDTLSLH LYAMRLMQDL EKIHLNDSNP TVLVDLAISR LKFTSAHSGL PNAIELKLAA LENLEASSLP YPISTEVSYE IAVIWHERAA DMSGLKNSPK YPDANIKSIQ TCDAAIKRFP GSEGANHCES VKETILKPSL ELKLEEVNIP QAPFRTLVAY NNIKTLSLRV IRLTAEESEK LRNDLDYRYD ERFESFKPYL DRTPVKKWSV GLPQDPQLRG HHTEVVIDPL AEGFYLIIAS DVEKIEKQKA LFAYTFTTVS NISYVARTTE GKVQLYVLNR HTGKPIKDAQ VSAYTNEYNY ETHRNRKKVI GIFPTDETGY TEVKAAKDSY SSYVNFDIKT KTDRLISDNR NSSGYYIYNE KHPTGNSFYH TSMLVFTDRA IYRPGQTIYF KGIVFNTDVK NTYNVSKNTP VRIYFYDTNN KETAFADLVT NEFGSVQGTF TAPVGSLTGS MHIGNDMGNA YFNVEEYKRP KFEVKIQPLT GQYKLNQEVE VNALAKAYAG NAIDGAEVNY RVVRQVQIPY WYARYWGYRN PQETVIMTGK SVTDAAGAFS IKFAALPDAS VSPESKSTFL YTVYADVIDI NGETHSDQLS VSIGYSSLVL NIQALAVVEK GKPSACTFSA ANQSGEPEAA TIQVKVFKLA APAKALRERL WEAPDKPLLS EEAFRKLFPE DVYANETDQE NFPQKLIFET TLQATKEKGA VWNVPENWET GQYLVKATAL DKDKTESETQ TTFTKTDPSG SKLPYAMQKW TNVLPASTQP LQKAMFEIGS SFSDVHVICE VERDGVIIRK EFITLKNDIK KFEQLITEAD RGGLAVHYVF VHNNRLYGGT ENISIPFSNK DVTIKWETFR SKLQPGQAEQ WRVIINKKDG EKQAAEFLAT MYDASLDAFA SHSWFVNLYR ANYGKLSWMN NPATFGSEAF TVWDYYGHGG YGSYKYYDQF NWFGYSFGYH YGGRTRNGEL RKKSAPGGGG ELHEEAEMDM AAPMMSAVAK EESRNERVNF LPPVVKADSE VLAVKEPEAV EQPAIRKDFR ETAFFFPDLK TDAEGNVVLN FTMPEALTRW KMLGFAHTKD MSYGFTQQEV VTQKELMVVP NAPRFLREGD QLVLSAKVTN LSGAPMKGTV TLQLFDAATM QPIDKELGNT APLKVFGDAN TPSEVKTWSI QVPGGFQAIT YRVVAAAGNY SDGEENVIPV FSNRMLVTES IPMVITKGQT KTYNLDKLAT TTSKTLVNHR LTLELTPNPV WYAVQALPYL AEYPYECAEQ TFSHYYANSL AGFVANSSPA LKRMFDLWKK MEPDALLSNL EKNQELKMLL LEQTPWLRDA ENETEQKRRI GLLFDLTRMN SELDRAIGKL EKMQLGNGAW PWFTGMREDR YITQHILIGL GHLKHLGAQG RHDEQIADMT RKALDYSDAK LLMDYKELVR RVQVDKLKIE EVRPSSLEVH YLYGRSFFSV KAEGELATAI QFYKDQSRKY WTEYALYEQA LIGLTAYRDG TTTFSDALHK SFTERAISTE DKGMYWKTNP GYYWYEAPIE RQAILIEFFE EAAKDRISVD KMRFWLLTQK QTTHWKTTKA 
TTEACYALLL NGTSWLSADE SLTVQVANTN VVFPKSELNP TLTKVWNTTE IKPQMAKVTL SKQGEGVAWG ALHWQYYEDL DKITTHKNDQ IQLTKELMLE VQTAGGKVLT PITAATTLKA GDLVKVKLVI RSDRDLEYVH VQDMRAAGFE PLNVLSGAKW NGNFGYYEST RDASTDFFIG FLPRGTYVFE YSLRVNNAGN FSNGITNIQC MYAPEFNAHS QGIRVEIK
|
| |
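The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, of how one might strip the display formatting, compute a GC percentage comparable to the GC content field, and run a basic sanity check on a coding sequence. The start-codon check is a simplification: it only accepts ATG, although translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon at the end."""
    seq = clean_sequence(dna)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in stop_codons
    )


# Toy input: the first two display blocks of the gene sequence, not the full gene.
fragment = "ATGAGATTTT TAACTTTAAC"
print(clean_sequence(fragment))
print(gc_percent(fragment))

# Toy CDS built for illustration only (ATG, one lysine codon, TAG stop).
print(looks_like_complete_cds("ATG AAA TAG"))
```

To check the whole record, paste the full Gene sequence cell into `gc_percent` and `looks_like_complete_cds`; the sequence shown starts with ATG and ends with TAG.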