Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1392 |
Symbol | |
ID | 4185288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 1613232 |
End bp | 1616687 |
Gene Length | 3456 bp |
Protein Length | 1151 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638071385 |
Product | transglutaminase-like enzyme, cysteine protease |
Protein accession | YP_678003 |
Protein GI | 110637796 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.284441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.67203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATCA AAGTCGCAAT TAATCACAAA ACAACTTACT CGTACGACAG GCTTGTCAAC CTGTCGCCCC ACGTATTCAG ATTAAGACCC GCAAGCCACT CAAGAACTGC TATTGAAGCA TATTCATTCA AAGTCAGTCC GGCGAACCAC TTTATTAACT GGCAGCAGGA TCCTTTCGGC AATTACCAGG CGCGCGTTGT CTTTCCTGAA CAGACGAAAG AATTAAGCAT CGAAGTGGAA GTAATCGCAA ACATGGTTGT GATCAATCCA TTTGATTTTT TTGTGGAAGA ATATGCAAGC AAATTTCCGT TCCAATACCA GGGGCAGTTG GTTAAAGAGC TGGCTCCGTA CCTGGAAGTA AAGGATGACG GCCCCATGCT TGAGGAATGG ATGAAAACAG TCAGCAGAGA ATCTATTGAG ATCGTTGATT TTCTTGTTTA CATCAATCAG AAAGTATATA AAGATATTAA CTATTCCATT CGTATGGAAG CGGGTGTACA GACACCGGAT GAGACATTAG GATTAGCGCT GGGTTCCTGC CGGGATTCGG CCTGGCTGCT GGTGCAGGCG CTTCGTAAAC TTGGCCTTGC TGCACGTTTT GTTTCGGGGT ATCTTGTACA GTTAAAAGCA GATGTGGAAG CATTGGACGG ACCTTCCGGA CCGCCGGCAG ATTTTACCGA TCTGCATGCC TGGGCGGAAG TATACATTCC CGGCGCAGGT TGGGTTGGCC TGGATGCCAC TTCGGGCCTG TTTGCCGGCG AAGGACACAT TCCATTAGCC TGTACGCCCG ACTATCTAAG TGCGGCGCCG GTGGTAGGTG CAACGAGCAA ATGTGAAACA ACGTTCTCGT TCTCCAATGT TGTTACCCGT ATTCATGAAG ACCCGCGTGT AACCAAGCCC TATTCAGAAG AAGAGTGGAC AGCGATCAAC GCACTGGGCA TGATGGTTGA TTATGAATAC CAGAAGGGTG ATGTGCGTAT GACTATGGGC GGTGAACCTA CCTTCGTTTC CATCGATGAT ATGGAATCTG CTCAATGGAA CACAGAGGCA GATGGTGTTG AAAAACGTAT CCTGTCACAT GAATTATTAT TACGCCTGCG TGCTCGCTTT GCTCCTGCAG GCATGCTGCA TTACGGTCAG GGAAAATGGT ATCCCGGTGA ACCGCTTCCT CGCTGGCAGT ACGGTTTGTT CTGGCGCAAG GATGGTTTAC CGATGTGGAA GAACACCTCT TTACTGGCAC TGGAAAACGG AAAGAAAAAA TATACGCATA TAGATGCCGA AGCATTTATT ACGGAGCTGA CGAAAACACT TGCAGTACCG GCTTCAGACA TTTCTGCCCT ACATGAAGAT ATCTTTTATT TCCTGTGGAC AGAAGGAAAG ATCCCGGTGA ATATTGATCC GATGAGTTTT ACATTAAAGG ATCCGATTGA GCGCAGAACT TTAACTCAAC TGTTGGATAA AGGCCTGGAA ACACCGGCAG GTTTCGTATT GCCATTAGAA TGGAGTCATA GTAAGAACTC CTGGAAAAGC GGTAAGTGGA GAATGAAACG TGAAGAGATA TTTCTTATCC CCGGAAACTC CCCTATTGGT CTGCGATTAC CGTTAGATGC GCTTTCTAAA ATTGCAGCTG AATATGAGCC TTATGAAATT GAGCGCAGCC TGTTTGAAAC AATTCCCTAT CTGCAAAATG ATTATACAAA TACTGTACGT TCACGTTACG GCAGTATTGT AGAGCATGAG GACACTCCCG CGCGCAGAAA AATCAATGAA GCTGTTGAAG AAGAAAAGCA ACATTCGAAA CATAAAAAAA TTGAAAAAGA AATAAAAGAA GCCCAGCCAT TATTTTATGT TCCGCTTATA AAGACGGCTT TATGTGTTGA AGTACGTGAT GGCATATTAT ATGTATTCTT ACCACCGCTT AAATACATTG AACACTATCT TGACCTGCTT ACGTCGATTG AACTGACGGC GGAAAAACTT CAGATCCCGG TTCGCATTGA AGGCTACGAA CCGCCACGCG ACAATCGTAT TGAACGGCTT GTAGTATCCC CTGATCCGGG TGTGATTGAA GTAAATATTC ATCCGGCAAA AGACTGGAAG GAAATGTCTT ATAATCTGGA AGTATTATAC CAGGAAGCAT TTAAGACCCG GTTAGGGACG GAGAAATTCA TGCTTGACGG CAAGCATACC GGTACCGGCG GCGGTAACCA CATTACCATC GGCGGTGCTA CGCCTGGCGA TAGCCCGCTG CTGCGCAGAC CGGATCTGCT TAGAAGTATC ATAACGTTCT GGCAACACCA CCCAGGCCTT TCGTATTTAT TTTCAGGCGC GTTCATCGGT CCTACCAGCC AGGCACCACG CTTTGATGAA GGCCGGGATG AAAAATTATA TGAGATGGAA ATTGCCTTCG CGCAGATTCC GAATCACGAT GAAATTCCCT TCTGGCTTGT TGACCGTTTG TTCCGTCACT TGCTGACCGA TATTACCGGA AACACCCATA GAGCAGAATT ATGTATTGAT AAATTGTATT CCCCGGATTC TTCCAGCGGC CGCTTAGGTA TTCTCGAGTT CAGAGCATTC GATATGCCGC CGCACAAAGA GATGAGCATG GTGCAGATGC TGTTAATCCG TACGCTGATC GCCTGGTTCT GGAAGACGCC GTATAAAAAA GACCTTGTAC CGTGGGGTAC AGAGCTGCAT GATAAATATA TGCTGCCGCA TTTTGTGGAG CAGGATATGA AAGACATTGT GCGTGACCTG AATGATGCGG GATATCCATT CCAGATGAGC TGGCTGGCGC CGTTCTTCGA ATTCAGGTTC CCGCATTACG GAACTGTTCA CATCAAAGAC GTGCAGATCC AGATCCGTAT GGGTATTGAG CCATGGCATG TATTGGGCGA AGAAATGACC AGCTCGGGCA CCTCGCGTTA TGTAGACTCC TCGTTAGAGC GCCTGCAGGT GAAGCTGACC AACATGAATG ACTCGCGGTA TGTGCTTGCC TGCAATGGTT TCAAAGTGCC GTTAAAACCA ACGCCTGTAA AAGGTGAGTT TGTATGCGGG ATCCGGTACC GGGCATGGCA GCCGCCATCC GCACTACACC CGACGATTGG GATTGATACA CCGCTGGTAT TCGATCTGAT TGATACGTGG AACGGCAGAT CTGTTGGCGG ATGTACGTAT CACATCGCAC ATCCGGGCGG AAGAAGTTAC GATACCTTCC CGATCAATAG CTATGAAGCG GAATCAAGGC GTGTATCGCG CTTCTGGGAC AACGGGCATA CACAGGATCC GGTCCCTGCA CCGGATTACG TTACATCAAC ACCGATCGGC CGTTATATGA TTGAGATAAA TCCTGGTTTG AAACAGTTTG AATTAACAAA CATCGAAACA AATAAAGCCT ATCCGAATAC ACTGGATATG CGTCTTTGCT GGAAAGCAAA GAATAAGGAC AGGTAG
|
Protein sequence | MAIKVAINHK TTYSYDRLVN LSPHVFRLRP ASHSRTAIEA YSFKVSPANH FINWQQDPFG NYQARVVFPE QTKELSIEVE VIANMVVINP FDFFVEEYAS KFPFQYQGQL VKELAPYLEV KDDGPMLEEW MKTVSRESIE IVDFLVYINQ KVYKDINYSI RMEAGVQTPD ETLGLALGSC RDSAWLLVQA LRKLGLAARF VSGYLVQLKA DVEALDGPSG PPADFTDLHA WAEVYIPGAG WVGLDATSGL FAGEGHIPLA CTPDYLSAAP VVGATSKCET TFSFSNVVTR IHEDPRVTKP YSEEEWTAIN ALGMMVDYEY QKGDVRMTMG GEPTFVSIDD MESAQWNTEA DGVEKRILSH ELLLRLRARF APAGMLHYGQ GKWYPGEPLP RWQYGLFWRK DGLPMWKNTS LLALENGKKK YTHIDAEAFI TELTKTLAVP ASDISALHED IFYFLWTEGK IPVNIDPMSF TLKDPIERRT LTQLLDKGLE TPAGFVLPLE WSHSKNSWKS GKWRMKREEI FLIPGNSPIG LRLPLDALSK IAAEYEPYEI ERSLFETIPY LQNDYTNTVR SRYGSIVEHE DTPARRKINE AVEEEKQHSK HKKIEKEIKE AQPLFYVPLI KTALCVEVRD GILYVFLPPL KYIEHYLDLL TSIELTAEKL QIPVRIEGYE PPRDNRIERL VVSPDPGVIE VNIHPAKDWK EMSYNLEVLY QEAFKTRLGT EKFMLDGKHT GTGGGNHITI GGATPGDSPL LRRPDLLRSI ITFWQHHPGL SYLFSGAFIG PTSQAPRFDE GRDEKLYEME IAFAQIPNHD EIPFWLVDRL FRHLLTDITG NTHRAELCID KLYSPDSSSG RLGILEFRAF DMPPHKEMSM VQMLLIRTLI AWFWKTPYKK DLVPWGTELH DKYMLPHFVE QDMKDIVRDL NDAGYPFQMS WLAPFFEFRF PHYGTVHIKD VQIQIRMGIE PWHVLGEEMT SSGTSRYVDS SLERLQVKLT NMNDSRYVLA CNGFKVPLKP TPVKGEFVCG IRYRAWQPPS ALHPTIGIDT PLVFDLIDTW NGRSVGGCTY HIAHPGGRSY DTFPINSYEA ESRRVSRFWD NGHTQDPVPA PDYVTSTPIG RYMIEINPGL KQFELTNIET NKAYPNTLDM RLCWKAKNKD R
|
| |