Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_3033 |
Symbol | |
ID | 4184966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 3475831 |
End bp | 3477390 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638073022 |
Product | serine protease |
Protein accession | YP_679616 |
Protein GI | 110639407 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTA TTATTCGCGC GGTAAGTGCA GTACTGTTTT TTTCTGTTGT TGTTTTACAA ACGAATGCTC AAATTCAAAA ATATTGGATT TCATTCAAAG ATAAAGAGAC TGTTGGATAT AATTATAAAA ACAATTTAAG CCCTCAAACA ATTTTAAACA GAACGGCTTA TTCAATTCCA TTACATCAAT ACACAGATAT TCCTGTTTCA AAAATATTTA TTGATTCAAT TGCAAAGCTG GATGTTTTAA TCATTGCAAA ATCAAAATGG CTGAATGCGG TAACTGCAAA CTTAACACGG GAACAGGCCG AACAAATAAA ACAGATTTCT TTTGTTGCTT CTGTTGAGCC GGTGAATATT TATTTGGTCG GATCTTCTAC AAATGAGCTG GAAATTGCTC CTGAACTGAT GCATGCTGCC ATGAAACAAA TGAAATCAAA AGCGTTTAAT GAAAAAGGCA TTGATGGTAA GGGGATCCGT GTAGGTGTTA TCGATGCAGG TTTTTACAAA TTACATGAAG ATCCGGCAAC AAGTTATTTA GTGCAGGATA AAAAAATATT GGGACAGCGT GATTTTATTG ATAAATCAAG AACAGACTTA ATTGTAAATG CAGCAACATC TGCCGACGAC CACGGCAGGC AAGTTGTTCG GATGATTGCA GGTTATGATA CCTCTATTAA AGCACAATAC GGCATGGCTG TTAATGCATC TTTTTATCTG GCAAGAACTG AAAACGGCGA AAGAGAGTAC AGGGGCGAAG AAGATATGTG GATCATGGCA ATGGAGTGGA TGGACAGCTT AGGCGTTCGA TTAATAAGCA CTTCGTTGGG TTATGCTACT AAAATGGATG ATCCGAATGA CAATTATAAG CAGTCTGAAA TGGATGGAAA GACAGCACGT ATCACAAAAG CAGCGCAGAT TGCATTTTAT CAAAAGGGAA TTTTCTTAGC TGTTTCGGCC GGCAATGAAG GCGACACGCA ATGGAGAATT ATTTCGGCAC CAGCTGATGC TGAAGGAGCC TTAGCTGTAG GTGCTACAAA AGCTTCCACC TGGGACCGCA TTTCTTACAG CAGTATCGGC CCGGAACCAT TGCCGTATCT GAAACCGAAC GTGTCCTGTT ATTCTCCAAA CGGTACATCA TTCTCTTGTC CGGCTGTAGC AGGATTTGTT GCTTGTATGA TGAACAATGA TTCAACCTTA ACCAACGTTC AGTTAAAAGA AATTATTCAG CGTTCAGCGC ATCTGTATCC ATACGGAAAT AATTTTATAG GTTATGGCAT TCCGCAGGCT GATCGTGCAT TGGTATTAAG CAAGGATCAA AATACGGATT TTGGGAAAGC CGTTCTGATT CATAATTCGA AAAAGGTTTT TAAACATACA TTCGACAAAT CCATAAAAGT TGAACTGGTG CTGTTCCATA AAAAAAATGA AACGATTGTA ATTGATCAGC AAGTAATAAT GGTAAAGAAA GGAAAGCTCA AGATAAAGCG ACCTAAAAAT GCTGAACGTA CAACAATTGT TGCAGACGAG TTTTTAACGC TTGAAATAAT TTGGGAATAA
|
Protein sequence | MQTIIRAVSA VLFFSVVVLQ TNAQIQKYWI SFKDKETVGY NYKNNLSPQT ILNRTAYSIP LHQYTDIPVS KIFIDSIAKL DVLIIAKSKW LNAVTANLTR EQAEQIKQIS FVASVEPVNI YLVGSSTNEL EIAPELMHAA MKQMKSKAFN EKGIDGKGIR VGVIDAGFYK LHEDPATSYL VQDKKILGQR DFIDKSRTDL IVNAATSADD HGRQVVRMIA GYDTSIKAQY GMAVNASFYL ARTENGEREY RGEEDMWIMA MEWMDSLGVR LISTSLGYAT KMDDPNDNYK QSEMDGKTAR ITKAAQIAFY QKGIFLAVSA GNEGDTQWRI ISAPADAEGA LAVGATKAST WDRISYSSIG PEPLPYLKPN VSCYSPNGTS FSCPAVAGFV ACMMNNDSTL TNVQLKEIIQ RSAHLYPYGN NFIGYGIPQA DRALVLSKDQ NTDFGKAVLI HNSKKVFKHT FDKSIKVELV LFHKKNETIV IDQQVIMVKK GKLKIKRPKN AERTTIVADE FLTLEIIWE
|
| |