Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1338 |
Symbol | |
ID | 4185843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 1561612 |
End bp | 1563477 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638071332 |
Product | U32 family protease |
Protein accession | YP_677950 |
Protein GI | 110637743 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.138533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.134167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAGTTGAAAT ACTTGCTCCT GCCAAAAACC TATATCAAGG AATGGCAGCT ATCAATGCCG GAGCAGATGC TGTATATATT GGCGCACCTC AATTCGGAGC ACGAACCAAT GCAACCAATC CGGTTGAAGA CATAGCAGAG CTTGTGCGTT ACGCACACTT ATTCAAAGCT CAGGTATTTG TAGTATTAAA CACTATTTTA TACGACAACG AACTAGACAC CTGTGAAAAA CTCATTCACG AGCTGTATCA TATTGGTGTA GATGCATTGA TCATTCAGGA CATGGCCATT ATGGAAATGA ACATCCCTCC TATTGTGATC CATGCCAGTA CACAAGCCAA TAACCGCGAT CCGAAACATG TAAAGTTTCT GGCAGATGCC GGCATGAAAC GCGCCGTGCT GGCACGCGAA TTAAATTTAG ATCAGATCAG AGACATTGCT GAAGCAACGG ATGTTGAACT GGAATTTTTT GTTTCCGGAG CCTTATGTGT ATCCTTCAGC GGCAATTGCT ACATGAGTAT TGCCGGCGGA GAGCGCAGTG CTAACCGCGG CTCGTGTGCA CAAAACTGCC GCTTGCCTTA TAACCTGATT GATGGTACAG GTAAAACACT TATTGCAAAC AGCCATTTAT TATCTATCAA AGATCTTGAC TTAAGCGATC AATTACCAAA TCTCATTGAA GCTGGTATTA CTTCGTTTAA AATTGAAGGC CGTTTGAAGG ATATTGTCTA TGTAAAAAAC AATGTATCCT ACCTGCGCAA AAAGCTGGAT GCGTTCCTTG AAAATAACGA ACGTTTTGAA AAAGCTTCCT CCGGGCGCAC ATTCTACAAT TTTGATGCTG AAATGGATCG CAGCTTCAAC AGAGGTTATA CCGACTATTT TGTAAACAAA AGAACAGAGC GGATCGGCTC ATGGGACACA CCAAAATCTC AGGGACAAGT AATCGGTAAA GTTATTGAAG TAAAACATAA CGGTTACGTC ATTGAAAATT CAGATAAACT AAACAATGGG GATGGTTTAT ATTTCATAAA TGAAGCCGGT GAAGCCGATG GCGCGCAAAT AAATACAATC ACCAATAATG TAGTTATTCC AAATACCTTT AAGCCAATTA AAGTCGGCAC AATGATTTAC CGGAATGCCG ATGCGGAATT CAATAAATTG GTTGAACGGG AAGACAGTGC GATCCGTAAG ATCGGCGTAT CGCTGCTATT CAGTGAAGTA CCTGAAGGTT TCCAGCTTAA AGCAATTGAT GAAGACGGAC ATGAAAGTAT TTCAACACTT GACGTTCAGA AGGAATTAAG CAAAAATGGC GACGGCGTTA TAGACAACAT TAAAAAAAAT CTGGCTAAAA CAGGAAATAC ACCGTTTATC GTTGACAAGC TGGACGTAAC GCTTTCAGCA AATTGGTTTC TGCCTATTTC AAAAATAAAT GAGATCAGAA GAATTGTATT AGAAGAACTG ATTGATGTAC GTGTTGCTTC ATACAATCGT AAAGAATATC AAATCAAAAA AACGGATCAT CCATATCCGG TAGAAAAACT CGATTTCATG TATAATGTAT CCAATAAAAT GGCCAGGACA TTTTACCACA GACATGGTGT TACTGAAATT GAAAAAGCAT TTGAATTACA ATGGGACCCG GGCAAGGCAC GTGTAATGAC AACCAAATAC TGCGTAAAAT ATGAATTAGG CAAATGTGCA CGCTATCAGC GCGACACCAT GGGCGAAAAA GTTGTCGAGC CTTTAGTATT AAAGCATGGT GAAAATGAAT ACAAACTTAA ATTCAATTGT AAACCTTGTG AAATGGAGAT CTGGGAAAAG GATGCCGATC TCGTTTTTGA TGAAGATGAT TATTAA
|
Protein sequence | MKKKVEILAP AKNLYQGMAA INAGADAVYI GAPQFGARTN ATNPVEDIAE LVRYAHLFKA QVFVVLNTIL YDNELDTCEK LIHELYHIGV DALIIQDMAI MEMNIPPIVI HASTQANNRD PKHVKFLADA GMKRAVLARE LNLDQIRDIA EATDVELEFF VSGALCVSFS GNCYMSIAGG ERSANRGSCA QNCRLPYNLI DGTGKTLIAN SHLLSIKDLD LSDQLPNLIE AGITSFKIEG RLKDIVYVKN NVSYLRKKLD AFLENNERFE KASSGRTFYN FDAEMDRSFN RGYTDYFVNK RTERIGSWDT PKSQGQVIGK VIEVKHNGYV IENSDKLNNG DGLYFINEAG EADGAQINTI TNNVVIPNTF KPIKVGTMIY RNADAEFNKL VEREDSAIRK IGVSLLFSEV PEGFQLKAID EDGHESISTL DVQKELSKNG DGVIDNIKKN LAKTGNTPFI VDKLDVTLSA NWFLPISKIN EIRRIVLEEL IDVRVASYNR KEYQIKKTDH PYPVEKLDFM YNVSNKMART FYHRHGVTEI EKAFELQWDP GKARVMTTKY CVKYELGKCA RYQRDTMGEK VVEPLVLKHG ENEYKLKFNC KPCEMEIWEK DADLVFDEDD Y
|
| |