Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1208 |
Symbol | |
ID | 4185361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 1383452 |
End bp | 1386547 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638071203 |
Product | metalloprotease |
Protein accession | YP_677821 |
Protein GI | 110637614 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.226468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC CGTATAGCAC GTTATTTTTA TTGCTGTTAA CAGCAACACA CACGTTTGCG CAAACATCCT TTGATGTAAA AGAAAAACAG AGCAATCCCG CTGCACACCG CTCCTTTGAA GCAACACGCA TCAAAGGAAA TATGCTCCAG CAATCCATAT TAACAGCGCA AGGAACAAGC CTTCTTCCGC AGCTTTCAAA ACCAAACACA GCAGCTGTAC GTGCTACAGC AAAAAATCCG TTCAGCGTAA TTTATTCAAA TGAAACCGGG CTGCCAATCT TCATTAAAAC CATCATTCCT CAGACACTGC AGCAGCGCGC TGTTGGTACA GGAAGTGGAA TTGCGATTGC GTACAACTAC ATTGATCAGC TGCGCGAAAC ATTAGGCTTA ACAGACGTTG CCGAGCCATT TACGCCATAC AAAACAGAAA AAGATCTTTT GGGCGGACAA ATTATCCGGT TAAAGCAATT TTACAACGGC ATTGAAATAG ATGGTTGTGA AAGTATTGTA CACATCAATG CAAGCGGACA GGCTGTTTCG TGGAACGGCA GTTACATTAA ACCTGATCTT ATTAAACACA CCTCTTTTGC GGTCACGCCC GCCGCGGCGG CGGCAAAAGC GCTTGCTGAT ATTAAAACAC ATGCACACTA TGTAGAACTG TCTGAACAGG AACAGCAGTT TTTAAATTAC AGTACACCAG GCATCAAACA AATCTACTAC ATTGACGACA AACTTGTGCG GAGCTGTGTA CCGGCTTACA GCATTGATGT ACGTCCAAAC TTTCTGGATT GGTGGGAATA TATTATTGAT GCACAAACAG GAAACATCCT TTCATCGCAT TCCAAAACAT GCCATGCCGA CGGTCCGCGC ATAAGTACCG GCAACGACCT CAACGGCGTA TCACGTACGA TTAATACCTA CCAGACAGGT TCATTGTATT ATACGACAGA TGCCAGCAGA AGCATGTTTA AATCAAGCCA GTCTTCATTC CCCGACAATC CTGCCGGTGC CATTCAAACG CTTGATCTGA ACTATACATA TGGTTCCAAT ACCAAATACA AAGCTATTAC TTCCAGCACC AACAGTTTTA ATGCTACGGC AATTTCGGCA CATTATATTG CCGGCAAGTC ATACGATTAC TATTCTGCCA TACATGGCCG TACTTCTATT GATGGGAATG GCGGCACAAT TATCTCTTTC ATAAATGTTG CTGACCCGGA CGATGGAACA CCAATGGACA ATGCGTTCTG GAATGGTAAA GCCATGTATT ACGGCAATGG AAATACAAAC TTCAAACCCC TGGCCGGCGG CCTGGATGTA GGCGGGCATG AACTGACGCA CGGTGTGATC CAGAATTCTG CCAACCTCAA TTACCAGGGC GAATCCGGTG CCATCAACGA ATCAATGGCG GATATTTTTG GCTGTATGAT TGATTCTCTG GATTGGAAAA TCGGTGAAGA TGTTGTACTT CTAAGTAAGT ACCCTTCCGG TGCCTTACGT GATTTATCCA ACCCGCACAA TGGCGGCACA AATATAAATT CAAGAGGCTG GCAGCCTGCA CATGTATCTG AAAAATACTC CGGAACACAG GATAATGGCG GCGTACACAT AAACAGCGGT ATAACCAATT ATGCTTTTTA TTTATTGGCA CAGTCTACTT CAAGAAGTAA GGCTGAAAAA ATATTTTACC GTGCATTAAC CGCCTACCTT ACGCGTTCTT CTAAATTCAT TGACCTCAGA ATTGCATGTA TCGCTGCGGC AACCGATTTA TATACATCAA ATGAGGCAAC GAAAACAGGA ATCGCTTTTG ACCAGGTAGG GATTACAGGA AACAGTGAAG TACCTACAAC ACCTGTATCA AGCAACCTGC CAGTGAATAC AGGCGATGAA TACTTGCTTA CGTATAATCT GAACACAACC TACAGCACGA AATTATACCG CATAAATACG GCGACACAAG CTTATGCTAC CATCAATACA AGTTCGGTAT TCAACAAGCC CAGCATAACC GACGATGGTT CAATGGCGTA TTTTGTAAAC ACGGCCAACC AGCTGAAAAG CCTGTACCTG ACACCGGGCA ATACGTACGA ACAGATTATT CAGGACGAAC CTATCTGGAA CAATGTTGCA ATCAGTAAAA ACGGCAAACG TTTGGCCGCA ACAACAACCG ACAAGGATAC ATCTGTTTAT GTATATGATT TCGATAGTGA TACCTGGGCA CAATTTGTGT TGTACAACCC TACCTATTCA GAAGGAATTA AATCAGGCGG GCCTATCTAT GCGGATGCAT TGGAATGGGA TCACACAGGA GAGTTTTTAG TATACGACTG TTATAACGAA TTTGAAAATA CATCCGGTAA CAACATCAAT TTCTGGGACA TCAATTTTAT TCAGGTGTGG GATAATACCC TGAATGATTT TGGAGACGGA ACTGTAACAA AATTATTTTC ATCGCTGCCG GATCACATAA GTGTGGGTAA TCCTGCGTTT GCAAAAAATT CTACCAATAT CATTGCCTTT GATTACATAG ACGAAGATGA AGGTGATTTA TATGTTATCG GCTGCAATAC AGAAACAAAT GAACTGGATG TTATTACAAG CAGCAATGTA CTTGGTTTTC CGAATTTTAA CAGACTTGAC AATAAGATTG CTTATCTCTA TGAATTAAAT TCAGACCATA TGAAATCTAT CTGGACAGTT GATCTGGATG AAAGCAAAAT TACAGCGCTG CCAAACGGTT CAGATACATA CTATACAGAC AAATCAAACT GGCCGGTTTA TTATGCAACC GGCGTACGAT CCCTTCCCTC TTCTACTACA GCAAAGCACA CGAACACTTC GGAGAATCGT GCAAACGTAT ATCCGAATCC CGCATCTGCA GATTTCAGCA TACGGTTAAC GTCCGGTGAC CAAAGCCATG CCGTTATACA TATCAACAGT ACAACCGGCC AGCTCATTTA CAGTACTGCT GCCAATCTGC TCACAGGGGA AAACACCATT CCTGTTCAGC TGCCAGCATC TGTTGCTTCA GGATATTATA TTGTAACCAT TGAAACTGCT GATGAACGCT GGGTAAGCAA GCTGATTAAA AAATAA
|
Protein sequence | MKKPYSTLFL LLLTATHTFA QTSFDVKEKQ SNPAAHRSFE ATRIKGNMLQ QSILTAQGTS LLPQLSKPNT AAVRATAKNP FSVIYSNETG LPIFIKTIIP QTLQQRAVGT GSGIAIAYNY IDQLRETLGL TDVAEPFTPY KTEKDLLGGQ IIRLKQFYNG IEIDGCESIV HINASGQAVS WNGSYIKPDL IKHTSFAVTP AAAAAKALAD IKTHAHYVEL SEQEQQFLNY STPGIKQIYY IDDKLVRSCV PAYSIDVRPN FLDWWEYIID AQTGNILSSH SKTCHADGPR ISTGNDLNGV SRTINTYQTG SLYYTTDASR SMFKSSQSSF PDNPAGAIQT LDLNYTYGSN TKYKAITSST NSFNATAISA HYIAGKSYDY YSAIHGRTSI DGNGGTIISF INVADPDDGT PMDNAFWNGK AMYYGNGNTN FKPLAGGLDV GGHELTHGVI QNSANLNYQG ESGAINESMA DIFGCMIDSL DWKIGEDVVL LSKYPSGALR DLSNPHNGGT NINSRGWQPA HVSEKYSGTQ DNGGVHINSG ITNYAFYLLA QSTSRSKAEK IFYRALTAYL TRSSKFIDLR IACIAAATDL YTSNEATKTG IAFDQVGITG NSEVPTTPVS SNLPVNTGDE YLLTYNLNTT YSTKLYRINT ATQAYATINT SSVFNKPSIT DDGSMAYFVN TANQLKSLYL TPGNTYEQII QDEPIWNNVA ISKNGKRLAA TTTDKDTSVY VYDFDSDTWA QFVLYNPTYS EGIKSGGPIY ADALEWDHTG EFLVYDCYNE FENTSGNNIN FWDINFIQVW DNTLNDFGDG TVTKLFSSLP DHISVGNPAF AKNSTNIIAF DYIDEDEGDL YVIGCNTETN ELDVITSSNV LGFPNFNRLD NKIAYLYELN SDHMKSIWTV DLDESKITAL PNGSDTYYTD KSNWPVYYAT GVRSLPSSTT AKHTNTSENR ANVYPNPASA DFSIRLTSGD QSHAVIHINS TTGQLIYSTA ANLLTGENTI PVQLPASVAS GYYIVTIETA DERWVSKLIK K
|
| |