Gene CHU_2524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2524 
Symbol 
ID4187049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2901358 
End bp2904708 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content41% 
IMG OID638072517 
Productsubtilisin-like serine protease 
Protein accessionYP_679120 
Protein GI110638911 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACA TTATAACCAT TAGATTTCAA CACATTGCAG GTATACTTTT TTTGATTTTC 
AGCAGCCTGA ATGTGTATGC ACAAAAAAAA GCTGTTCCGG CAGATTTTGA AGCAAACATC
CTGTTAATTA AAGTGAAACC GGAATTCATC ACATTTTTCA AGGATGCAAA CAGTGTGGAA
ACATTTAAGA TAAAAATTGC GCCGGCAATT CTTAAAACGG TGTGTAAAAC ATTTCCTGGT
ATTGAAGCGC CAACAGAACC GGATCACGTA GACTTATCAC GTATTTATAC AATAACCCTT
GAAGGCAAAA CAAAAGAAGA ACTCCCCTCT TTTGCACGTT TAGTAATGTC CTTCGGCTAT
TTTGATTATG TAGAATTAAA CTATATTCAG GAAAAACAAG CGGAATTTTA TCCCAACGAT
CCGGCAGTTA CCAGCGGCAA CCAGCATTAT TTAGGCCGTA TGGGTGCGTA TAAAGCCTGG
GCCATTGAAC AGGGAAACCC GAACGTTGTT ATTGGCATCA TTGATACCGG GGTTGATTTT
ACGCACAACG ACATTAAAGA TAATATTGCC TACAACCTGG CAGATCCGAT CAATGGTATA
GATGACGACG GAGATGGATA TGTTGATAAT TATCAGGGCT GGGATCTAGC CAATGGCGAC
AACGACCCGA ATGTTTCTCC TTCTGCAAAC CACGGAGCCA CCGTAGCAGG CGTAGCGGGC
GCTACGGCAA ACAATTCGCT GGGCGGAGCA GGTATTGCAT ACAACTCAAA AATACTTCCT
ATTAAAGCAT CGCTGGACGG CACTTCCGGT GCCATTAGTA AAGGGTATGA TGGTATAATT
TATGCAGCAA ACCACAATTG CAAAGTAATC AATTTGTCCT GGGGCGGAGT AGGAAATTAT
TCATCCGTTG ATCAGGATGT GATTAATTAT GCAGCCATCA ATAAAGATGT GTTACTTGTA
GCAGCCGCCG GCAACACAGA CCAGGAGGCC GATTATTATC CTGCGGCATA TGATAATGTA
CTTTCTGTTA TTGCAATGGA TACCATGTTC TCTTCCGCTG CAAATAAATA CATTGATACC
CGTGCAAGTT TTACAAACTG GGCATGCTGC TACAAGGCAA CGTATGCACG TTCTGTTGAC
ATTGGTGCGC AGGGAATGAA TTTGTATACA ACAATTCCAG GTAACGGCTA CATGAAACAG
GACGGCTCGT CTGCTGCAAG CCCGGTTGTT GCAGGAGCGG CAGCGCTGGT GCGTTCGCAT
TTTCCGAACA TTTCTGCCTT ACAGGCTGCA GAATTATTAC GTGTAACGGC CGACATCGTT
GATACATTCC CCGAAAATGC ACTGTATAAG GAAAAAATGG GCAAAGGAAG AATCAATGTC
TACAGAGCAT TAACTGATAC GCAATCTCCT GCTATCCGCT ATAAAAATTT AGTGCTGAAA
TCTGCTTACA GCAATCTGTT GTATACAGGA GATACCGTAT ATGTAACAGC AGATTTTTTT
AACTATTTAC GACCTTCTGC TACTTTAACC ATTGATCTGT CTTGTACTTC ACCGGATATA
ACTGTTATTT CAAACACATT TCAGGCATCG GTCATAGACA GTCTGCAGGC TAAATCAAAT
GCTTCCTCAC CGTTTAAATT TGTTATAAAA AATACCGCGC AGAATGATCA GGTCATTGAG
TTTCGCATTG GTTATACAGA TCCGGCGAAA GCATACACCG ATTATCAATA TTTCAAGATC
CAGATCAACC CTGCCTATAC AACACTATAC AACGATCATG TGCAAACCAC TATCACTTCA
AACGGACGGT TTGGCTATCA GGATATGTAC AACACGGTTG GCATAGGTTT TCTTTCAGAG
GGAATTAATG TTATTTATGA AGGAGGTTTA CTGATCGGTC AATCGCCAAC AAAAATTTCC
GATTGCGTAC GCGGAACTAC AACAACGGAA ATGGATTTCA AATCACTCAC TACACCTGGT
TTCTTTTCCA GCCCTATCAA GTACCGCGAA GTAGTATCCC GATTCAATGA TTCTTCTTCA
ACCAATACAA ACATACAGGG TATTGAATGT ATTCAGCGCT CGTATGCCTT CAATACTTCA
GATCAGGCTA ATACCCTATT CCTGGAATAT AAAATTATTA ATCATTCTTT GACACAAATT
GATTCGTTAT ATGTGGGTCA TTTTATCGAT TGGGATATAG AAAATTATTA TTCCAACCGT
GCCGGATATG ATGTAGATGC ACGCTTGGGA TATGCCTATA ATATCACCTC AAATGACTTA
TATGCCGGCA TCTCCTTATT AACAAATCAA ACGGTTAACT ATTACGCCAT GGACAATTCA
TTTGTGGGTG GAACAAACAT AAACCCGAAT GATGGGTATG ACGATGCCGA AAAATTCAGA
TCCATATCAA GCGGCTTCGG CCGTTTCGCT GCCGGAACAA ATTCAAGCGG AAACGATATC
TCAATGACCA TGGCCGGGCG CATCAATCAT TTAAAACCGA ACGATACCGT TACCATTGCC
TTTGCATTAC TTACTTCCCA CACCACTTTG CTAAACCTAA AACTTGCAGC AGCACGGGCA
AGACAAAAAT TTATTGAAAT ACATACAGCC CCTGTACCTA AAGCAGATTC TGTAAAAATC
TGCCGCAATG ATGCCACTGA TTTAATCATA AAACCAACAC CCGGCCAACT ATTTAATTTT
TATTCAGAAA AACCTGTACT GGCATCTGTG CCAGTATTTA AAGGAAATGC CTATACACTG
GCCAATGTTT CTGTTGCTGA TACCATTTAC ATTACAAGTA TGGATTCCTT ATATGAATCT
GCATTTTCAA CGTATGCAAT TCTTGATAAA AGACCTGTAG CAGCATTTAG TTATACTACA
GCAACAGCAA CAAATGAATC TGCATTCATT AATGAAACGC CACAATACCA ATCCCTTCTT
TGGGATTTCG GAGACGGCCA GACATCAACC GATGAAAATC CGATGCATGC ATATGCGCTG
CCGGATATGT ATACGGTAAT ATTAAAGGCA AGTAACGAAG CCTGTATCGA TTCTGTTGAG
CATACCTTTG TGATCACCTC TCCTCCCTTA AATACTACCG GTCAGCTGAA TACGACAGAT
GTCCGCGTGT TTCCCATACC CGCGCAGGAC GTGTTGATTC TTGAGTTTCA AAGTAATGTT
TCCGAAACGA TTGATGTGGA ATTATTTAAT GCAACAGGAC AATTCATTTC AGAGAAAAAA
AATATTCATA GCGCGAACAA TCAAGTTCAG CTGAATGTTT CTAACTTGGC TGCAGGACTT
TATTTCATTC AAATTAAAAA CACAAATACA GTACTTCGTT TTGTGAAGTA A
 
Protein sequence
MKHIITIRFQ HIAGILFLIF SSLNVYAQKK AVPADFEANI LLIKVKPEFI TFFKDANSVE 
TFKIKIAPAI LKTVCKTFPG IEAPTEPDHV DLSRIYTITL EGKTKEELPS FARLVMSFGY
FDYVELNYIQ EKQAEFYPND PAVTSGNQHY LGRMGAYKAW AIEQGNPNVV IGIIDTGVDF
THNDIKDNIA YNLADPINGI DDDGDGYVDN YQGWDLANGD NDPNVSPSAN HGATVAGVAG
ATANNSLGGA GIAYNSKILP IKASLDGTSG AISKGYDGII YAANHNCKVI NLSWGGVGNY
SSVDQDVINY AAINKDVLLV AAAGNTDQEA DYYPAAYDNV LSVIAMDTMF SSAANKYIDT
RASFTNWACC YKATYARSVD IGAQGMNLYT TIPGNGYMKQ DGSSAASPVV AGAAALVRSH
FPNISALQAA ELLRVTADIV DTFPENALYK EKMGKGRINV YRALTDTQSP AIRYKNLVLK
SAYSNLLYTG DTVYVTADFF NYLRPSATLT IDLSCTSPDI TVISNTFQAS VIDSLQAKSN
ASSPFKFVIK NTAQNDQVIE FRIGYTDPAK AYTDYQYFKI QINPAYTTLY NDHVQTTITS
NGRFGYQDMY NTVGIGFLSE GINVIYEGGL LIGQSPTKIS DCVRGTTTTE MDFKSLTTPG
FFSSPIKYRE VVSRFNDSSS TNTNIQGIEC IQRSYAFNTS DQANTLFLEY KIINHSLTQI
DSLYVGHFID WDIENYYSNR AGYDVDARLG YAYNITSNDL YAGISLLTNQ TVNYYAMDNS
FVGGTNINPN DGYDDAEKFR SISSGFGRFA AGTNSSGNDI SMTMAGRINH LKPNDTVTIA
FALLTSHTTL LNLKLAAARA RQKFIEIHTA PVPKADSVKI CRNDATDLII KPTPGQLFNF
YSEKPVLASV PVFKGNAYTL ANVSVADTIY ITSMDSLYES AFSTYAILDK RPVAAFSYTT
ATATNESAFI NETPQYQSLL WDFGDGQTST DENPMHAYAL PDMYTVILKA SNEACIDSVE
HTFVITSPPL NTTGQLNTTD VRVFPIPAQD VLILEFQSNV SETIDVELFN ATGQFISEKK
NIHSANNQVQ LNVSNLAAGL YFIQIKNTNT VLRFVK