Gene CHU_3802 details

Gene Information

Locus tag: CHU_3802
Symbol:
ID: 4184138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 4390192
End bp: 4392528
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 41%
IMG OID: 638073788
Product: protease
Protein accession: YP_680377
Protein GI: 110640167
COG category: [S] Function unknown
COG ID: [COG4412] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03296] M6 family metalloprotease domain
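
The coordinate and length fields above agree with each other, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon. Both assumptions match the values listed here but are not stated by the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4390192, 4392528)
print(length)                     # 2337
print(protein_length_aa(length))  # 778
```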


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.239343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGTAT TCTCCCTCTC TGTGCAGGCA TCTTACATGA AGGATGTACC TGTTAAAGTA 
AAGCAACCTG ATGGAACAGA GTTAACGTGT CTGGCAACGG GTAATGAATT TCACAACTGG
CTTCACGACG CAAAAAATTT CACTATTATA AAAAATCCTG TTACGGGATA TTATGTGTAT
GCAATTTTAG TGAATGGAAA ATTAGCACCG TCAGCATTTA TTGCTGGCAA AACCGATCCG
GCTTCAAAAG GTATTTTACC GGGTGTCAAT CTGAGTGAAA CCGAAGTAGC TAAAAAGACA
CAGGAACTGA AAGCTCAAAA AGCAATTCCT CAGTCAAACC TGAGAACTGC TGCAGCAGCT
CCTGTACGCC CGGCTACAGG TATCTATAAC AACCTTGTTA TATTTATCAG CTTCAGCGAT
CAGGCAGAAA TTTCAACGCC GCTTTCGCAG ATTACACCAA AATTTAATAC ATTAGGAATA
AATTCTGTGC GTGATTTTTA CAAAGAAGTT TCAAACAGTC AGCTGGATGT GGTAACAAGT
TTTTATCCGG TTTCTTCAGG AACATATCCG ATCTCGTATC ACGATTCTCA TCCAAGAAGT
TATTATACAA AAGATGCGCC GGATGGGTAC AACAATACAA GTCCGGATTT AACAACACGC
GAGCATACCT TATTGGCAAA TGCAGTAAGA GCTGTGCAAA ATCAGGTGCC GGCAAACTTA
TTGATCGATG GGGATAATGA TGGAAACATA GACAATATTG TTTTTGTAAT CAATGGATAC
AATGAAGGCT GGTCAGATCT GTTATGGCCG CACCGTTGGT CATTGTTCTC TGAAACAGTT
TCCATTAATG GGAAAAGAGC ATACGACTAT AATGTTATTT TTACAGAAGC ATTAGGCGTT
GGTGTGATCT GTCACGAATT GTTCCATTCT TTTGGCGCAC CGGATCTATA CCATTATACC
GATTATCCGC ATGATCCGGT GGGTGATTGG GATTTAATGG CAAGCGGCGA CTGGAATACA
CCAAGACATA TGACCGTACA TTTTAAACAA CGTTATGCGC ATTGGGTCGC CAGCGTACCA
ACACTTACTG CAGCAGGAAG ATATAGCCTG GCTCCGTTGT CACGTTCTCC TTATGCAGCT
TACAGAATCA ACCTGCCTGA TACAGATCAG TTCTTAATGT TAGAATACAG AAAAAAAGAA
GGCCGCTATG AATCCGCACT AAGCGGACAG GAAGGTTTAA TCATCTATAG AGTCAATCCG
AATATTGAAG GGAATGCAAA TAATGGTCCG GAAGAATTAT ATATCTTCAG ACCGGATGGA
ACAAACAGTG TAAATGGAAG TTTGTATCAG GCAGCCTTTT CTCAAACAAA CAGCAGGACA
CGTTTCGATG ATTATACCAA TCCGGCATGT TTTCTTACAG ACGGAACACC GGGAGGCTTA
CCTGCTTTTT ATGGCATTAC AAATGTATCT GTACTGGGTG ATGTTATAAG CTTTGATTAT
ATGGGTGGAG ATACAGGAAA TAAACTTCCT GTTGTTCAAA TTACAAATCC TAAAAGCACA
TCAACACTTA CTGCTCCGGC ATCGTTTGTA ATTGTTGCCA ATGCAAGTGA CGCAGATGGA
AGTATTGCAA AGGTAGAATT TTTCAACGGA ACAACATTAT TAGGAACAGT TACTTCAATA
CCATATAGCT TCTACTGGCA AAATGTTGCA GCAGGTACCT ATACAATAAT TGCTAAAGCT
ACAGATAATG GAGGTGCAAC CGCTACTACA TCTGTAACAA TAACAGTACA GCCAGGTGCT
TCACTGGAAG ATATTATCGG AAATGCATGC GGACAGAATG GCGGCACGGC TACATATAGC
CTGAATGCCT CTAACCGTAC AAATGCTACC GGATATAACT GGTGGTATAC AGGAAACAAG
CAATCCTTAA CACCTGTTGT GAATCAGCCT TATAACGCAG TAATGACATA TGGTTCGGCG
TTCTCCGGTG GTCAGTTGTG TGTAGGTGTA AACTATAGTG CTGCTCCGTG GTACAAACAA
TTCTGTAAAA ATCTTTCTGC TTGTCCATCA AACATTAATG CGTTCAGAGT AGATGAAACA
GAACAAACCA CAACAGTAGT TTCGCCAAAC CCATCTGTTG AAAGCTTTAC AATAACGTTG
AAAAAGCCAT CGGCGTTGAT CACGATAATG GATTCAAAAG GTACGGCTGT AAAACAACTG
TCTGGTGCGG AAGGAACGGT TGAATTTGGC AATGATCTGC TTACAGGAAT TTATATGCTT
TCTGTTGTGT ATACAGATAA CACAACAGAA AATATTCGCG TGGTGAAAAT AAAATAA
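
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before it can be measured or passed to other tools. The following is a minimal sketch of that step together with a simple GC-percentage calculation. The helper names are hypothetical, and the GC formula, (G + C) / total length, is an assumption about how the record's GC content figure is defined.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked sequence listing."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGTTCGTAT TCTCCCTCTC TGTGCAGGCA TCTTACATGA AGGATGTACC TGTTAAAGTA"
joined = join_blocks(first_line)
print(len(joined))  # 60
```

Joining every line of the gene sequence the same way should give a string whose length equals the Gene Length field.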
 
Protein sequence
MFVFSLSVQA SYMKDVPVKV KQPDGTELTC LATGNEFHNW LHDAKNFTII KNPVTGYYVY 
AILVNGKLAP SAFIAGKTDP ASKGILPGVN LSETEVAKKT QELKAQKAIP QSNLRTAAAA
PVRPATGIYN NLVIFISFSD QAEISTPLSQ ITPKFNTLGI NSVRDFYKEV SNSQLDVVTS
FYPVSSGTYP ISYHDSHPRS YYTKDAPDGY NNTSPDLTTR EHTLLANAVR AVQNQVPANL
LIDGDNDGNI DNIVFVINGY NEGWSDLLWP HRWSLFSETV SINGKRAYDY NVIFTEALGV
GVICHELFHS FGAPDLYHYT DYPHDPVGDW DLMASGDWNT PRHMTVHFKQ RYAHWVASVP
TLTAAGRYSL APLSRSPYAA YRINLPDTDQ FLMLEYRKKE GRYESALSGQ EGLIIYRVNP
NIEGNANNGP EELYIFRPDG TNSVNGSLYQ AAFSQTNSRT RFDDYTNPAC FLTDGTPGGL
PAFYGITNVS VLGDVISFDY MGGDTGNKLP VVQITNPKST STLTAPASFV IVANASDADG
SIAKVEFFNG TTLLGTVTSI PYSFYWQNVA AGTYTIIAKA TDNGGATATT SVTITVQPGA
SLEDIIGNAC GQNGGTATYS LNASNRTNAT GYNWWYTGNK QSLTPVVNQP YNAVMTYGSA
FSGGQLCVGV NYSAAPWYKQ FCKNLSACPS NINAFRVDET EQTTTVVSPN PSVESFTITL
KKPSALITIM DSKGTAVKQL SGAEGTVEFG NDLLTGIYML SVVYTDNTTE NIRVVKIK