Gene CHU_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1208 
Symbol 
ID4185361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1383452 
End bp1386547 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content43% 
IMG OID638071203 
Productmetalloprotease 
Protein accessionYP_677821 
Protein GI110637614 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.226468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC CGTATAGCAC GTTATTTTTA TTGCTGTTAA CAGCAACACA CACGTTTGCG 
CAAACATCCT TTGATGTAAA AGAAAAACAG AGCAATCCCG CTGCACACCG CTCCTTTGAA
GCAACACGCA TCAAAGGAAA TATGCTCCAG CAATCCATAT TAACAGCGCA AGGAACAAGC
CTTCTTCCGC AGCTTTCAAA ACCAAACACA GCAGCTGTAC GTGCTACAGC AAAAAATCCG
TTCAGCGTAA TTTATTCAAA TGAAACCGGG CTGCCAATCT TCATTAAAAC CATCATTCCT
CAGACACTGC AGCAGCGCGC TGTTGGTACA GGAAGTGGAA TTGCGATTGC GTACAACTAC
ATTGATCAGC TGCGCGAAAC ATTAGGCTTA ACAGACGTTG CCGAGCCATT TACGCCATAC
AAAACAGAAA AAGATCTTTT GGGCGGACAA ATTATCCGGT TAAAGCAATT TTACAACGGC
ATTGAAATAG ATGGTTGTGA AAGTATTGTA CACATCAATG CAAGCGGACA GGCTGTTTCG
TGGAACGGCA GTTACATTAA ACCTGATCTT ATTAAACACA CCTCTTTTGC GGTCACGCCC
GCCGCGGCGG CGGCAAAAGC GCTTGCTGAT ATTAAAACAC ATGCACACTA TGTAGAACTG
TCTGAACAGG AACAGCAGTT TTTAAATTAC AGTACACCAG GCATCAAACA AATCTACTAC
ATTGACGACA AACTTGTGCG GAGCTGTGTA CCGGCTTACA GCATTGATGT ACGTCCAAAC
TTTCTGGATT GGTGGGAATA TATTATTGAT GCACAAACAG GAAACATCCT TTCATCGCAT
TCCAAAACAT GCCATGCCGA CGGTCCGCGC ATAAGTACCG GCAACGACCT CAACGGCGTA
TCACGTACGA TTAATACCTA CCAGACAGGT TCATTGTATT ATACGACAGA TGCCAGCAGA
AGCATGTTTA AATCAAGCCA GTCTTCATTC CCCGACAATC CTGCCGGTGC CATTCAAACG
CTTGATCTGA ACTATACATA TGGTTCCAAT ACCAAATACA AAGCTATTAC TTCCAGCACC
AACAGTTTTA ATGCTACGGC AATTTCGGCA CATTATATTG CCGGCAAGTC ATACGATTAC
TATTCTGCCA TACATGGCCG TACTTCTATT GATGGGAATG GCGGCACAAT TATCTCTTTC
ATAAATGTTG CTGACCCGGA CGATGGAACA CCAATGGACA ATGCGTTCTG GAATGGTAAA
GCCATGTATT ACGGCAATGG AAATACAAAC TTCAAACCCC TGGCCGGCGG CCTGGATGTA
GGCGGGCATG AACTGACGCA CGGTGTGATC CAGAATTCTG CCAACCTCAA TTACCAGGGC
GAATCCGGTG CCATCAACGA ATCAATGGCG GATATTTTTG GCTGTATGAT TGATTCTCTG
GATTGGAAAA TCGGTGAAGA TGTTGTACTT CTAAGTAAGT ACCCTTCCGG TGCCTTACGT
GATTTATCCA ACCCGCACAA TGGCGGCACA AATATAAATT CAAGAGGCTG GCAGCCTGCA
CATGTATCTG AAAAATACTC CGGAACACAG GATAATGGCG GCGTACACAT AAACAGCGGT
ATAACCAATT ATGCTTTTTA TTTATTGGCA CAGTCTACTT CAAGAAGTAA GGCTGAAAAA
ATATTTTACC GTGCATTAAC CGCCTACCTT ACGCGTTCTT CTAAATTCAT TGACCTCAGA
ATTGCATGTA TCGCTGCGGC AACCGATTTA TATACATCAA ATGAGGCAAC GAAAACAGGA
ATCGCTTTTG ACCAGGTAGG GATTACAGGA AACAGTGAAG TACCTACAAC ACCTGTATCA
AGCAACCTGC CAGTGAATAC AGGCGATGAA TACTTGCTTA CGTATAATCT GAACACAACC
TACAGCACGA AATTATACCG CATAAATACG GCGACACAAG CTTATGCTAC CATCAATACA
AGTTCGGTAT TCAACAAGCC CAGCATAACC GACGATGGTT CAATGGCGTA TTTTGTAAAC
ACGGCCAACC AGCTGAAAAG CCTGTACCTG ACACCGGGCA ATACGTACGA ACAGATTATT
CAGGACGAAC CTATCTGGAA CAATGTTGCA ATCAGTAAAA ACGGCAAACG TTTGGCCGCA
ACAACAACCG ACAAGGATAC ATCTGTTTAT GTATATGATT TCGATAGTGA TACCTGGGCA
CAATTTGTGT TGTACAACCC TACCTATTCA GAAGGAATTA AATCAGGCGG GCCTATCTAT
GCGGATGCAT TGGAATGGGA TCACACAGGA GAGTTTTTAG TATACGACTG TTATAACGAA
TTTGAAAATA CATCCGGTAA CAACATCAAT TTCTGGGACA TCAATTTTAT TCAGGTGTGG
GATAATACCC TGAATGATTT TGGAGACGGA ACTGTAACAA AATTATTTTC ATCGCTGCCG
GATCACATAA GTGTGGGTAA TCCTGCGTTT GCAAAAAATT CTACCAATAT CATTGCCTTT
GATTACATAG ACGAAGATGA AGGTGATTTA TATGTTATCG GCTGCAATAC AGAAACAAAT
GAACTGGATG TTATTACAAG CAGCAATGTA CTTGGTTTTC CGAATTTTAA CAGACTTGAC
AATAAGATTG CTTATCTCTA TGAATTAAAT TCAGACCATA TGAAATCTAT CTGGACAGTT
GATCTGGATG AAAGCAAAAT TACAGCGCTG CCAAACGGTT CAGATACATA CTATACAGAC
AAATCAAACT GGCCGGTTTA TTATGCAACC GGCGTACGAT CCCTTCCCTC TTCTACTACA
GCAAAGCACA CGAACACTTC GGAGAATCGT GCAAACGTAT ATCCGAATCC CGCATCTGCA
GATTTCAGCA TACGGTTAAC GTCCGGTGAC CAAAGCCATG CCGTTATACA TATCAACAGT
ACAACCGGCC AGCTCATTTA CAGTACTGCT GCCAATCTGC TCACAGGGGA AAACACCATT
CCTGTTCAGC TGCCAGCATC TGTTGCTTCA GGATATTATA TTGTAACCAT TGAAACTGCT
GATGAACGCT GGGTAAGCAA GCTGATTAAA AAATAA
 
Protein sequence
MKKPYSTLFL LLLTATHTFA QTSFDVKEKQ SNPAAHRSFE ATRIKGNMLQ QSILTAQGTS 
LLPQLSKPNT AAVRATAKNP FSVIYSNETG LPIFIKTIIP QTLQQRAVGT GSGIAIAYNY
IDQLRETLGL TDVAEPFTPY KTEKDLLGGQ IIRLKQFYNG IEIDGCESIV HINASGQAVS
WNGSYIKPDL IKHTSFAVTP AAAAAKALAD IKTHAHYVEL SEQEQQFLNY STPGIKQIYY
IDDKLVRSCV PAYSIDVRPN FLDWWEYIID AQTGNILSSH SKTCHADGPR ISTGNDLNGV
SRTINTYQTG SLYYTTDASR SMFKSSQSSF PDNPAGAIQT LDLNYTYGSN TKYKAITSST
NSFNATAISA HYIAGKSYDY YSAIHGRTSI DGNGGTIISF INVADPDDGT PMDNAFWNGK
AMYYGNGNTN FKPLAGGLDV GGHELTHGVI QNSANLNYQG ESGAINESMA DIFGCMIDSL
DWKIGEDVVL LSKYPSGALR DLSNPHNGGT NINSRGWQPA HVSEKYSGTQ DNGGVHINSG
ITNYAFYLLA QSTSRSKAEK IFYRALTAYL TRSSKFIDLR IACIAAATDL YTSNEATKTG
IAFDQVGITG NSEVPTTPVS SNLPVNTGDE YLLTYNLNTT YSTKLYRINT ATQAYATINT
SSVFNKPSIT DDGSMAYFVN TANQLKSLYL TPGNTYEQII QDEPIWNNVA ISKNGKRLAA
TTTDKDTSVY VYDFDSDTWA QFVLYNPTYS EGIKSGGPIY ADALEWDHTG EFLVYDCYNE
FENTSGNNIN FWDINFIQVW DNTLNDFGDG TVTKLFSSLP DHISVGNPAF AKNSTNIIAF
DYIDEDEGDL YVIGCNTETN ELDVITSSNV LGFPNFNRLD NKIAYLYELN SDHMKSIWTV
DLDESKITAL PNGSDTYYTD KSNWPVYYAT GVRSLPSSTT AKHTNTSENR ANVYPNPASA
DFSIRLTSGD QSHAVIHINS TTGQLIYSTA ANLLTGENTI PVQLPASVAS GYYIVTIETA
DERWVSKLIK K