Gene CHU_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_3033 
Symbol 
ID4184966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3475831 
End bp3477390 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content38% 
IMG OID638073022 
Productserine protease 
Protein accessionYP_679616 
Protein GI110639407 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACTA TTATTCGCGC GGTAAGTGCA GTACTGTTTT TTTCTGTTGT TGTTTTACAA 
ACGAATGCTC AAATTCAAAA ATATTGGATT TCATTCAAAG ATAAAGAGAC TGTTGGATAT
AATTATAAAA ACAATTTAAG CCCTCAAACA ATTTTAAACA GAACGGCTTA TTCAATTCCA
TTACATCAAT ACACAGATAT TCCTGTTTCA AAAATATTTA TTGATTCAAT TGCAAAGCTG
GATGTTTTAA TCATTGCAAA ATCAAAATGG CTGAATGCGG TAACTGCAAA CTTAACACGG
GAACAGGCCG AACAAATAAA ACAGATTTCT TTTGTTGCTT CTGTTGAGCC GGTGAATATT
TATTTGGTCG GATCTTCTAC AAATGAGCTG GAAATTGCTC CTGAACTGAT GCATGCTGCC
ATGAAACAAA TGAAATCAAA AGCGTTTAAT GAAAAAGGCA TTGATGGTAA GGGGATCCGT
GTAGGTGTTA TCGATGCAGG TTTTTACAAA TTACATGAAG ATCCGGCAAC AAGTTATTTA
GTGCAGGATA AAAAAATATT GGGACAGCGT GATTTTATTG ATAAATCAAG AACAGACTTA
ATTGTAAATG CAGCAACATC TGCCGACGAC CACGGCAGGC AAGTTGTTCG GATGATTGCA
GGTTATGATA CCTCTATTAA AGCACAATAC GGCATGGCTG TTAATGCATC TTTTTATCTG
GCAAGAACTG AAAACGGCGA AAGAGAGTAC AGGGGCGAAG AAGATATGTG GATCATGGCA
ATGGAGTGGA TGGACAGCTT AGGCGTTCGA TTAATAAGCA CTTCGTTGGG TTATGCTACT
AAAATGGATG ATCCGAATGA CAATTATAAG CAGTCTGAAA TGGATGGAAA GACAGCACGT
ATCACAAAAG CAGCGCAGAT TGCATTTTAT CAAAAGGGAA TTTTCTTAGC TGTTTCGGCC
GGCAATGAAG GCGACACGCA ATGGAGAATT ATTTCGGCAC CAGCTGATGC TGAAGGAGCC
TTAGCTGTAG GTGCTACAAA AGCTTCCACC TGGGACCGCA TTTCTTACAG CAGTATCGGC
CCGGAACCAT TGCCGTATCT GAAACCGAAC GTGTCCTGTT ATTCTCCAAA CGGTACATCA
TTCTCTTGTC CGGCTGTAGC AGGATTTGTT GCTTGTATGA TGAACAATGA TTCAACCTTA
ACCAACGTTC AGTTAAAAGA AATTATTCAG CGTTCAGCGC ATCTGTATCC ATACGGAAAT
AATTTTATAG GTTATGGCAT TCCGCAGGCT GATCGTGCAT TGGTATTAAG CAAGGATCAA
AATACGGATT TTGGGAAAGC CGTTCTGATT CATAATTCGA AAAAGGTTTT TAAACATACA
TTCGACAAAT CCATAAAAGT TGAACTGGTG CTGTTCCATA AAAAAAATGA AACGATTGTA
ATTGATCAGC AAGTAATAAT GGTAAAGAAA GGAAAGCTCA AGATAAAGCG ACCTAAAAAT
GCTGAACGTA CAACAATTGT TGCAGACGAG TTTTTAACGC TTGAAATAAT TTGGGAATAA
 
Protein sequence
MQTIIRAVSA VLFFSVVVLQ TNAQIQKYWI SFKDKETVGY NYKNNLSPQT ILNRTAYSIP 
LHQYTDIPVS KIFIDSIAKL DVLIIAKSKW LNAVTANLTR EQAEQIKQIS FVASVEPVNI
YLVGSSTNEL EIAPELMHAA MKQMKSKAFN EKGIDGKGIR VGVIDAGFYK LHEDPATSYL
VQDKKILGQR DFIDKSRTDL IVNAATSADD HGRQVVRMIA GYDTSIKAQY GMAVNASFYL
ARTENGEREY RGEEDMWIMA MEWMDSLGVR LISTSLGYAT KMDDPNDNYK QSEMDGKTAR
ITKAAQIAFY QKGIFLAVSA GNEGDTQWRI ISAPADAEGA LAVGATKAST WDRISYSSIG
PEPLPYLKPN VSCYSPNGTS FSCPAVAGFV ACMMNNDSTL TNVQLKEIIQ RSAHLYPYGN
NFIGYGIPQA DRALVLSKDQ NTDFGKAVLI HNSKKVFKHT FDKSIKVELV LFHKKNETIV
IDQQVIMVKK GKLKIKRPKN AERTTIVADE FLTLEIIWE