Gene CHU_1338 details


Gene Information

Locus tag: CHU_1338
Symbol:
ID: 4185843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 1561612
End bp: 1563477
Gene length: 1866 bp
Protein length: 621 aa
Translation table: 11
GC content: 40%
IMG OID: 638071332
Product: U32 family protease
Protein accession: YP_677950
Protein GI: 110637743
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
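
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 1561612
end_bp = 1563477

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon, which is assumed not to be counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported gene length (1866 bp) and protein length (621 aa).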


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.138533
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.134167
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA AAGTTGAAAT ACTTGCTCCT GCCAAAAACC TATATCAAGG AATGGCAGCT 
ATCAATGCCG GAGCAGATGC TGTATATATT GGCGCACCTC AATTCGGAGC ACGAACCAAT
GCAACCAATC CGGTTGAAGA CATAGCAGAG CTTGTGCGTT ACGCACACTT ATTCAAAGCT
CAGGTATTTG TAGTATTAAA CACTATTTTA TACGACAACG AACTAGACAC CTGTGAAAAA
CTCATTCACG AGCTGTATCA TATTGGTGTA GATGCATTGA TCATTCAGGA CATGGCCATT
ATGGAAATGA ACATCCCTCC TATTGTGATC CATGCCAGTA CACAAGCCAA TAACCGCGAT
CCGAAACATG TAAAGTTTCT GGCAGATGCC GGCATGAAAC GCGCCGTGCT GGCACGCGAA
TTAAATTTAG ATCAGATCAG AGACATTGCT GAAGCAACGG ATGTTGAACT GGAATTTTTT
GTTTCCGGAG CCTTATGTGT ATCCTTCAGC GGCAATTGCT ACATGAGTAT TGCCGGCGGA
GAGCGCAGTG CTAACCGCGG CTCGTGTGCA CAAAACTGCC GCTTGCCTTA TAACCTGATT
GATGGTACAG GTAAAACACT TATTGCAAAC AGCCATTTAT TATCTATCAA AGATCTTGAC
TTAAGCGATC AATTACCAAA TCTCATTGAA GCTGGTATTA CTTCGTTTAA AATTGAAGGC
CGTTTGAAGG ATATTGTCTA TGTAAAAAAC AATGTATCCT ACCTGCGCAA AAAGCTGGAT
GCGTTCCTTG AAAATAACGA ACGTTTTGAA AAAGCTTCCT CCGGGCGCAC ATTCTACAAT
TTTGATGCTG AAATGGATCG CAGCTTCAAC AGAGGTTATA CCGACTATTT TGTAAACAAA
AGAACAGAGC GGATCGGCTC ATGGGACACA CCAAAATCTC AGGGACAAGT AATCGGTAAA
GTTATTGAAG TAAAACATAA CGGTTACGTC ATTGAAAATT CAGATAAACT AAACAATGGG
GATGGTTTAT ATTTCATAAA TGAAGCCGGT GAAGCCGATG GCGCGCAAAT AAATACAATC
ACCAATAATG TAGTTATTCC AAATACCTTT AAGCCAATTA AAGTCGGCAC AATGATTTAC
CGGAATGCCG ATGCGGAATT CAATAAATTG GTTGAACGGG AAGACAGTGC GATCCGTAAG
ATCGGCGTAT CGCTGCTATT CAGTGAAGTA CCTGAAGGTT TCCAGCTTAA AGCAATTGAT
GAAGACGGAC ATGAAAGTAT TTCAACACTT GACGTTCAGA AGGAATTAAG CAAAAATGGC
GACGGCGTTA TAGACAACAT TAAAAAAAAT CTGGCTAAAA CAGGAAATAC ACCGTTTATC
GTTGACAAGC TGGACGTAAC GCTTTCAGCA AATTGGTTTC TGCCTATTTC AAAAATAAAT
GAGATCAGAA GAATTGTATT AGAAGAACTG ATTGATGTAC GTGTTGCTTC ATACAATCGT
AAAGAATATC AAATCAAAAA AACGGATCAT CCATATCCGG TAGAAAAACT CGATTTCATG
TATAATGTAT CCAATAAAAT GGCCAGGACA TTTTACCACA GACATGGTGT TACTGAAATT
GAAAAAGCAT TTGAATTACA ATGGGACCCG GGCAAGGCAC GTGTAATGAC AACCAAATAC
TGCGTAAAAT ATGAATTAGG CAAATGTGCA CGCTATCAGC GCGACACCAT GGGCGAAAAA
GTTGTCGAGC CTTTAGTATT AAAGCATGGT GAAAATGAAT ACAAACTTAA ATTCAATTGT
AAACCTTGTG AAATGGAGAT CTGGGAAAAG GATGCCGATC TCGTTTTTGA TGAAGATGAT
TATTAA
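
The GC content field can in principle be recomputed from the gene sequence as printed. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks are only display grouping; `gc_percent` is a hypothetical helper name. Only the first line of the sequence is used as sample input, so the result is not the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGAAAAAGA AAGTTGAAAT ACTTGCTCCT GCCAAAAACC TATATCAAGG AATGGCAGCT"
)
print(gc_percent(first_line))
```

Passing the full 1866 bp sequence instead of `first_line` would give the value to compare against the reported 40%.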
 
Protein sequence
MKKKVEILAP AKNLYQGMAA INAGADAVYI GAPQFGARTN ATNPVEDIAE LVRYAHLFKA 
QVFVVLNTIL YDNELDTCEK LIHELYHIGV DALIIQDMAI MEMNIPPIVI HASTQANNRD
PKHVKFLADA GMKRAVLARE LNLDQIRDIA EATDVELEFF VSGALCVSFS GNCYMSIAGG
ERSANRGSCA QNCRLPYNLI DGTGKTLIAN SHLLSIKDLD LSDQLPNLIE AGITSFKIEG
RLKDIVYVKN NVSYLRKKLD AFLENNERFE KASSGRTFYN FDAEMDRSFN RGYTDYFVNK
RTERIGSWDT PKSQGQVIGK VIEVKHNGYV IENSDKLNNG DGLYFINEAG EADGAQINTI
TNNVVIPNTF KPIKVGTMIY RNADAEFNKL VEREDSAIRK IGVSLLFSEV PEGFQLKAID
EDGHESISTL DVQKELSKNG DGVIDNIKKN LAKTGNTPFI VDKLDVTLSA NWFLPISKIN
EIRRIVLEEL IDVRVASYNR KEYQIKKTDH PYPVEKLDFM YNVSNKMART FYHRHGVTEI
EKAFELQWDP GKARVMTTKY CVKYELGKCA RYQRDTMGEK VVEPLVLKHG ENEYKLKFNC
KPCEMEIWEK DADLVFDEDD Y
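
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this for the first 30 bases only. It assumes the gene sequence is printed in the coding (5' to 3') orientation and uses the standard codon assignments, which translation table 11 shares for non-start codons; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
print(translate("ATGAAAAAGA AAGTTGAAAT ACTTGCTCCT"))  # MKKKVEILAP
```

The output matches the first ten residues of the protein sequence listed in the record.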