Gene Coch_2116 details

Gene Information

Locus tag: Coch_2116
Symbol:
ID: 8368577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Capnocytophaga ochracea DSM 7271
Kingdom: Bacteria
Replicon accession: NC_013162
Strand:
Start bp: 2537227
End bp: 2538477
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 42%
IMG OID: 644984570
Product: peptidase U32
Protein accession: YP_003142221
Protein GI: 256820942
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
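
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch; the variable names are illustrative and not part of the record. It assumes 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. Both assumptions are inferred from the numbers and are not stated in the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 2537227
end_bp = 2538477

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, so the protein
# has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1251 416
```

Under these assumptions the result agrees with the Gene Length (1251 bp) and Protein Length (416 aa) fields.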


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00231465
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCATT CAGGAAAAAT TGAACTAATG GCACCAGCTG GTAATTTTGA GTCGCTACAA 
GCGGCGATAG ACAATGGTGC CGACTCAGTG TACTTTGGGG TAGACCAGCT GAATATGCGC
GCAAGGGCAA GTATCAACTT TACGATTGAC GACCTTGATG AAATAGCGCG CCGTTGTGCT
CCTAAGGGCA TTCGCACTTA TCTCACCCTT AATACTATTA TTTATGACCA CGACCTATCT
ATCATCAAAA CACTATTAGA CGCTGCCAAA AAAGCAGGTC TTACAGCTGT AATAGCTATG
GATCAGGCAG TCATAGCTTA TGCTCGACAA ATAGGAATGG AGGTACATAT CTCTACCCAA
ATCAATATCA CTAATATTGA AACCGTGCGC TTCTACGCGA TGTTTGCTGA TACAATGGTA
ATGAGCCGTG AACTGAGCTT ACGACAAATC AAGAAGATAT GTGAGCAGAT AGAAAAAGAG
CAAATCAAAG GACCTTCGGG CAATTTGGTA GAAATAGAAA TATTTGGACA CGGGGCACTT
TGTATGGCGG TATCAGGCAA GTGCTACCTG AGTTTGCACT CACACAACTC ATCAGCCAAT
CGCGGAGCTT GCAAGCAAAA CTGTCGCAAG AAATACACCG TAATCGACCA AGAAAGCGGT
TTTGAGATAG AATTGGATAA CGAGTATATG ATGTCGCCTA AAGACCTCTG CACGATTGAC
TTCCTAGACC AAGTAATCGA CACAGGGGCA AAGGTATTAA AGATTGAAGG ACGTGGGCGC
GCTCCTGAGT ATGTGGCTAC CGTTATTCGC ACTTACCGAG AAGCAATAGA TGCTTATTAC
GCAGGCACAT ACAGTAAAGA AAAATTTGAA AGCTGGATAG AAGCCCTCAA AACGGTGTAC
AATCGTGGTT TCTGGAGTGG ATATTATTTA GGGCAAAAGC TCGGTGAATG GAGTGAAAAC
CCAGGCTCTA ATGCTACCCA AAAGAAAGTG TACATTGGGC AAGGTAAACA CTATTTCCCT
AAGACTGGTA TAGCTGAGTT TGCTATTGAA GCCTTTGATA TAAAGATAGG CGACAAATTA
CTTATCACTG GACCTTCAAC AGGCGTTCAA GAAATAGAGC TGACCTCAAT GATGGTAAAC
GATACTCCTG CTGAAAGAGC TAAGAAAGGT GATTCTTGTA CTATCAAAAC CAATTTCAGA
ATAAGGTTAT CAGATAAACT GTATAAAATA GTAAAAACAA ATATCAATTA G
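
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence displayed in this grouped layout. The following is a minimal Python sketch. `gc_percent` is a hypothetical helper, not part of any database tooling, and it assumes the sequence is pasted as a string that still contains the spaces and line breaks shown above. How the record rounds its reported percentage is not stated, so the function returns the unrounded value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between 10-base groups and the line breaks)
    is discarded before counting. Hypothetical helper for illustration.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# The first two 10-base groups of the gene sequence above:
# 6 of the 20 bases are G or C.
print(gc_percent("ATGACTCATT CAGGAAAAAT"))  # prints: 30.0
```

Passing the full gene sequence instead of the two-group excerpt gives the value to compare against the GC content field.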
 
Protein sequence
MTHSGKIELM APAGNFESLQ AAIDNGADSV YFGVDQLNMR ARASINFTID DLDEIARRCA 
PKGIRTYLTL NTIIYDHDLS IIKTLLDAAK KAGLTAVIAM DQAVIAYARQ IGMEVHISTQ
INITNIETVR FYAMFADTMV MSRELSLRQI KKICEQIEKE QIKGPSGNLV EIEIFGHGAL
CMAVSGKCYL SLHSHNSSAN RGACKQNCRK KYTVIDQESG FEIELDNEYM MSPKDLCTID
FLDQVIDTGA KVLKIEGRGR APEYVATVIR TYREAIDAYY AGTYSKEKFE SWIEALKTVY
NRGFWSGYYL GQKLGEWSEN PGSNATQKKV YIGQGKHYFP KTGIAEFAIE AFDIKIGDKL
LITGPSTGVQ EIELTSMMVN DTPAERAKKG DSCTIKTNFR IRLSDKLYKI VKTNIN