Gene Information |
Locus tag | CPS_3909 |
Symbol | |
ID | 3523027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 4081078 |
End bp | 4084206 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637286355 |
Product | serine protease |
Protein accession | YP_270567 |
Protein GI | 71282552 |
COG category | |
COG ID | |
TIGRFAM ID | |
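The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for CPS_3909
length = gene_length_bp(4081078, 4084206)
print(length)                     # 3129
print(protein_length_aa(length))  # 1042
```

Both results agree with the Gene Length (3129 bp) and Protein Length (1042 aa) fields listed in the record.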
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTTTA AGAAATCCTT AGTACGGAGC CTGATTACTT TAGCAATCAC AGCTACTGCT AGCACGACTG TTCTTGCCAA TGAAGTTTCA GGTGATATCA GTAAATTCAA ATCTGTGGGA AATGAAGTTA ATACCAAACA AAAAGCAACT GGCTATATTG TTCAACTTAA AGGCAAGACA GCAATTGCTC AAGCTCAAGA AATAGGTGAG CTTTTACCTA CCAACCAATT AGTTGCTAAT ACCGGAAACC GTTATAACGC TCATACTCCT GCAATGGAAG CGTATACTAA AGCGCTTGAG AATAAGCAAA AGCAAGTTGC CAGCAGTATT GATTCAATTA ATATTTTGCA TTCTTTCAAG CATACTTATA ATGGTTTCAC TGCCAAGTTA AATGCAAAGC AAAAAGCGCA GTTAGAATCT CACCCTGATG TTATCGGTGT ATATGAAGAT AAATTAGAAA CAGTTAACAC TGCAAATACT CCTGAATTTC TTGGTTTAAC TGGTGCTGGC GGTCAACACG CCATGAATAT TAAAGGTGAA GGCGTCATTA TCGGTGTTAT CGATACAGGT GTTTGGCCTG AAAACCCTAG TTTTGCTGAC GATGGTTCTT ATTCAGACCC TGCAGACTTA GGTTGGTTGG GTTCATGTGA TACAGGGACT GATGAAGAGT TTGCCTGTAA TAACAAATTA ATTGGCGCAA AATATTTTGA TTCAAGCTTT AGTAGCCAAT ATGATATTCA ATATGACCTA GGTGAATTTG ATTCTCCTCG CGATGCTGAT GGTCACGGTA GTCATACTGC AAGTACAGCA GGTGGTAATG AAAGTGTAGC TGCTATGCTT TCTGGTACAC CGGTAGGTAC AGTTTCAGGC ATGGCGCCAC GTGCACGAAT TGCTGCTTAT AAAGTTTGTT GGAACAGCGA TTATAAAAAC CCTGAAGGCG GTGATGAAGC GGGTTGTTTC GGTGGCGATA CTATGGCGGC AATCGACGCT GCAGTTACTG ATGGTGTTGA TGTAATCAAC TATTCTATTG GTGGTAGCAG AACAGATTTA ACAGTACCTG CTACTGCAGC AATGTTAAAT GCAACGGCTG CTGGTGTATT CGTTGCTGTT TCTGCTGGTA ATGATGGTCC TGATAAAGAA ACTGTCGGTA CTCCAGCTCC TTGGGTGACA AGTGTAGCAG CATCTACTTA TAATGGTACT TCAGCTATTG TTGGTAAAGC GCTTGATATT ACTTCTGGCA CTTTAGCTGG CTCTTCAATC TTATCAGTAC CTTCTGGATT TTCTCCAGCA ACTGTAGGCC TTTCAGGTGA ACTTGCGTTA GCAGAACCAG TACAAGCATG TAATGATGCT CCATTGACTA ACGGCGAAGA TTTAGCCGGA AAAATTGCTC TTATTGCTCG TGGCTCTTGT GCTTTTACTG AGAAGTTTCT CAATGCACAA AATGCAGGTG CGGTAGGTGC TATTATTTAT ACTACAGAAG GTACATCGCC ATTTTCTATG GGCGGAACTG ATCCTGCAGT AACCATTACA GGTTCAATGA TTTCTTTTGC TGATGGTCAA TCGTTAACGG CAAGTATTGA AGATGGAAGT ACATCGGTTG CATTTACTGA TAATACAGCC GCGGGTGAAG CAGTAGAAGT TGGCAATACT ATGGCTGATT TCTCTTCACG TGGTCCAAAC TTAAATACAT ACGATATTAT CAAACCTGAT ATTACCGCTC CTGGTGTAAA AATATTGGCG GCAACAACTT CTGCACCAAT GTTCGGTACT CAAGGTGAAA CATTTAAGTA CCTGCAAGGT 
ACTTCAATGT CTAGCCCACA TATTGCCGGT TTAGCTGCAC TGTTTAAAGA ATCAAACAGT TCATGGTCTC CAGCACAAAT TAAATCAGCG ATGATGACAA CAGCTCGTCA AAACTTAACT AAAGAAGATG GTACTACCCA GGCAGACCCA TATGATTTTG GTTCTGGTCA TGTAGCTCCA GTTTCTGCTT TAGACCCAGG TTTATTGTTT GATACTAATC TTGCTGATTA TTTAGCATTT CTTTGTGGTC AAGACAAAGA AGCTTTTGTT TCTGGTTACG ACACAAGTTG TGCTGACTTA GCAACTGCAG GCTTTAGTAC TGATGCTAGT CAATTAAACC TAGCTTCAAT TGCTATTGCA GAGTTACTAG AACCTGAAAC AATTTTCAGA ACTGTTTCTA ATGCAACACC AATCGCTTCA TCTTACACTG CAACAGTTGA AGCCCCTGCT GGTTTTGACA TTAGTGTTCA AACCTTTGAT GCTGCTGGTG AAGAAACTGA AGCTTCAACA TTAGATGTTG CTGCTGAAGG CGGGAAAGCT AGTTTTGCAA TTACAGTTAG TCAAACTGAG ACTACTGAAA TTGAAGCTTG GAAGTTTGGT GCAATCACTT GGACAGACGG TGCTGGTCAT TCAGTACGTT TACCATTAGC AATTAAAGCG ATACCAAGCG TTCAAATTGA AGTACCTGAA CTAATCTCAG GTGACCTTAA CCGTGGACGT TTCCGCTTCC CTGTTAAAAT GCTTTATTCT GGTAGAACAA GCATTAAACA TGCTGGTTTA GTTGCTCCAT TTGGAACAGC GGGTACTGTT GAAGCTGATC CTGCACAAGA ATTTGAATTC TTAGGGGCTG GTACTAATTA CCACTTATTC CATATTCCAG AAGGAACACA AGTAGCACGC TTTAGCTTAT CTGATGCTCT AGTCACAGAA GAAGGTAGCG ATCTTGATTT ATACGTTTAC CGTTGTGATA AATGGAGTTG TGCACAGGTA GCTAACTCAT TAAACGGTGG TTCAAACGAA GATGTTGTAC TAACAAACCC TGAACCGCGT GCAGATGTAG ATGTTGGTGA TGTTTATGTA ACTATGATCC ACGGTTATTC AACGGGTGCA GCAACAGAAA CAGATTATAC TATGGTAGGT TGGATTGCAG ATCAGGCTGA AAGAACTACT CGTGTAATCT CAAGCCGAAG AGCAATTAAC GGTCGCTTTA ACTACACCAG TATATTAACT CGAGGTCTAC CAACAGGTAC TACCTATATG GGTGCTGTTA CGTACTTCAA TGCTGAAGGT GAAGCTGAAG GTACTACTGT ACTTGAGTTA AAAAACTAG
|
Protein sequence | MHFKKSLVRS LITLAITATA STTVLANEVS GDISKFKSVG NEVNTKQKAT GYIVQLKGKT AIAQAQEIGE LLPTNQLVAN TGNRYNAHTP AMEAYTKALE NKQKQVASSI DSINILHSFK HTYNGFTAKL NAKQKAQLES HPDVIGVYED KLETVNTANT PEFLGLTGAG GQHAMNIKGE GVIIGVIDTG VWPENPSFAD DGSYSDPADL GWLGSCDTGT DEEFACNNKL IGAKYFDSSF SSQYDIQYDL GEFDSPRDAD GHGSHTASTA GGNESVAAML SGTPVGTVSG MAPRARIAAY KVCWNSDYKN PEGGDEAGCF GGDTMAAIDA AVTDGVDVIN YSIGGSRTDL TVPATAAMLN ATAAGVFVAV SAGNDGPDKE TVGTPAPWVT SVAASTYNGT SAIVGKALDI TSGTLAGSSI LSVPSGFSPA TVGLSGELAL AEPVQACNDA PLTNGEDLAG KIALIARGSC AFTEKFLNAQ NAGAVGAIIY TTEGTSPFSM GGTDPAVTIT GSMISFADGQ SLTASIEDGS TSVAFTDNTA AGEAVEVGNT MADFSSRGPN LNTYDIIKPD ITAPGVKILA ATTSAPMFGT QGETFKYLQG TSMSSPHIAG LAALFKESNS SWSPAQIKSA MMTTARQNLT KEDGTTQADP YDFGSGHVAP VSALDPGLLF DTNLADYLAF LCGQDKEAFV SGYDTSCADL ATAGFSTDAS QLNLASIAIA ELLEPETIFR TVSNATPIAS SYTATVEAPA GFDISVQTFD AAGEETEAST LDVAAEGGKA SFAITVSQTE TTEIEAWKFG AITWTDGAGH SVRLPLAIKA IPSVQIEVPE LISGDLNRGR FRFPVKMLYS GRTSIKHAGL VAPFGTAGTV EADPAQEFEF LGAGTNYHLF HIPEGTQVAR FSLSDALVTE EGSDLDLYVY RCDKWSCAQV ANSLNGGSNE DVVLTNPEPR ADVDVGDVYV TMIHGYSTGA ATETDYTMVG WIADQAERTT RVISSRRAIN GRFNYTSILT RGLPTGTTYM GAVTYFNAEG EAEGTTVLEL KN
|
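The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The sketch below is one hedged way to do that: it strips the whitespace, computes GC content, and translates a CDS codon by codon. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for internal codons, and it does not handle alternative start codons (not needed here, since the gene begins with ATG). All function and variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base until the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final nine bases of the gene sequence in the record
print(translate("ATGCATTTTA AGAAATCCTT"))  # MHFKKS
print(translate("AAAAACTAG"))              # KN
```

The two printed fragments match the beginning (MHFKKS) and end (KN) of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.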
| |