Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_3335 |
Symbol | |
ID | 3518641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 3457409 |
End bp | 3460360 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637285783 |
Product | serine protease |
Protein accession | YP_270010 |
Protein GI | 71277763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA TACTCCCACT CATGTGGGTA ATTATACTAA TGTTCGTGAT CTCCCAAGGC GTTAGCGCCA AAGGAAAAAA AGAAATCCCC AAATCAGAGT ATGGCTCCTA CATTGTTATT ATGGACCTGA ACCCTGCAAT TGCTTATGAG GGCGATATCA AGGGCTTTAA GGCCACTAAA CCGGGCAAAA ATAAAAAGAT AAACCCCAAA AGCGCTAATG TTCGTAAATA CACCAGCATG CTCAGCAAGA CCCACGACGC GGCTCTTGCA AAGGCTAATG TCAAATCTAA AGACAAGGTG CATGACTACG GCATCGCTCT TAACGGCTTT AGTGCCAAAA TGACTCACGA GCAAGCCGTA GCTCTTTCCT CGCAGGATGG TGTCGCGAAA GTTATGCCCG ATGTATTGCG CCAGAAGATG ACGGATAACA GTCCCAGTTT TCTTGATTTA GGTGGTCCGG CGGGTCCCTG GCTTAAAGGT TATGACGGCG AAGGCATCGT TATCGGCGTA ATTGATACCG GAATCTGGCC AGAACATCCG TCGTTTACTG ACGACGGATC TTACAGCACC CCCCCTATTT TACTTGATGA CTCGCGTCCA AATTGCGAAT TTGGCAACAC GGGACACCGT CCAGATGATG TTGCCTTCAG CTGTAACAAC AAACTGATTG GTGCCAGACA GATGCTGGAC ACTTATCGCC TTATCGTGGG CGCAACGAGT GATGAGTTTG ACTCTGCTAG AGACGAAGAT GGACATGGAA CTCATACTTC TTCAACATCA GGCGGAAATG CAAACGTCCC AGCGAATATG TTAGGCAATG ACTATGGTCT GATTTCAGGC ATTGCCCCAA GAGCTCACAT CGTTATGTAT AAAGGGCTTG GTGATCTGGG CGGTTTTGGA TCTGACCTAG CAGCTGCTAT TGACCAGGCA GTAGCTGACG GTGTAGACGT TATCAACTAC TCCATCGGAT CAAGTAGCTT TGCAATCGGT CCCGATGATG TTGCCTTTTT GTTTGCCGAG AATGCCGGAG TGTTTGTTGC GACATCGAAT GGTAACAGCG GTCCAGCCCC AGCTACAACA GGTTCCCCTG CCTCAACACC GTGGGTAACC TCTGTGGGTG CCAGCACCCA GAATCGAACT TATCAGGGCT CGGCCTCTTC AGTCGGAGAG TGGGAATTTT TTGGAGCTTC AATCACTGCA GGCACCGCTG AATTAGCACT AATCGATTCT GCAGAAGCGG GGAGTGAACT ATGCATTCCC GGTGTTTTAG ATCCAGTGGC AGTGGCCGGG AAAATTGTCC TGTGTCTTCG GGGGGCTATT GCCCGTGTAG ACAAAAGTAA GGCAGTAAAT ATTGCTGGTG GCGCAGGCAT GATCCTCTAC AATGCCAACG ACGGCGAGAG TCAAGTTACT GATTCACACT GGGTTCCCTC TGTGCATATC AACAATACAG ACGGTCTAGT TATCAAAGGC TATATCTCTA ACGATGCCTC AACTGCGGTT GCCCAAATAA TGGGTGGCAC CTATACAGAA ATAGACGCAC CTTCAATGGC TGGCTTTTCA TCTCGAGGCC CCAACCTGTT ATCTGGGGAT ATTATCAAAC CAGACGTAAC AGCTCCCGGT GTTAACATCA TTGCGGGCCA AACACCCGCA TCCGAAGGCC GTGGTGAACT GTTTCAGATG ATATCCGGAA CGTCCATGTC CAGCCCTCAT GTGGCTGGTT TGTTTGCAAT GATCAAACAA GCTCACCCTA ACTGGTCGCC ATCTACCGCC AAATCGGCCC TGATGACCAC CGCGTATCAG GATGTGATGA AGGAAGACGA AGCGACTCCA GCTGATGCGT TTGATATGGG CGCTGGTCAT GTTAACCCCG GCGGCAAAGC AAACAAGGGA TCGATATTTG AGCCCGGCCT TGCCTATCAA GCAGGTTTGT TCGAGTATGC AGCTTATAGC TGTGGCGCCG AGCTCGGCAT ATTTAGTCCG GGAACCTGCG GCTTTTTGGA ATCCTTAGGG ATTCCTACTG ACCCGGCTAA TCTCAATCTG CCTTCTATCG GTATAGCCAA TGTTATCGGC AGCAAAACCG TTTATAGATC TGTTACTGGG GTCGCCAAAG ATAGCGGTTG GAGAACTTAC AGCGTTGACG TTGATGCTCC TGCTGGATAT GAGGTTTCGG TGTTGCCAGC CAGCATAAAA CTTAAATCCG GTATGTCGGC AACTTATGCA GTTACCATTA CCAACACGGC ATCTCCTGCA GGCGAGTGGG CCCACGGCTC CATCACTTGG AGAGATTCAA ATGATCATTA TTCCGTGTAC AGCCCAATTG CAGTCAAGGG GGCTCTATTT GAAGCACCTG CTAACATCAC TGGAAGCAGT GAGACTGGCA GCGCAAGCAT TGACGTGACT TTCGGCTACA CTGGTGATTA CACGGCCAGC GGCTATGGTT TGACTGCAGC CACTGTTGAT ATCGACAGCG TTGTTCAGGA TCCGGATCAA ATATTTGACC CAGGCGATAC ATTCTCTAAC GCACACGCTA TTGTAGTAAG TGGTGCGGCG TATTTACGAA TTGCGATCCC CGGCGTAGCT GATCCGAACG CAGATCTGGA TATATTCCTT TTGGACTCTG TTGGTAATAT CGTTGGCGTG AGTGCCAATG GTGGTACCGA TGAGTTGATT GAAATGGAAC TTCCAGGAGA TGACACCTAC ACCCTTTGGG TTCACGGTTG GTCTGCCCCA GGCGGGAGCA CTGACTATGA ACTTTACAGC TGGGTAGTTC CCATGGCTAG CGGCAGTCTG ACAGTTGCCA GTGCACCCAG TTCAGCAACA TTGGGTGCGA CGGAGACAAT TGGCGTAGAC TGGACCGGTG CAACCAATGG AAAATGGCAT TTTGGAGTAA TAGGTCACTC TGACGCAGGA GGCCTGATAG GCGCTACCTT AGTTGAAGTA GATAACCGCT AG
|
Protein sequence | MKNILPLMWV IILMFVISQG VSAKGKKEIP KSEYGSYIVI MDLNPAIAYE GDIKGFKATK PGKNKKINPK SANVRKYTSM LSKTHDAALA KANVKSKDKV HDYGIALNGF SAKMTHEQAV ALSSQDGVAK VMPDVLRQKM TDNSPSFLDL GGPAGPWLKG YDGEGIVIGV IDTGIWPEHP SFTDDGSYST PPILLDDSRP NCEFGNTGHR PDDVAFSCNN KLIGARQMLD TYRLIVGATS DEFDSARDED GHGTHTSSTS GGNANVPANM LGNDYGLISG IAPRAHIVMY KGLGDLGGFG SDLAAAIDQA VADGVDVINY SIGSSSFAIG PDDVAFLFAE NAGVFVATSN GNSGPAPATT GSPASTPWVT SVGASTQNRT YQGSASSVGE WEFFGASITA GTAELALIDS AEAGSELCIP GVLDPVAVAG KIVLCLRGAI ARVDKSKAVN IAGGAGMILY NANDGESQVT DSHWVPSVHI NNTDGLVIKG YISNDASTAV AQIMGGTYTE IDAPSMAGFS SRGPNLLSGD IIKPDVTAPG VNIIAGQTPA SEGRGELFQM ISGTSMSSPH VAGLFAMIKQ AHPNWSPSTA KSALMTTAYQ DVMKEDEATP ADAFDMGAGH VNPGGKANKG SIFEPGLAYQ AGLFEYAAYS CGAELGIFSP GTCGFLESLG IPTDPANLNL PSIGIANVIG SKTVYRSVTG VAKDSGWRTY SVDVDAPAGY EVSVLPASIK LKSGMSATYA VTITNTASPA GEWAHGSITW RDSNDHYSVY SPIAVKGALF EAPANITGSS ETGSASIDVT FGYTGDYTAS GYGLTAATVD IDSVVQDPDQ IFDPGDTFSN AHAIVVSGAA YLRIAIPGVA DPNADLDIFL LDSVGNIVGV SANGGTDELI EMELPGDDTY TLWVHGWSAP GGSTDYELYS WVVPMASGSL TVASAPSSAT LGATETIGVD WTGATNGKWH FGVIGHSDAG GLIGATLVEV DNR
|
| |