Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2237 |
Symbol | |
ID | 3521332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2337258 |
End bp | 2339261 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637284694 |
Product | collagenase |
Protein accession | YP_268962 |
Protein GI | 71280863 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.971967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC TAACAATACT TCTTCTGACT CTCGCTATAT CTGTTGTAAT TACTGCGTGT AGTGTTAACG ATGAGGCTGA TACTACTAAG GTCATGCAGT CTGTCTTAGT ACCAATAGTA AATCCAATGG TTGCTCAAGT ATTAGCGACT GATATAGAGC AGTTATGGAG CCAAGATTTT AATCATTATC AACAGGGATT ACTAAAAGCT GTTGCGGATG AAATATCAAT GGAAGCTTTA AAAGGAGACT TAACTAATGA TAAATTAGAG AAGCTTACTT TTTATCTTAG AATTTACAGT AGCTTTGGTG CAGATAAATA TTGGACCGAA GAAACCGCAA TATCTGTTAA TAGCGCATTA GACAATTTAT ATAATATGCC TGGTTTTTTT GAGGTGAGTC AGACTACAGC ACGCTTGCAT GAAAATTACG CTGTTGCTTT GTATCGTTTA TATTTTTTAG CACCTTTACA ACCATTTATA GTAGAGCAGG TAAAGCCGTT AAGTCAGTTA ATTAACCTGT ATGCTTCCGC TGATCTTTCT AATACAACTA CAGTGAACTC TGATAAAGAC ACAGCGATAG ATTATGCACT GTGGGAAGTA TTACGTGCTG GCGCTATTTT ACCCTACGAA GCCCGAAGAA AAAATACTGC TGAATTCATG AAAGGTGTAC ATGGTGAAGG TGAACTTCAA CAGGCATTAA TTCAATTTAT CACTGCTAAA AATAGTACTT TAGTGGGGGA TGACTGGCCT AAGCAACACG CTCTTTGGGC ATTGGCACAG TATTATAATT TATATACGAA AGAATATTGG AATGACTACT ATGAGCACTC AGCTGAGGAT CAGAAACGTT TAGATGACGA TAAATTAACC CTCAAGATTG AAGGTGAAAT GGATACGCTT GATAACAGTG TATGGGCGGC GTTAACCAAT GATAAAGCAA CATCAGTAGA GCAAAATAAG ACACTTTTTA GTGTGCCTTA TGTCGTCAAC ACTTTTCGTG GAAAGTCTGA ATGTGAAGAG GGTACGCTAG TCGATCGCTG CATTTCCCCA TCAATTGAAC AAGCATTGCC TATTAACCAT GAGTGCTCAA GTAAAATATA TATTTTAGCC CAAGCCATGT CTTCAGCACA GTTAAGTGAT GCTTGTCAGC AGTTAATCGC TCAAGAAAGT AATTTCCATG AAATATTAGC GACTAATAAT CAGCCCGTTG CCAACGATTT TAACGATAAA TTACGAGTCG TTATTTTTGA TAATCACGCA GAGTATAATA AATTCGGCCA GCTAATTTTT GATATTAATA CCGATAATGG TGGTATGTAC ATTGAGGGAA CGACACAAGA TCCTAACAAT ATTGCGACTT TTTACTCATT TGAACATTTC TGGGTACGAC CTGAATTTGC TGTTTGGAAT TTAAACCATG AGTTTGTTCA TTATTTAGAT GGACGCTTTG TTAAATACGA TACTTTTAAT CATTTTCCAA GTCATATGGT GTGGTGGTCT GAAGGACTCG CTGAATATGT TGCCAAGGAA GATAATAATC CAAAAACTTT CAAATTAGTC AATGACACAA CTCCAGAAGA CTGGCCCAGT TTAACGGATA TTTTTAATAC TGAATATAAA GACGGTACTG ATAGGGTATA TCGGTGGGGC TATTTAGCTG TGCGCTTTAT GAATGAAAAA CATCAAAATG AATACAGGAA AATGGCGCAC TACTTAAAAA CAGACTTTTT TGATGGTTAT AAAAAATTAG TTGAGGAGTC AGGTAAAAAG TATGCAGCAG AGTTTACTCA ATGGTTAGAT GAACATAATG CTAACTATGT GGCGGAAGAA GATGTAAATA ACCCACATAA ACCACGTCAA TTCTATCGTT ATACGTATAA AGATTACTTA CAGCCAAGTC ATTTAACGGA AGATAAGCTG CATATGCACT GGCAGTATTG GCATGAAAAT GCTTTAAAAT CATTAGATAA AAAATTGGCT AATAAAAATA CTGTGACTAA ATAG
|
Protein sequence | MNKLTILLLT LAISVVITAC SVNDEADTTK VMQSVLVPIV NPMVAQVLAT DIEQLWSQDF NHYQQGLLKA VADEISMEAL KGDLTNDKLE KLTFYLRIYS SFGADKYWTE ETAISVNSAL DNLYNMPGFF EVSQTTARLH ENYAVALYRL YFLAPLQPFI VEQVKPLSQL INLYASADLS NTTTVNSDKD TAIDYALWEV LRAGAILPYE ARRKNTAEFM KGVHGEGELQ QALIQFITAK NSTLVGDDWP KQHALWALAQ YYNLYTKEYW NDYYEHSAED QKRLDDDKLT LKIEGEMDTL DNSVWAALTN DKATSVEQNK TLFSVPYVVN TFRGKSECEE GTLVDRCISP SIEQALPINH ECSSKIYILA QAMSSAQLSD ACQQLIAQES NFHEILATNN QPVANDFNDK LRVVIFDNHA EYNKFGQLIF DINTDNGGMY IEGTTQDPNN IATFYSFEHF WVRPEFAVWN LNHEFVHYLD GRFVKYDTFN HFPSHMVWWS EGLAEYVAKE DNNPKTFKLV NDTTPEDWPS LTDIFNTEYK DGTDRVYRWG YLAVRFMNEK HQNEYRKMAH YLKTDFFDGY KKLVEESGKK YAAEFTQWLD EHNANYVAEE DVNNPHKPRQ FYRYTYKDYL QPSHLTEDKL HMHWQYWHEN ALKSLDKKLA NKNTVTK
|
| |