Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2124 |
Symbol | |
ID | 3520571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2215419 |
End bp | 2218001 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637284582 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_268850 |
Protein GI | 71280108 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.118158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTGGT GGTTACTGAC ATTTTTTCTT GGTGCTATAT TGTCCCTATT TTTGCAGGAA GTGCCAGCGC TTTTTCAGCT ATTTTTACTT CTTTGTCTCG CTATTGGCTT TTATTCCCAT AAAAAACTAC GTTATAGCTC AGGATTATGG TTTGGCGCTT TATGGATTTT AGCTCAAGCT TACCTCTATC ATAACCTGTT ACCGCCACCT CTTATTGAAC TAATGGAAAA TAAACAAGCA TTTTTTATTG AAGGTGAGGT ACTTAGTATC CAAGTTAAGC CACCGATCAT GTTGAGCGAA CATATAAAGG AAAGCGCTGT TCAACAGAAA GCTAATTCCA CTAAACGCTT TAATTTTCTC GTTAATAAGA TAAATCAACA GCTACTTGAA TCCCCAATTA CTATTAGGCT AAGTTGGCAA AAGTCGACTA TTGACCTTGC TCAAGGTCAA AGGCTCTCTC TTAACGTTAA AGTAAAGCCA GCCCATGGAT TAGCAAATAT AGGCACGTTT AATTATTTAA GTTGGCTCAA AGCACATAAC ATCGTCGCAA CAGGCTATGT GGTTAATCCG AGAAAGAAGA AAAATTCAAA TTATCAAGAA GCGGAAGACT TAAAAAGCTT AAAAGCGAAT AAATTGCTAA TGGCAAATAT CACAATGAGG CAAGCACTAT TTGAGCACTA TCAAAGCCTT ACTCCAAATC ATAAGCTTAC CCCCATTTTA TTGGCATTAG CCTTTGGGGA GCGTAGTTCA CTTAATACCG AGCTTTGGCA GACACTACAA GTGACAGGTA CTAGTCACCT TATTGCTATT TCGGGTTTAC ATATAGGGTT ACTTGCTGGT AGTGCGTTTT TTATTGTGAT GTTTTTTTTT CAATATATCC CATTGAGAAA TCCTTGCTGG CAGCACATTA ACAGCCGTTA TATTGCTATT GCTGTGAGTT TGTTACTTGC AACTGCGTAT GCATATTTAG CGGGTTTCTC TCTTCCAACC CAACGTGCTT TGGTGATGTT AAACTTGTAC TGGTTAAGTC GCGTTGTTGG CATAAAATTC TCTGCGAAAC GTTTAATTTT AGTGACAATC TTTATTTTGT TGATTATCAC TCCTTTTAGC TTATTAACCG CTAGCTTTTG GTTATCTGTT TATGCTGTGG CTATTATCTT TGTATCACTA TGGCGATTTA AAGCATGGAT GAATAAGGGT CCATATCTTT GGCGTTTCTT CAAAACGCTA TTTATTATTC AAGTAGCACT TACCGTGATG TTAATGCCGA TAACTGCATT ATTTTTCCAA AAGATATCCT TAGTTAGTTT ATTTGCTAAT ATCATTGCTG TGCCTTGGAT GAGTGCTTTT AGCATACCTA CAGCACTCAT GTCAGTGGTC TTGATACCGA TAAGTGAGTC ATTAGCGCAA TGGTTTATGA TGTTATCTCT GCAATCGTTA ACCTGGTTGT GGTTTTACCT TGATTTACTT AGTGAGCTAC CTAATGCAAT TATTTCACTT TCATTCGTTC AACAAATGAT TGTACTGTTA GTGGGTAATG CTGCTTTTTC AATACTTTAC CTGTCGCCAT GCCTCTGGAC TAGGGGGGGT AAACAAATCA CTTTTGTTTT GCTAGCCCTC AGCGCGATTG TGTTTAGTTA TCATGAGCCA ATTATGTCCT CTTTAAATAC TTACGCTTGG TCAAAAGAGG TAAAAACGCC TAAGGATGCC ATTAATTTCC AAGCTGAGTC TGGCTTTAGC TCATGGGAAG TGATTTTTTT TGATGTTGGT CAGGGAACAT CGGTGTTAAT TAAACGAGAT GACCAGGCCA TTTTATATGA TACCGGAGCA GCTTACCCCA GTGGTTTTAC TATGAGTGAT GCGGTGATAC TGCCATTTTT ACAATACTCA GCCATTGAGA AGCTAGATAA AGTAATATTA AGTCACAGTG ATAATGACCA TGTTGGCGGC CTTAGGACCC TGATAGAGCA TATATCGATT GATGAAATAA TCAGTAATGA TAAAACACTC TTTAATTCTA AGGCCTTATC ACTTAATGCT CTCTCAACTA CTACGAATTT AGGCCCTAGT AATAGGCGAC TTACTGACTG TCAGCCACGT AATAGTTTTT CTTGGCAAGG GTTAAGGTTT GACATCTTAT GGCCTTTAGC TTGGGATCCT TCCAATGAAA GTGTTAATAG GGGCAAGCAA AAAAATGATG ACTCTTGTGT TATCTTAATT AGTGATCAAT TGGGAACTAC ACTGCTTTTA ACCGGAGACA TTTCCTCGAA AGTAGAGCAG AAGTTACTAA AATTTTATCC CCAGCTTAAT GCTGATATTT TACAAGTACC TCATCACGGT TCTAAAACAT CATCAAGTCA GGCATTTCTT AGTCAATTAT CACCTGACGT TGCGTTAGTT AGTGCAGGTT ATTTAAATCG TTGGCATATG CCCGTCGCTA TTGTTCGTCA GCGTTACCAT GATAGTAAAA TTCAATTGTT AAATAGTGCG GAGCTGGGGC AAATCATCAT AACCGTTGAT GAAGAGGGAA TGAGCACACA AAGCTTTACT GAAGATTTAC GCCCGTTCTG GTTTAGTCAT TAA
|
Protein sequence | MDWWLLTFFL GAILSLFLQE VPALFQLFLL LCLAIGFYSH KKLRYSSGLW FGALWILAQA YLYHNLLPPP LIELMENKQA FFIEGEVLSI QVKPPIMLSE HIKESAVQQK ANSTKRFNFL VNKINQQLLE SPITIRLSWQ KSTIDLAQGQ RLSLNVKVKP AHGLANIGTF NYLSWLKAHN IVATGYVVNP RKKKNSNYQE AEDLKSLKAN KLLMANITMR QALFEHYQSL TPNHKLTPIL LALAFGERSS LNTELWQTLQ VTGTSHLIAI SGLHIGLLAG SAFFIVMFFF QYIPLRNPCW QHINSRYIAI AVSLLLATAY AYLAGFSLPT QRALVMLNLY WLSRVVGIKF SAKRLILVTI FILLIITPFS LLTASFWLSV YAVAIIFVSL WRFKAWMNKG PYLWRFFKTL FIIQVALTVM LMPITALFFQ KISLVSLFAN IIAVPWMSAF SIPTALMSVV LIPISESLAQ WFMMLSLQSL TWLWFYLDLL SELPNAIISL SFVQQMIVLL VGNAAFSILY LSPCLWTRGG KQITFVLLAL SAIVFSYHEP IMSSLNTYAW SKEVKTPKDA INFQAESGFS SWEVIFFDVG QGTSVLIKRD DQAILYDTGA AYPSGFTMSD AVILPFLQYS AIEKLDKVIL SHSDNDHVGG LRTLIEHISI DEIISNDKTL FNSKALSLNA LSTTTNLGPS NRRLTDCQPR NSFSWQGLRF DILWPLAWDP SNESVNRGKQ KNDDSCVILI SDQLGTTLLL TGDISSKVEQ KLLKFYPQLN ADILQVPHHG SKTSSSQAFL SQLSPDVALV SAGYLNRWHM PVAIVRQRYH DSKIQLLNSA ELGQIIITVD EEGMSTQSFT EDLRPFWFSH
|
| |