Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0140 |
Symbol | |
ID | 4069725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 148330 |
End bp | 150936 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982140 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_589219 |
Protein GI | 94967171 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0038467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACTCCC CCGGGCCCAC TGACGTTCCG CGCTCTCCTT TCTGCGGCGG CAAACAACCG CTGTTTCTTA CGGCGTTGGC GTTTGGCTGC GGCATCCTTG CCGCGCGATT TGTCTTTCAC CCCTCAAATG CCTGGCTGAT CGCTGCGCTG CTGGTGGTGG CCTGCGCGAT TGGGTTGCGC AAGTCGCCGA CGGTCGCGTT TGCGGGGATG ATGCTCTCGG CCGCGATGGC TGGTGGGCTG GCGCATGAGC TGCGCGGGCC AGAGGGCGTG GACTATCCGC AGGCGATGAG CGATGGAGAG TGCACCGTTA CGGCACACGT GGTGCGCGAT GGCGTGTTGC AGCGCGGGAT GTTCGGCGGG CAGCAGCAGT CGGTGGACCT GGAGACGGAG CGAGTGCAAC TGGCCGATGG AAGTGGGTTC GGGAGGCCGG TGGGATTGCG GCTGTCGGTG TACGCGCGGA AATCCGATTA CGAGGATGAA GAGCAGCAGC AGGCAGGAGT GCTGCCGATG CCGGTGCTGC GCTATGGGCA GCGGGTACGG CTGACGGCGA AGCTGCATGA GGCGCGGAAC TACAAGAACC CGGGAGCGTG GGACTATCGC GGATATTTGC GGGCGCAGGG CATCGAGTTG CTGGGAAATG CGCGGGTGAG TTCGGTTGAA GTGCTGCCGG GTTTTGGCGG AAGCCGGTGG AAGCAGTCGC AGCATCGGGC ACGGGCGGCG GTGCTGGCGA AGATCCACGA GATCTGGCCG GCGGAACAAG CAGCGCTGTT CGATGCGATC GTGATTGGCG AGCGCTCCGA GCTGGGCAGT GAATTGAAGA CCAGTTTCCA GAGTACGGGG ACGTTTCACA TTCTCGTGGT TTCAGGGATG AACGTCGGGA TTTTGGCGTT TGGATTTTTC TGGCTGTTCC GACGCATCCG CATGGGAGAT GTGGTGGCGA CGATCTGCAC GCTGGCGTCG TCGTTTGCGT ATGCGTGGCT GACAGATTTG GGGTCGCCGA TTTTGCGGGC GGTGTGGACC CTGACGATTT ATCTGCTTGC GCGCTTGTTG TTCCGGCAGA GTTCGCGGCT GAATGCGATT GGCGTGGCCG CGCTGGTGAT CCTGGCTTGG TCGCCGGATG CGTTGTTCGA CGCGAGCTTC CAACTGACGT TCTTGTCGGT GGCGGTGATC GCGGGCGTGG TGGTGCCGTG GATCGAGAGG ACCTCCGATC CTTATCGCAA GGCGCTGCAA AACCTGAATA TCGTGCGCGC GGACCGCGGG TATCGGCCAC AGCAGCAGCA ATTCCGGCTG GATCTGCGGA TGATCGGGGG ACGGCTTGCG CGAGTCCTCC CACACCGGAT CGTGCGGCCA ATGTTGACGA CACCATTCCG CGTAACGTTT GCGGCGTATG AGCTGCTGCT GGTTTCGTTG CTAATGGAGT TCACGCTGGC GCTTCCGATG GCGGTGTACT TCCATCGCGT GACGCTGATG GCGCCGGTGG CGAATGCATT GGTGGTGCCC CTGACCGGTG TGCTGATGCC GGCGTGCGCG GCGGCAGTGT TGCTGGGATT TGTGTGGTTG CGACTGGCGA GATTGCCGGC GATGGTGGCG CTGTGGTCGC TGAAGGCGAT TACGGGTGCG GTGGCGGTGC TGGGGCATTT GCGGGTGTCG ACGGGGAGAG TGGCGACGCC ATCGCTCGTC GTTGCGTTGA GCGCTGCCGT GGCGATTGCG CTGGCGATGT TGCTGGCGCG GCGGAGGTGT TGGCTGGCAG CGGTCGGGAG CCTGGCGATC ATTGCTTCTG GTGCGGCGGT TCTCTTCATC GTTCCGCAAC CGCAGCATAA GTCCGGCGCG CTGGAAGTGA CGGCCATCGA TGTGGGGCAG GGTGATTCGC TGTTGCTGGT GTATCCGAAT GGGCAGTCGA TGTTGCTCGA TTCGGGTGGA CCGCTGGGTG GGTCGCATTC CGACTTTGAC GTGGGCGAGC AGGTGACTTC GCCGTATCTG TGGGCGCGTG GAATCAACCG ACTCGATGTT GTCGCTTACA GTCATCCGCA TTCGGACCAC ATGGGATCGA TGGCGACCAT CATTCGCAAC TTCCGGCCGC GGGAGTTGTG GCTGGGATTT GCGCCGCCGG TGAGGGATGT GGAGAACGTA TTGCAAGCCG CGCGGGAAGA GCATGTGACG GTTCGGTTCT TTCGCACGGG AGATCAGTTT GGTTTCGGTG GGGCGAATGT ACGGGTGCTG CTGCCTCTGC GAGACCAGGA GCCGCACATG CCGCCGAAGG ATGATGATGT TCTTGTGCTG AAGATTGCAT ATGGGAAGAC TTCAGCGCTG CTGATTGGGG ATTCGCATAA AAAGGAAGAG CGGGAACTGA TCGACCTCGC GCCAGAGGCG GATCTGTTGA AGGTGGCGCA TCACGGGAGC AATACGTCGA GTTCGCCGGA GTTCCTGGCT GCGGTGCATC CGAAATTTGG GGTGATTAGC GTGGGGGCGA GGAACTCGTT CAAGCATCCG CGGCCGGAGG TGCTGGAGCG GCTGGCGTCG TTTGGGGTGC AGACCTATCG GACGGATATG GCGGGGGCTA CGACGTTTTA TCTGGATGGA ACGAACGTGA GCGTCGTGCG GAGATGA
|
Protein sequence | MNSPGPTDVP RSPFCGGKQP LFLTALAFGC GILAARFVFH PSNAWLIAAL LVVACAIGLR KSPTVAFAGM MLSAAMAGGL AHELRGPEGV DYPQAMSDGE CTVTAHVVRD GVLQRGMFGG QQQSVDLETE RVQLADGSGF GRPVGLRLSV YARKSDYEDE EQQQAGVLPM PVLRYGQRVR LTAKLHEARN YKNPGAWDYR GYLRAQGIEL LGNARVSSVE VLPGFGGSRW KQSQHRARAA VLAKIHEIWP AEQAALFDAI VIGERSELGS ELKTSFQSTG TFHILVVSGM NVGILAFGFF WLFRRIRMGD VVATICTLAS SFAYAWLTDL GSPILRAVWT LTIYLLARLL FRQSSRLNAI GVAALVILAW SPDALFDASF QLTFLSVAVI AGVVVPWIER TSDPYRKALQ NLNIVRADRG YRPQQQQFRL DLRMIGGRLA RVLPHRIVRP MLTTPFRVTF AAYELLLVSL LMEFTLALPM AVYFHRVTLM APVANALVVP LTGVLMPACA AAVLLGFVWL RLARLPAMVA LWSLKAITGA VAVLGHLRVS TGRVATPSLV VALSAAVAIA LAMLLARRRC WLAAVGSLAI IASGAAVLFI VPQPQHKSGA LEVTAIDVGQ GDSLLLVYPN GQSMLLDSGG PLGGSHSDFD VGEQVTSPYL WARGINRLDV VAYSHPHSDH MGSMATIIRN FRPRELWLGF APPVRDVENV LQAAREEHVT VRFFRTGDQF GFGGANVRVL LPLRDQEPHM PPKDDDVLVL KIAYGKTSAL LIGDSHKKEE RELIDLAPEA DLLKVAHHGS NTSSSPEFLA AVHPKFGVIS VGARNSFKHP RPEVLERLAS FGVQTYRTDM AGATTFYLDG TNVSVVRR
|
| |