Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1921 |
Symbol | |
ID | 7407334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2026225 |
End bp | 2028483 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643716293 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_002573782 |
Protein GI | 222529900 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00848355 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGAA AGGCGTTATT TGTTGCCAGT TTTATGATGA TAGGAATTGT TCTGGGCAGA AATATAAAAA AAATTGAGGT ACTTGTATTT TGCCTTTTGC TAATTTTAGG AGCGCTGTGT GCTACCTACT ATTTTCTTCC TCAGTACTTT AAAAAGGAAA AGTTTATGTT TATTTTATGT TTTCTTTTTC TCACGCTGCA GCTTTTCAGG ACTTATTACA TTTTCAATAT TCTTGAGCCT CAAAAAAATC TGGATGGGAA ACATGTTTAT ATTGTTGGCA ATATCTGCTC ATTTCCCGAG ATAAGCGATA AAAAGACTTC CTTTTACCTT AAAACAAAAC TAAATTCAAA GGCTGTTGTT ATTAGAGTCA CAACAGAGTC TAAAAAAAGT ATTTTTTATG GAGATACTGT AAAAGTTTCT GGAAAACTTA AAATTCCAAA AGGAAAGACA AGTAAATTTG GTTTTGATTA CAGAGAATAT TTGAAAGGCA AAGGTGCTAT TTATACACTT TACTCAAAAG ACATAGAGGT TATCTATCAA GGAAAAAATG TTCTCAATCT TCTCAATAGA TTTTCTACAC AGTTAAATAA CCTCATAGAT AGCTCTTTTG AAAATGATAT ATCTTCGCTT TTAAAAGGTT TGATTCTTGG CAACAAATCT ACAATTCCAG ATGATGTGTA CAAAGACTTT CAGCGAAGCG GACTTGCTCA CCTTCTTGCA GTCTCTGGTG GAAATGTAGG GGTGCTTTGC GCTTTTGTTG AGATTTTGTT CAGAAGAATA TTAAAGATAT ATGGTAAAGG AGTAAACTTT TTAATAATAG GTGTCATAGT TATTTTTGCT ATTGTCACAG GGATGTCAGC ATCAGTTGTT AGGGCTTCGA TTATGGCGAT AATCTTCTAT GCTGGAAGGA TTATTTACAG AAATCCTGAT ACGCTCAATA GCCTATCTGT ATCAAGCGTT TTGATGCTGC TTGTAAATCC GCTTTTTCTT TTTGACATTG GGTTCCAGCT GTCTTTTTTG AGTGTTCTTT CAATAATTCT GTTTTGTAAA GGGATATATG AATATTTTGC GAAGTTAAAG ATACCAAGAG GTATATCTTC ACTTATTGCA GTTTCAATTT CTGCTCAGAT TTTAATATTG CCGTTGATAG CTTATTATTT TTCTGAGATC TCAGTTATTT CATTCTTGAC AAACATAGTT GCTGTACCAG TTGCAGGTGC TGTTGTACCG GCTGGGCTGC TGTATTGTCT ATTATTGGTT TTCAATATAG ATATATTACC ATTTAAATGG TTTTTAGAAG TCTGTGTGAA CGTGCTAATG TACCTTTCAA GATTATCTTA TGTAGGATTT TCGCATGTAA AGGTCATTTT ATGGGATGAA AAGCTAATAT TTTGTTACTA TCTTGTTGTG GCATCTTTAA TTTTTAGAAA ATTTATAAAC AGGCAACTAA AATATGTGAT ATATTTGAGT ATTTGTGGAC TGCTTGTGGC ATTTATCTTA CAGACACTTA TAAATTACAA CAGGCTCATC ATAAACGTGA TAGACGTAGG GCAGGGAGAC AGTAGCTTTA TTACATACAA GGGATTTTCA ATGCTGATTG ACACAGGGCC TGAATATGAA GATTTTAGCA GCTTGAAAAG AATTGTTCTT CCGTATATAC TCAAAAGAGG AGTAGCAAAA CTTGATGTTT TAGTCTTGAC ACACAAGCAC AGCGACCATA TGGGGGACTT TGAGTATCTG CTTTATGAGA TGAAAGTGGA CACAATTGTA ACATCAAAAG AGGTATATTT TGAAAATGCT CAAAAGTTCA AAGGGCAAAA GGTTGTGTTA GTAGATAGCT TGAAAGTTTA TCGCTACAAG GATTTAAAAG CCTATTTTAT CCCACCAGTA GAGGAAGATG AAAATAGTTC TGTTGTTGTG AAGCTGACCC TTGGCAATTT TTCTATGCTA TTTACAGGTG ATGCCTCATA TGAGTCTGAA AAGGAATACG TAAAGAAATA TAACTTGCAG ACAAAGATCT TAAAGGTGGG ACACCATGGA AGCAGCACAG CAACATCTGA AGAGTTTTTG GAAAATGTAA AGCCAACATT TGCAGTAATT TCTGTAGGGA AAGACAACAT CTTTGGGCAT CCTTCGAATG AGGTCTTACA AAGACTCAAA GACAGAAACA TTAAGGTGTA TAGAACAGAT TTAAATGGAA CCATAGACAT TATAGTTGAC AGAAATAAGA TGATGGTAAA TCCGTATATT GTGAGGTGA
|
Protein sequence | MTRKALFVAS FMMIGIVLGR NIKKIEVLVF CLLLILGALC ATYYFLPQYF KKEKFMFILC FLFLTLQLFR TYYIFNILEP QKNLDGKHVY IVGNICSFPE ISDKKTSFYL KTKLNSKAVV IRVTTESKKS IFYGDTVKVS GKLKIPKGKT SKFGFDYREY LKGKGAIYTL YSKDIEVIYQ GKNVLNLLNR FSTQLNNLID SSFENDISSL LKGLILGNKS TIPDDVYKDF QRSGLAHLLA VSGGNVGVLC AFVEILFRRI LKIYGKGVNF LIIGVIVIFA IVTGMSASVV RASIMAIIFY AGRIIYRNPD TLNSLSVSSV LMLLVNPLFL FDIGFQLSFL SVLSIILFCK GIYEYFAKLK IPRGISSLIA VSISAQILIL PLIAYYFSEI SVISFLTNIV AVPVAGAVVP AGLLYCLLLV FNIDILPFKW FLEVCVNVLM YLSRLSYVGF SHVKVILWDE KLIFCYYLVV ASLIFRKFIN RQLKYVIYLS ICGLLVAFIL QTLINYNRLI INVIDVGQGD SSFITYKGFS MLIDTGPEYE DFSSLKRIVL PYILKRGVAK LDVLVLTHKH SDHMGDFEYL LYEMKVDTIV TSKEVYFENA QKFKGQKVVL VDSLKVYRYK DLKAYFIPPV EEDENSSVVV KLTLGNFSML FTGDASYESE KEYVKKYNLQ TKILKVGHHG SSTATSEEFL ENVKPTFAVI SVGKDNIFGH PSNEVLQRLK DRNIKVYRTD LNGTIDIIVD RNKMMVNPYI VR
|
| |