Gene Information |
Locus tag | Achl_1982 |
Symbol | |
ID | 7293443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2233976 |
End bp | 2234800 |
Gene Length | 825 bp |
Protein Length | 274 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643590386 |
Product | competence protein ComEA helix-hairpin-helix repeat protein |
Protein accession | YP_002488045 |
Protein GI | 220912736 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region; [TIGR01259] comEA protein |
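
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count of a complete CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


gene_len = cds_length(2233976, 2234800)
print(gene_len)                  # 825
print(protein_length(gene_len))  # 274
```

Under these assumptions the result agrees with the listed values: 825 bp is 275 codons, and 274 aa once the stop codon is excluded.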
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000000014049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGGACG ATGACCATGC CCGGGATGTC GACGGCGGGG GTTTCGAGTA CCGCGATGCC GCGGCCGGTG CTTTGGTTGC CGGAGACCCG CAAACGGCGG AGCGCGGCGG GGAGGCGCCA CACCTGACCG GCACCGGTCC TTCGCTCCGC TGGCGCCTGG GCCTGCGGCT CGCCGTAGCG GTCGGCCTTC TGGCCGTCAC AGCCGGTGTC CTGTTCTGGT GGCAGACCGC CGGTGGACGT CCTGAGATCC TGCCGTTGAA TACGGTCAGC CGCGAAAGCG GCCCGGCGCC GGATGCAGCA ACGGATAGCC AGGTCACCCC GGGCGCGGGG GAAGGTGGTG CCGGCCCGGA CCACCCCTCC ACGTCGGCAG CGGATGTCGT GGTGGTCCAC GTATCAGGAG CCGTCCTGGC CCCCGGCGTG GTGACCCTGC CGGCCGGGAG CCGGGTCCAT CAGGCCATCT CGGCTGCGGG CGGTGCCGCT GCCGACGCTG ACCTGGACCT TCTCAACCTC GCCGCCGCGG CTGAAGACGG GCAGAAAATC CACATTCCCC GGCAGGGCGA ACAGCCAGCG GCGGGCGGCG CAGGATCCCT GGGTCAGGGA AGTACCGCGC CGCCCGGGTC CGCCGGAGCC GGTGCCAGGA TCAACATCAA CACTGCCGGC GTTGAGGAGC TGGACGCCCT GCCGAAAGTC GGTCCCGTCC TGGCCCAGCG GATTGTCGAC TGGCGGAAAG AGCACGGGCC CTTCAGCGCT GTCGAGGACC TGGACGCTGT GGACGGCGTG GGACCCAAGA TGCTTGAAGC GCTGCTGCCC CTGGTGACCG TCTGA
Protein sequence | MLDDDHARDV DGGGFEYRDA AAGALVAGDP QTAERGGEAP HLTGTGPSLR WRLGLRLAVA VGLLAVTAGV LFWWQTAGGR PEILPLNTVS RESGPAPDAA TDSQVTPGAG EGGAGPDHPS TSAADVVVVH VSGAVLAPGV VTLPAGSRVH QAISAAGGAA ADADLDLLNL AAAAEDGQKI HIPRQGEQPA AGGAGSLGQG STAPPGSAGA GARININTAG VEELDALPKV GPVLAQRIVD WRKEHGPFSA VEDLDAVDGV GPKMLEALLP LVTV
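
The gene sequence is listed in coding orientation (it starts with ATG and ends with the stop codon TGA), so it can be translated directly even though the gene lies on the minus strand. The sketch below is a minimal, hedged illustration: it removes the 10-base block spacing, translates with the standard amino-acid assignments (which translation table 11 shares with the standard code; the two differ only in permitted start codons), and computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 and last 15 bases of the record are used as input.

```python
# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGCTGGACG ATGACCATGC CCGGGATGTC"  # first 30 bases of the record
last_blocks = "CTGGTGACCG TCTGA"                   # last 15 bases of the record

print(translate(first_blocks))  # MLDDDHARDV
print(translate(last_blocks))   # LVTV
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Applying `gc_fraction` to the full 825-base sequence would be the way to compare against the reported GC content of 70%; that value is not recomputed here.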