Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2245 |
Symbol | |
ID | 4445167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2525395 |
End bp | 2526375 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639690054 |
Product | helix-hairpin-helix repeat-containing competence protein ComEA |
Protein accession | YP_831725 |
Protein GI | 116670792 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACGCC GGAACGCGGA AGCGGGAGCA CCTGCCGCGG CCAGCCGCGC GCGCCGGCGC CTGGCAACCA CACTGGGGCC TTCCGGCGGC GGTGACGGCG GTGGATCCGA TGCCGGGAAC CCGGGTTCGG CGGGGCTCCT GGGGTACGGC GCCGGGGTTC ACGGCCACGG GTCTTTCACG TATGACGGCG GTGCGCAGCC GGAACCTGGC GCCGTTGGCG GCTCGGCCGC CGATTCATCC GCCGTTTCCG GAACACATGC ACCGGCGAAG TTCCGCTGGC GCTCGGGGTT TCGCGTTGCG GTGCTGCTGG GACTGCTGAG TCTCCTGCTG GGCGGCTGGT TCTGGTGGGA TGTGGCAGCC AGCCGTCCCC ACGTGGTGCC CTTGAGCGAC GTCAGCAGCC CGGAAGTCAG CGCGCAACAG GAAGGACATC CCGGTTCAGG GCCGGGAGGA TCCGACGGCG CCTCAACGGG ACGACAGCCT TCAGGTGCCG CATCCGGAGC GAAGATCATT GTCCACGTGG CGGGGGCCGT CAACCGGGCG GGCGTTGTGG AACTGCCGGA AGGCAGCCGG GTCCACGAGG CCATCGCCGG TGCGGGTGGA AGTGCGGAAG GGGCTGACCT GAACCGGTTG AACCTTGCCG CCGTCCTGGC CGACGGCCAG AAAATCCACG TTCCGCTGGT GGGGGAACCG GTGGACGCCC CTGGGGCGGC AGCCGGCGCC ACCGGACCGG GAGCCGCGGG ATCGGGGCCC GGCGACTCAG TCCCCGGCCA AACGGGAGCG GAGGGCGGGA AGATCGACCT GAATTCGGCG TCTGCCGAGG AACTGGGCGC ATTGCCCCGG GTTGGTCCCG TGCTGGCCCA GCGCATCGTC GATTGGCGCA AGGAACACGG CCGGTTCAGC ACCGTTGAGG AACTAGACGC GGTGGACGGT GTTGGCCCCA AGATGCTGGA AACGCTGCTG CCCCTCGTTC GGGTGTCCTG A
|
Protein sequence | MSRRNAEAGA PAAASRARRR LATTLGPSGG GDGGGSDAGN PGSAGLLGYG AGVHGHGSFT YDGGAQPEPG AVGGSAADSS AVSGTHAPAK FRWRSGFRVA VLLGLLSLLL GGWFWWDVAA SRPHVVPLSD VSSPEVSAQQ EGHPGSGPGG SDGASTGRQP SGAASGAKII VHVAGAVNRA GVVELPEGSR VHEAIAGAGG SAEGADLNRL NLAAVLADGQ KIHVPLVGEP VDAPGAAAGA TGPGAAGSGP GDSVPGQTGA EGGKIDLNSA SAEELGALPR VGPVLAQRIV DWRKEHGRFS TVEELDAVDG VGPKMLETLL PLVRVS
|
| |