Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2102 |
Symbol | |
ID | 5670502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2526305 |
End bp | 2527264 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641241023 |
Product | helix-hairpin-helix DNA-binding motif-containing protein |
Protein accession | YP_001506444 |
Protein GI | 158313936 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGAA GCTCCTCGCA TCTGTCCCGG GGTGATGACC GGGGTGACGA CTGGCCGCTG GCCGGCCTGG ACGGCGGCGA CCCGGACGGC GGCGACCCGG ATGGCTGGGA TCAGGACGGC GGGGCCTGGG ACGGCGGGGA CTGGGGTGAC GACGGGGTAC GGGACGAGGG GCGGGCGGAC ACTCGCGGCT CGCGGGTGCG GCGGGCCGTC GCCGCTCGCC TCCCGCCGAC GCTGCGCGAC GCGGTGCTCG CCCCGACGGC CCGGGCGGCG CTGGTGCTGG TGTCGGTCGC CGTCGCGGCG GCGGCCGTGG CCGGCTGGCT GACCTGGCGC GACCGGCCGG TGGCCCTCCA GCCGAGCGCG GCGGCGGTCG GTGATTCGGG GCCCGGTGAG TCCGCGCGCG GGCCAGGCGG GTCCGCGACC GCGGCCGCGG AACTCGCCCG GAACCCGGCG GAGCCGGCGT CGGGCGAGGC GGCGCCGACC GCGGCGCCGG AGGTCGTCGT GGACGTCGCC GGCCGTGTGG CGCGCCCCGG GGTCGTGCGC CTGCCGGCCG GATCCCGGGT CGTGGACGCC GTGGAGCGGG CCGGCGGTGT GCTGCCCGGC ACCGACACGA CGGGGCTGGC ACTGGCCAGG GTCCTCACGG ACGGTGAGCA GGTCCTCGTC GACGGGCGGC CGGGTCCCGC CCCGCCGCCT CCGGAGGCGG CCGGTGGCGG CTCGTCCTCC GGGGCGTCAG CCGGTTCGGC CGGCGGTTCC GCGGTTGGTT CGGCGGGCCG GCCGCTGAAT CTGAACACCG CCACGGTGGA ACAGCTCGAC GCCCTGCCCG GGGTGGGTCC CGTGCTGGCC CAGCGGATCA TCGACTGGCG GGCGGCGAAC GGGCCCTTCA CCTCGCCCGA CCAGCTCGGT GAGGTCTCCG GTGTCGGTGA CCGGCGGCTG GCGGACCTGC TGCCCCTGGT GACGGCCTGA
|
Protein sequence | MTRSSSHLSR GDDRGDDWPL AGLDGGDPDG GDPDGWDQDG GAWDGGDWGD DGVRDEGRAD TRGSRVRRAV AARLPPTLRD AVLAPTARAA LVLVSVAVAA AAVAGWLTWR DRPVALQPSA AAVGDSGPGE SARGPGGSAT AAAELARNPA EPASGEAAPT AAPEVVVDVA GRVARPGVVR LPAGSRVVDA VERAGGVLPG TDTTGLALAR VLTDGEQVLV DGRPGPAPPP PEAAGGGSSS GASAGSAGGS AVGSAGRPLN LNTATVEQLD ALPGVGPVLA QRIIDWRAAN GPFTSPDQLG EVSGVGDRRL ADLLPLVTA
|
| |