Gene Information |
Locus tag | Francci3_1257 |
Symbol | |
ID | 3906103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1500564 |
End bp | 1501832 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637878591 |
Product | helix-hairpin-helix DNA-binding, class 1 |
Protein accession | YP_480364 |
Protein GI | 86739964 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
|
|
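The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not translated; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1500564
END_BP = 1501832


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

Under these assumptions both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields listed in the record.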
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0994426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAAT CCACCCCTGG TCCCGGGTAC CCGCTGTCGG CCCCGTCGCC GCGTTTCCCC GTCCACGCAT TTCCGCCTGA AACGTCCGTT TCGTCGCGAT GGCCATCGCT GGACCTGCTG CCGTCGGACG ACGCCACCGT GCCGCTGCGT CCGCCGGACC CCCCAGCGGA CGGCTCAGCT GGTGGATCCC GCCGCGTTCG TGACGCCACG TTCACCGAGG ACGAGCAGGA GATCGGTGGC GACGAGTCCG ACCTGTTCGG GATTCGCGAG GGCGTGTTCC TTGATGAGGA CCGGGCCGCC GGCGGATGGG TCGCGGACGA GCTCGCGGCG GAAGGGGCCG ATCTTGGCCG GCTGGTCGCC GGCCGAACGG ATCGGCGCCG TCCGGTCCAC TCCCGTCTGG TCCACCCCCG TGCGGGGCGC GAGATGGACG ATGGCGAGGA CGAGGGCGGC CGGGGACCTG ATCGCGGGCC ATCGGCCGGC CGGCCGGCGC CGGGAGTTCC ACGGGGCGAT CTGGTCGGCG TCATCCGCAG GCGGCTTCCG GCTACGCTGC ATGGCATCGT GGTGGCGCCG GCCGCCCGGG CCGCCCTCGT TCTCGCGTTG GTCGCCCTCG CCGCCGCGCT CATGACCGCC TGGTTCTCCT GGCAGCACCG CCCGGTGCCG CTCGCCTCCT CCGAGGTCGA TACCCCCCGG ACGGGGGAGT CGGCGGCGGC TGGCAACGCC CGGTCGGACC ACTCGGACCA CGGGGCCACC TCCGCCACGG ATACGAGCGT GACTTCGGCA CCGACGGCCA GGGCCGCCCG CACCGGGGCG AGCGGCGAGG TGGTGGTCGA CGTCGCCGGC CGGGTGGCCC GGCCGGGGGT CGTCCGGCTC CCGGCGGGCG CGCGCGTGGT GGATGCGATC GAACGCGCCG GGGGAGTGCT CCCGGGCACC GACACGACCG GCCTCGCCCT GGCCCGGTTG CTGGTCGACG GAGAACAGGT CCTCGTCGAC GGCAAGCCCG GCCCGGCGCG GCCCGGAACG GCGGCCGGGC AACCTGCGGG TACTGGCCTC GGGTCCGCCG GTTCCACCGC GGCGACCGGT CCGATCGACC TGAATGCCGC GACCGCCGAG GAGCTTGACG GTCTGCCCGG GGTGGGGCCT GTGCTGGCTC GCCGCATCGT CGAGTGGCGC ACGGCGCACG GGCCCTTCCG GTCGCCCGAG CAGCTGGCGG AGGTGACCGG AGTCGGCGAC AAGCGGCTGG CCGATCTGCT GCCATTGCTG AAGGTTTGA
|
Protein sequence | MVESTPGPGY PLSAPSPRFP VHAFPPETSV SSRWPSLDLL PSDDATVPLR PPDPPADGSA GGSRRVRDAT FTEDEQEIGG DESDLFGIRE GVFLDEDRAA GGWVADELAA EGADLGRLVA GRTDRRRPVH SRLVHPRAGR EMDDGEDEGG RGPDRGPSAG RPAPGVPRGD LVGVIRRRLP ATLHGIVVAP AARAALVLAL VALAAALMTA WFSWQHRPVP LASSEVDTPR TGESAAAGNA RSDHSDHGAT SATDTSVTSA PTARAARTGA SGEVVVDVAG RVARPGVVRL PAGARVVDAI ERAGGVLPGT DTTGLALARL LVDGEQVLVD GKPGPARPGT AAGQPAGTGL GSAGSTAATG PIDLNAATAE ELDGLPGVGP VLARRIVEWR TAHGPFRSPE QLAEVTGVGD KRLADLLPLL KV
|
| |
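The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC fraction calculation and a basic coding-sequence sanity check. The helper names are hypothetical, the stop codons are those of translation table 11, and the check accepts only ATG as a start codon, which is a simplification because table 11 also permits alternative starts.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, with an ATG start and a stop codon.

    Simplification: alternative start codons such as GTG or TTG are rejected.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input for illustration only; it is not the gene sequence from this record.
toy = clean_sequence("ATGGTCGAAT CCTGA")
print(len(toy))
print(looks_like_complete_cds(toy))
print(round(gc_fraction(toy) * 100))
```

To apply it to this record, paste the full Gene sequence value into `clean_sequence` and compare the rounded percentage with the GC content field.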