Gene Information |
Locus tag | Franean1_1688 |
Symbol | |
ID | 5670090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2019335 |
End bp | 2020273 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240606 |
Product | GntR domain-containing protein |
Protein accession | YP_001506032 |
Protein GI | 158313524 |
COG category | [K] Transcription |
COG ID | [COG2186] Transcriptional regulators |
TIGRFAM ID | |
| |
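The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; neither convention is stated in the record, but both reproduce the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_1688.
length_bp = gene_length(2019335, 2020273)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```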
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0152268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.243042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGACG GGCTATCGCC GGCCGGCCGG AATACAGCGT CCGGCGCGTT CCACGGCGCA GCACGGACTT CTCCACGCGG CGCCGCGCCC GCTGTCGGCT CCGCCCTCGG CGGGGGGCAC GCCGCGGGCC GCCGCGCCGG CGGCCATGCC GAGAACGGCG GGGCGATCGG CCTCGTCGTC GGCGAGCCGG AGGAACCGAC CTCCCGCTTC GGGCAGCGCA CCCTCGCCTC CCGACTGGCC CGCACCATCG AGCTGGAGAT CATCCGGCGC GGGTGGCCGG TCGGCGAGGG GCTCGGGTCG GAAGCCCAGC TCATGGAGCG CTACCAGGTC GGGCGGTCGG TGCTGCGCGA GGCCGTCCGG CTGTTGGAGA GCCGATGGGT GGCTCGGCCG CGGCCCGGTC CCGGCGGCGG CCTTGTGGTG ACCGCGCCCG ACCACGACGG CGTCCGCGAC GTCGCCCGGG TCTACCTCGA CTTCGCCCGC GCCCGGCCGG AGCACCTCTG CGAGACCTGG TCCGCCCTGG AGGCCGTGGC CGTCCGCAGG CTCGCCGAAC GGATCGACAC GGCCGGCCTG ACCCGGCTGC GCGGTGCACT GCAGCCGGCC ACGCCGGGCG GGCGGGCCGC GGGAGCGCGC TCGCGGACAC GGTCAGCCGG CACCCGGCCG GCGCCGGTCG CGGCGGGCTC CACCGGGGAG GCCCCGGTGC TGCTGCACCT TGAGATCGCC CGGCTGGCGG GCAATCCGGC CGCCGAGCTG TTCATCCGGG TCCTGGCGGA CCTCGCGCAT CCCCGCGAGG AGACCGACCG GCTCGGGCAG TGGTGGCAGC ACCCGCTGCA CGCCGAGATC GTCGACGCGA TCACCCGCGG CGAGGGCGCG CTCGCCGAGC ACCTGGCCCG GTCCTGCATC CGGCGGCACG TCGAGGAGGC GACCGCCCAC CGGCACTGA
|
Protein sequence | MSDGLSPAGR NTASGAFHGA ARTSPRGAAP AVGSALGGGH AAGRRAGGHA ENGGAIGLVV GEPEEPTSRF GQRTLASRLA RTIELEIIRR GWPVGEGLGS EAQLMERYQV GRSVLREAVR LLESRWVARP RPGPGGGLVV TAPDHDGVRD VARVYLDFAR ARPEHLCETW SALEAVAVRR LAERIDTAGL TRLRGALQPA TPGGRAAGAR SRTRSAGTRP APVAAGSTGE APVLLHLEIA RLAGNPAAEL FIRVLADLAH PREETDRLGQ WWQHPLHAEI VDAITRGEGA LAEHLARSCI RRHVEEATAH RH
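The GC content field can in principle be recomputed from the gene sequence. The sketch below is one minimal way to do so, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; the helper names are hypothetical. Only the first three blocks are used in the demonstration, so the printed percentage describes that 30 bp fragment, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence from the record (a fragment only).
fragment = "GTGTCTGACG GGCTATCGCC GGCCGGCCGG"
print(len(clean_sequence(fragment)))   # 30
print(round(gc_percent(fragment), 1))  # GC percentage of the fragment
```

Passing the full Gene sequence value to `gc_percent` would give the figure to compare against the GC content field.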