Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6569 |
Symbol | |
ID | 5674884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7990714 |
End bp | 7993797 |
Gene Length | 3084 bp |
Protein Length | 1027 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641245420 |
Product | transcriptional regulator |
Protein accession | YP_001510812 |
Protein GI | 158318304 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCCAT GGGACTCCTA TGATCGGTTC GTGCGCGTGC GCCTGCTGGG CCCGGTGGAC GTGGTGGACG CCGCCGGGCT GCCGATCGCC ATCGGCAGCC CCACACAGCG GCTGCTGCTG GCCATCCTCG GATCGCGCGT GGGGGACGTG GTGCCTCCCG CGCGTCTGGT CGATGCCGTG TGGGGGGAGT CCCCGCCGCC GTCGGCCGAG GCGACGCTGC GCTCCTACAT CTCGAGGCTG CGCCGGGTCC TCGGGGACGC GCTGCCGACC CATCCCGGCG GGTGGTCGCT GCGGCTCGCC CCGGAACAGG TCGACATCGC CGTCTTCGAG CGTCTGCTGC GGCTGTCCGG GCAGGTCCAG GACGCGTCCG CGCGGCTCGC CGCGTTGGAC GACGCGCTCG CCCTCTGGCG GGGCCCGGCC TTCGGCGAGC TGACCGACGA CCCGGCGCTG CGACCGGTGG CGGTGCGCCT CGGCGAGGCT CGCGGCGCGG CGCGCGAGTC CCGGGCCGCG CTGCTGCTGG CGGTCGGGCG CCCGGCCGAG GCGGTCGGTG CCGCGGAGGA ACTGCTCGCC GAGTACCCGT GGCGGGAGGG CGCGTGGGGG ACCCTCCTCG GGGCGCTGAG CGGTTCCGGG CGGGCTGTCG AGGCCGCGGA CGCCTACCGC CGGGCGCACG CGGCCCTGGC GGAGGTGGGC CTCGAACCGG GCCCGGCGCT GCGCGCGGCC CAGGCCGCGG CACTGTCCAC GGCAGCCCCC ACCGCCGCCC CCACGACAGT TCCCACGGCA GCCCCCACCG TCGTCCCCAC AGCGGCCCCC GTGACGGCTC CCGTCCCTGA AAGGGCTGGG CCGGCTGGGG TGGCGGGCTC GGGACGGCGC CGTCGGCCTC GCCGACCCGT CACGTCCCTG CTGGGCCGGG AGGCTGACAC GGCCGCGGTC TGCGACCTGC TCGCGTCGGC CCGGCTGGTC ACCATGTTCG GGCCGGGCGG GGTCGGCAAG ACGCGGCTGG CGCTGCACCT CGCCGACCGT CTCGCGGACA GGTTCCCGCA CGGCGTCCTG ATGGTGGAGC TGGCGACCGT CGCCGACCCG ACCGCCGTCC CCGCGTTCGT CGCGGACGCG CTGGGCATGC GCGGCGCCGA CGGCGACGCC GGTGCCGCCC TCGAAGCCGC CGCCGAGCTT GACGCGCTCG TGATCCTGGA CAACTGCGAG CACGTGGTGG CCGCGGCCGC CGCCGTCGCA TCGGCTCTGG TCGAGAACGG TCGCGGTGTG CGCGTGCTGG CGACGAGCCG CGAGCTGCTC GGCGTGGACG GCGAGCACTG CTGGCCGGTA CGCCCGCTGC GCGTCGACGG CCCGGACGCA CCCGCGCACG CCCTGTTCTG GGACCGCGTC CGAGCCGCCT GCCCGGCGCC GGTTCCGGGG GCCGGACCGG CCTCGGACGG AGACCTCGAG GCGGCCGACC GCATCGTCCG GAGGCTGGAC GGCCTGCCGC TCGGCATCGA GATGGCGGCC GCGCGCAGTG GCACGATGTC CCTGCCCGAG CTCGCCGACC GGCTCGAGGA CCACCTCGGC CTGCTCCGGG ACGTCCGCCG GGTGGGAACG CCGCGCCACC GCACGCTCGC CGACGTGGTC GGCTGGTCGG TCTCGCTGCT GGACGCGCCG CACCGCGCGA TGCTGCGGAC CATGGGCGCG TTCGCGGGCC CGGTGAGTCC CGCGGACGTC GCCGCGCTGG CGGGCCTGCC CGAACCGGAG GCGCTGGATC TGCTCGACGC CCTGGTGGCC CGGTCGCTGG TGGTCGTCGA TCCGTCCCGG GTACCGGCCC GCTACTCGCT CCTGGAGATC ATCCGCGAGT ACGCGCGACG GCAGCTCGGC GCGTCCGACG CCGTGCTGCG TCAGGAACAC GCCCGGTACA TCCTCGCCGA GGTGCGGGCC GCGGACGCCG TCCTGCGCAC CCCGCGGGAA GCGGCGGGGC ACAGCCGGAT CACCGACCTG ATGGTCGAGG TCCGCGCTGC CCAGACATGG GCGCTGAGCG AAGATCCCCC GCTGGCCGTG GAGATCGCGG CCGGCATGCA CGTGTTCGCG ATGACGCGGC TGCGGGTCGA GCCGTTGCAC GCCGCGGTGG AACTCGTCCG CTCGCTCGGG CTGCACCCGC CGCCGGGCGG CGAGCCCCAC CGGCTGGCGG ACCTGGTGTC GGATCTCGCC GCGGGCCTGC CACCCGGCGT GACGGCGGGG GCCGCGGCGA CCACCCTGGC CACGGCGGCA TGCTGGTACG TCAGCGCCGG TGAGCTCGAG CTCGCCGTGG CCTGTGCACA GCGCGGACGG CTGATCGCCG GTGACGCGCC CGAACGCCGG TTCCCGCTCG AACTGCTCAG CGACACCGCC TCCTACCAGG GCCGGATCGG CGAGGCCGTC GACCACGGCT GGGAGCTCGT CGCCGCGGCG CGGTCCTGCG CCGACCCGCA CGCCGAGGTG GTCGGGCTGC TCAACGTGGC GATCGCGCAC GCCTACGCGG GGCACGCCGA GGACGGCCGG GCCGCCCTCA GCCAGGCCCC CGCGGGGCCG CTCGCGCCGT CCGAGCTCGG CTGGCTGGCC TACGGCGAGG CCGAGCTGAT CCTTGACCGC GACCCGGACC GCTGCCTGCG GCTGCTCGAC CGCGCGGTCG CCCTGGCGGA TTCGGTCGAC AACCCCTACC TGGGCGGGGT GGCACGGGTG TCCGCGGTGT CCGTCCGGGC CCGCTGCGGT GACCCGCAGC AGGCCGTCGA GGCTTTCGCG CAGGTGCTGC GGCACTGGCG TGACCAGTAT GCGCTGACCC ACCTACTGAC CACGTTGCGT AACCTCGTCG TGCTCTTCCA ACGCCTCGGC CGGCCGCGGC CGGCGGCCCG GCTGCTCGGC GCCGTCACCT CGCAGGCGGT CAAGCCGAGC TACGGCGCGG AGGCCGCGAT GCTCGCGGGA GCCGACAGCT GGGTGGACGA CGCCCTCGGC TTCGCCGCCG CCACCGCCGA ACGCGCCGCC GGAGCCACCC GCACCGTCAT CGCCGCCACC GAGACAGCCC TCGACGACCT GGCTGACATC ACCGCGGAGA TCCGCGGCGG CTGA
|
Protein sequence | MDPWDSYDRF VRVRLLGPVD VVDAAGLPIA IGSPTQRLLL AILGSRVGDV VPPARLVDAV WGESPPPSAE ATLRSYISRL RRVLGDALPT HPGGWSLRLA PEQVDIAVFE RLLRLSGQVQ DASARLAALD DALALWRGPA FGELTDDPAL RPVAVRLGEA RGAARESRAA LLLAVGRPAE AVGAAEELLA EYPWREGAWG TLLGALSGSG RAVEAADAYR RAHAALAEVG LEPGPALRAA QAAALSTAAP TAAPTTVPTA APTVVPTAAP VTAPVPERAG PAGVAGSGRR RRPRRPVTSL LGREADTAAV CDLLASARLV TMFGPGGVGK TRLALHLADR LADRFPHGVL MVELATVADP TAVPAFVADA LGMRGADGDA GAALEAAAEL DALVILDNCE HVVAAAAAVA SALVENGRGV RVLATSRELL GVDGEHCWPV RPLRVDGPDA PAHALFWDRV RAACPAPVPG AGPASDGDLE AADRIVRRLD GLPLGIEMAA ARSGTMSLPE LADRLEDHLG LLRDVRRVGT PRHRTLADVV GWSVSLLDAP HRAMLRTMGA FAGPVSPADV AALAGLPEPE ALDLLDALVA RSLVVVDPSR VPARYSLLEI IREYARRQLG ASDAVLRQEH ARYILAEVRA ADAVLRTPRE AAGHSRITDL MVEVRAAQTW ALSEDPPLAV EIAAGMHVFA MTRLRVEPLH AAVELVRSLG LHPPPGGEPH RLADLVSDLA AGLPPGVTAG AAATTLATAA CWYVSAGELE LAVACAQRGR LIAGDAPERR FPLELLSDTA SYQGRIGEAV DHGWELVAAA RSCADPHAEV VGLLNVAIAH AYAGHAEDGR AALSQAPAGP LAPSELGWLA YGEAELILDR DPDRCLRLLD RAVALADSVD NPYLGGVARV SAVSVRARCG DPQQAVEAFA QVLRHWRDQY ALTHLLTTLR NLVVLFQRLG RPRPAARLLG AVTSQAVKPS YGAEAAMLAG ADSWVDDALG FAAATAERAA GATRTVIAAT ETALDDLADI TAEIRGG
|
| |