Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6796 |
Symbol | |
ID | 5675109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8282488 |
End bp | 8285844 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245645 |
Product | transcriptional regulator |
Protein accession | YP_001511036 |
Protein GI | 158318528 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCATGT CATTGCGTCC TGTCCGTGCC GGTAGCCCTG AACAGCCGTC TCCCGGCAGG AATCTTTGCC TCCAGATTCT CGGTCCGTTG CGGATCTGGC GGGGCGGCGT CGAGCTGGAC GCCGGGCCCC GGCAACAGGC CTGCCTGCTC GCTCTGCTCC TCATCCGGGC GGGCCGGCCG ATCAGCACGA GCGAGCTGAT CGACCTGATC TGGGACGACG ACGTCCCGGC ATCGGCTGTC AACATCCTCC AGAAATACGT CGGCACGCTG CGGCGATTGC TGGAGCCCGC GCTGCCGGCC CGCGGAACCG GCTCGTACCT GCAGCGCCGC GGCAACGGTT ACCTGTTTTC GGCCGACCCC GGCATGCTGG ACGTCGTCAC CTTCCGGGAA ATCGCCGGAA AGGCCAGGAC ATGCCTCGCG GAGCAGCGCC TCGATGCGGC GCTCGACTGC TACGTACAGG CGCTGGCGCT CTGGCACGGC CCCGCGGGTG GCGGGCTGAC CCACGGATCG ACCGCCGTGT CGCTCTTCGC CGCGATCAAC GACGAGTTCT TCGACGCATG CGTACCGGCG GCCGAGCTCG CGGTGACGCT GGGCCAACCC GAGCTCGTGC TCCAGCCCCT GCGCCTGGCC GCCTGGATGG CGCCGTTGCA CGAAATCGTG CAGGCAAGCC TTGTCTTCAC CCTGGCCGCA GCCGGTCAGC AGGCTGAGGC GCTGTCAGTG TTCGGGACGG TCCGCGCCCG GCTCGCCGAG GAGCTCGGCA TCGATCCTGG GCCCGCGCTG CGGGCCGCGC ACCTGCGGGT CCTTGGCCAT CCACCGACCT CGGCGGCCTC GGCCGGCGCG GACGACGACG GGCCGGCGGC GACGGCGGGC GCGCCGGCGG GCGGGCACGC AGGCGGGCCG CTGCCAGGGC AACCACCGAC CGAGCTGCCC ACGCTCGTCG GCAGCGCCGG GAAGCCGTCC GCCGAGCAGC CGCGGGCTCC TTCCCTCGAG ACACCTACCG CCGACGGCAT GATCGGCCGA GCCGAGGAGC TCGCGGTGCT GCGGCACGCG GTGGACTCGG TGTTTGCCGG CGGCACCGGG CTCGTCGTCG TCGAGGGCGA GCCGGGGGTG GGCAAGACGC GCCTGCTGGA GGAGGCCGGC GCGGAGGCGG ACAGGCGCGG CGCGCTCGTC GTCTGGGGCC GCTGCCTGGA AGGCGACGGG ACGCCGTCGA TGTGGCCGTG GGTGCAGGCG GTCGGCACGG TCCTCGACAG CCTGCCCGCC GCGGCGCGGG AGGAGTGGCA CGCCGGCGAA CTCGGCCGCC TCGTGGAGCC GCGCGGCGGC GTTCCCGCCA CGCCGGTGCT GCCGGACAGC GGCGCCCAGT TCCGCCTGTT CGAACGGGTT GTCGCCCTCG TCGGCCAGGT CTCGGCGCGG CGGCCGGTGG TGCTCGTGAT CGACGATCTC CAGTGGGCGG ACGTCGCCTC GCTGCAGATG TTCAGTCACC TGGCGGCACG CCCGCCGGGC GGCGCTGTGA TCACCGGCGC GCTCCGCGAC CGTGTGCCCG TGCCCGGCTC GGAGCTGGCG CGGATGCTCG CCGCCGCGAG CCGGCTGCCC CGGCACCGCC GGATCCGGCT CGGCCCGCTC GACGCGGCCG AGGTGGCCGA GCTCGTCCGC CGCGAGACCG GTCAGACCCC CGACGTCGGT GTTACCCAAA GCATCCATGT CCGCACTGCC GGCAACCCCT TCTTCGTGCG GGAGCTGTCC CGGCTCCTCG CCGACAGTGG GGTTCTCACT GAGGATGCCG CGGCGCGGGC CGGTGTGCCG TCCACCGTGC GGGATGTCGT CCGCGACCGG ATGGCGGGCC TTGACGACGA CACCGGCGGC CTGTTGCAGA TCGCCGCGCT CATCGGCCGG GAGATCGACC TCGGCCTGCT CGCACGCGCG GCCGACCTTG ACGCGCAGAC CTGCAGCGAG CGCCTCGAGC CCCTGGAGGC GCTCGGCCTG CTCGAGCCCA CGCCCGGGGA CCCGTTCTCG TTCCGCTTCG CGCACGACCT GGTCCGGGAG TCGGTCGTCC GGACGACGCC GCCGCTGCGC GCGACCCGGC TGCACCTGCG CGTCGCCGAC GCGCTGGAGC GCTCCGACAC CGGCGGCGAG TCCGTAGCCG AGCGCCTCGC CCACCACCTG TGGGCCGCCG GCCCACTCGC GGACCCGGCC CGGACCTCGA GCGCGCTGGA GCGCGCCGGG CGCCGCGCCG CGGCCAAGTC CGCGTTCGAG GCCGCCGCAC GGCAGCTGGT GTCGGCCGCG CAGGTGGCGA GGACAGCGAG CATGTCGGAG CGGGAGCTGT CCGCCCTGTC GCAGCTCACC GCGGTCGTCG GGATGCGGTC CGGGTACGTC GGCTCCGCGG TCGACCTGCT GGAGCGGGCC GAGGAGCTGG CTCGTGACCT CGGCCGGGAA CGGGAGGCCG CGGACTTCCT CTTCTCCCGC TGGGCCGCGT GCTCCCAGGG CATCCAGCTC GACCGCGCCG GCCGGCTGGC GCGCCGGCTG CTCGATCAGG GCGAGGCGTC CGCCGACCCG GTCGTGCGTG CCTACGGCCG GCACGCCTGG GGCATCCACC AGTGGGACGT CGGTAACATC GGCGAGGCGT TCCGGTATCT GAGCCAGTCC AACTCGATCA TGTTCGATGG CCTGGCCCAG CGCGAGGATG ATCCGCTCCG GCACGACCTG CAGCTGCTCT CGCCCGTGAT GCTCGCGTTG AACACCGCGC TGCACGGTGA CGTCGACGGG GCGCGGGCGC TGCTCGACAG GCTGGAGGCC GCCGCGCGTG GCGACTCCTA CGCGATCACG GTCTGGGCCG CCTTCGCCGT CACGGTAGCG GCGCTGGCCG GCGACCCCGC CTGGGCACTA CGCGCAGCGG AACGGGGAAT CGCGGTGGAC CCGGAACATT CCTTCGTCTT CCTCGGCAGC TACCAGCGAC TGGCCCGGTG CTGGGCGCGG GCCGTGACCG GCGAAGACCC GGCCGGCGCC GCGACAGAGG CCAAAATGAT CATCGCGGCG ACCCTGCTCG ATCCGCCACG CTCGGGCCTG GCCACCTGGT ACGGACTGCT CGCCGAGATG TGGCTGGCGG CCGGGATGCC GGCCGAGGCC ACCGCCACCC TCGACCGGGC CGACTCGTTC CTCGACACCT ACGGCCAGCG CTACCCCGAA GGCCTGATAC TCCTGCTGCG GGCACGGATG ATGCAGGCAC GCGGCGAGCC CACCGCCGCC GTCCAGGCCG CCGTCGAGCG GGCTCGCGCG CTGTCCGTCG AGCGCGAGGC TCACCTGTTC GCCCACCGCG CCGAGGAATT GTCGGCCGGG CTGGCGACGG AGCCGGCCGG CCACTGA
|
Protein sequence | MAMSLRPVRA GSPEQPSPGR NLCLQILGPL RIWRGGVELD AGPRQQACLL ALLLIRAGRP ISTSELIDLI WDDDVPASAV NILQKYVGTL RRLLEPALPA RGTGSYLQRR GNGYLFSADP GMLDVVTFRE IAGKARTCLA EQRLDAALDC YVQALALWHG PAGGGLTHGS TAVSLFAAIN DEFFDACVPA AELAVTLGQP ELVLQPLRLA AWMAPLHEIV QASLVFTLAA AGQQAEALSV FGTVRARLAE ELGIDPGPAL RAAHLRVLGH PPTSAASAGA DDDGPAATAG APAGGHAGGP LPGQPPTELP TLVGSAGKPS AEQPRAPSLE TPTADGMIGR AEELAVLRHA VDSVFAGGTG LVVVEGEPGV GKTRLLEEAG AEADRRGALV VWGRCLEGDG TPSMWPWVQA VGTVLDSLPA AAREEWHAGE LGRLVEPRGG VPATPVLPDS GAQFRLFERV VALVGQVSAR RPVVLVIDDL QWADVASLQM FSHLAARPPG GAVITGALRD RVPVPGSELA RMLAAASRLP RHRRIRLGPL DAAEVAELVR RETGQTPDVG VTQSIHVRTA GNPFFVRELS RLLADSGVLT EDAAARAGVP STVRDVVRDR MAGLDDDTGG LLQIAALIGR EIDLGLLARA ADLDAQTCSE RLEPLEALGL LEPTPGDPFS FRFAHDLVRE SVVRTTPPLR ATRLHLRVAD ALERSDTGGE SVAERLAHHL WAAGPLADPA RTSSALERAG RRAAAKSAFE AAARQLVSAA QVARTASMSE RELSALSQLT AVVGMRSGYV GSAVDLLERA EELARDLGRE REAADFLFSR WAACSQGIQL DRAGRLARRL LDQGEASADP VVRAYGRHAW GIHQWDVGNI GEAFRYLSQS NSIMFDGLAQ REDDPLRHDL QLLSPVMLAL NTALHGDVDG ARALLDRLEA AARGDSYAIT VWAAFAVTVA ALAGDPAWAL RAAERGIAVD PEHSFVFLGS YQRLARCWAR AVTGEDPAGA ATEAKMIIAA TLLDPPRSGL ATWYGLLAEM WLAAGMPAEA TATLDRADSF LDTYGQRYPE GLILLLRARM MQARGEPTAA VQAAVERARA LSVEREAHLF AHRAEELSAG LATEPAGH
|
| |