Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3349 |
Symbol | |
ID | 5671720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3962038 |
End bp | 3963924 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242237 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001507657 |
Protein GI | 158315149 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATCG AGGTGCTGGG CGCGGTGAGG GCACGTCGCG AGGACGGGAC CGAGATCGAC CTGCGTGGCC CCCGGCACCG TGAGGTCCTT GCCCGCCTCG TCGCCGCCGA CGGACGGATG GTCGCCACCG ACGCCCTTGT CGCCGACCTG TGGCCCGACC CGCCGGCCGG TGCCGTCGGC GCCCTCCGCA CATTCGTCGC CGCCCTGCGC CGCGCCCTCG AGCCCGAGCG CCCGCCCAGG ACGCCCCCAC GTGTGTTGGT GACCGAGGGG CCCGGTTACG CGCTGCGCCT GCCACGGGCG GACGTCGACG CCCACCGCTT CGAGGACGCG CTCGCCGCCG CCCGGCGCTC GTCCGACGCG TTCGCGCCGC TCGGCGTGGC GCTCGCCGCG TGGCGCGGAC CGGCCTACGC CGGCCTGCCC GACGCCGGGT GGGTGCGCGG CGAGCGCCGC CGACTCGAGG AGCTGCGGCT GCAGGGCGTG GAGCTGCAGG CGGGCATCCT GCTCGATCGC GGCCAAGGGG CCGATCTCGT CGCCGAGCTC GACGCGCACG TCACCGAGCA TCCGTGGCGC GAGCCGGCGT GGGGACTGCT CGCCCGCGCC CTCTATCGCG CCGGGCGCCA GGCCGACGCG CTCGCCACGC TCCGGCGCGC CCGCGCGATG CTCGTCGACC AGCTCGGGCT CGACCCGAGC CGCGCACTGC GCCGGCTTGA AGCGGACATG CTCACCCAGT CGCCCGCCCT CGAACCGCGA GTCTCCGAAT GGCCGGCCAT CACCGCCCGT CTCGGCCCAC GCACGACCGT CGACGTCGCG CGCGCCCTGG CCCTCGCGGG CGGCGACGCG CTCGTGTCCT CCCGGCGCGA CCGGCTCGCC GCCGCGCTCG CCGCGGAACG AACGGGCGAT GTCGCGCTGA CCGCCCGCGT CATCGGTGCC TACGACGTGC CCGCGATCTG GAACCGTGCC GATGACCCGG TACAGGCCCG CGCCGTCGTC GCCGCGGCCG AACGCACGCT CGCCACGCTC GGCCCCGCCG CGCCCGCCGA CCTGCGGGCC CGGCTCCTCG CCACGATCGC CGTCGAGAGC CGCGCCGAGG ACGCCGCCGG GCCGGCACGG GAGGCTGAGG CGTTGGCACG GTCTCTCGGT GATCCCGCCC TGCTCGCCTT CGCGCTCAAC GGCGTCTTTC TGCAGTCCTT CTCCCGGCCG GGACTCGCGC AGCGGCGCGA CGACATCGGA GCGGAACTCG TCGCGGTGTC CACCCGGCAC GGGCTGCCCA CCTTCGCGAT TCTCGGCCAT CTCGTACGCC TGCAGTCGGC GTCGGCCCGC GGTGATCTCG ACGCTGGCTC CCAGCACGCG GCCGCGGCCG AGCGGCTCGC CACCGAGCAC GAGTCGCCGC TCGTGCCGGT GCTCACCAGG TGGTTCCGGG CGCTCGTCAT CGCGGCCCGC AGCGCCGCGC CCGGCGGGCC GTCCGCCGCC GCGGCCGCGG CCGCCTACCG AGCCGCCGAC ACCGCGCTCG AACAGGCCGG CATGCCGGGC CTGCACCGCG GCCTGCTACC GCTCGCCCTG CTCGGGCTCC GTCTGCTGCA CGACCGCCCC GCCCCGATCG ACCCACGGCT GGACTGGGGC CCGTACACGC CGTGGGCGGC ACCGCTCGTG CTCCTCGCGC AGGACCGTCG AGAACAGGCA CGAGCCGCCC TCGCCGCGAC GCCCGAACCG CCGCACGACC ACCTCCAGGA GGCGCTCTGG TGCCTCACCG CCCACGCCGC CGCCCAGCTC GGCGAGCACG CGATCGCCGG GCGGGCCGCG GCGGCCCTGC GGCCCGCCCG CACCGAACAC GCCGGCGCCG CCAGCGGCAT GCTCACACTC GGCCCGGTGG CGCGCTACCG CGACTAG
|
Protein sequence | MRIEVLGAVR ARREDGTEID LRGPRHREVL ARLVAADGRM VATDALVADL WPDPPAGAVG ALRTFVAALR RALEPERPPR TPPRVLVTEG PGYALRLPRA DVDAHRFEDA LAAARRSSDA FAPLGVALAA WRGPAYAGLP DAGWVRGERR RLEELRLQGV ELQAGILLDR GQGADLVAEL DAHVTEHPWR EPAWGLLARA LYRAGRQADA LATLRRARAM LVDQLGLDPS RALRRLEADM LTQSPALEPR VSEWPAITAR LGPRTTVDVA RALALAGGDA LVSSRRDRLA AALAAERTGD VALTARVIGA YDVPAIWNRA DDPVQARAVV AAAERTLATL GPAAPADLRA RLLATIAVES RAEDAAGPAR EAEALARSLG DPALLAFALN GVFLQSFSRP GLAQRRDDIG AELVAVSTRH GLPTFAILGH LVRLQSASAR GDLDAGSQHA AAAERLATEH ESPLVPVLTR WFRALVIAAR SAAPGGPSAA AAAAAYRAAD TALEQAGMPG LHRGLLPLAL LGLRLLHDRP APIDPRLDWG PYTPWAAPLV LLAQDRREQA RAALAATPEP PHDHLQEALW CLTAHAAAQL GEHAIAGRAA AALRPARTEH AGAASGMLTL GPVARYRD
|
| |