Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1367 |
Symbol | |
ID | 5669776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1646827 |
End bp | 1649718 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240294 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001505721 |
Protein GI | 158313213 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGGTC ACCGGCCGAC CGGTCGCCAC GATCTCATCC CACCGCGGTG CACCCGTCGA GCCTTGCGGG CCGGCCCCGC GTGGTTGGCG GTACGTGTCG TCGGCCGAGC GGTGCTCGCG CTCGCCGCGC TGACAGCGTT CTCGATGATC CCGCTGGTGC TGTGGATGTG GCGGGACGTC CCGCTGCCCT CCACCTGGGA ACCAGCCCGG TGGCTGACCC TCGCCCGCCG CGGCTACCTG CACCCCGACA TCGTGCCCAA CACGGTCGCG GTCCTGATCT GGATCGCCTG GGGGCAGGTC GCGCTCGCCC TCCTCCGCGA GATCGCCGCC CAGTTCCAGC GCGGTGCGTC CGCCACCCGT ACCGTGCTGC TTCCCGGTGT GGTCCAGCAC CTCGCGGGCA GCTGGATCGC CTCCCTGTCG ATCCTGTTCA GCCTCCTGGC TGGACGTTCC GCAATGGTCG CCGTGTCCCT CGACCTACCG GCCTCCCACC CGGCCGCGGG GAGCGTTCAG ACCATCGCGG CGAACGCCGC TGAGCCCATG GGGGGACCGG CGCCCAGCAC CTCGGCAGGC AACGCGCTAC CGGCGAACGG GATGCCGACC GAAGGAACCG ATCCAGGCGG GACCACCTGG CGTCGGCACG TCGTGGCACC CGGCGAGACT CTGTGGGGCA TCGTCGGCGG CGAGTACGAC GACCTGGAAC CAGACGCCTT CCCGCCGGCC GTCGCGGAGG TCTTCGAGAC CAACCGCGGC ACCACCGACT CGCTCGGCCG CGCCCTGGAT CGGCCCGACC TGATCAACCC GGGTATGGAA CTGCGGCTAC CGCAGCTTTC CACGGGGCAG CTGCCTGGCG GGACCTATAC GGCCCCGCAG CCCATCTCAC CGCCCGCTCC TGCCCCTGCG CCTGCTCCTC CTGCACCCAC TCCCGCGCCT GCCAGCCCGG CGAGCCCAGC GAGCCCGAGG GCGTCAGCAT CGCCGCCCGT CACTGTGCCC CACGACGTTC CCGCTGTTCC TCCCGGGGAC GAACCAGAGG TGCCCCTCAG CACGTGGATC GGTGGCGCCG GTCTCCTCGC CACCACCCTC GTCGGCCTGT GGGCCGCGCG TCGACGCCGC CGCGACAGCA CCGTCACCCC GGCGCAGGAG ATCCCCGACC CGGATCCGGC CACAGCCGCC CTGCACGCGG CTCTGCTGGA CGCGGACGAC CCGGATCTCC TCGACCGACT CGACGCCGCC CTACGCAGCA TCGGCGCGGC TCACCGTGAT CAACCCGACG GGCCGAGCCC CCAGATCCTG CTCGTCCAAC CGGACCAGAG CATCGAGGTT CTCCTTCATC CCGACGGCGC ACAGACCCTG CCACCGCCAT GGGATGCCGG GCCCGATCCG CGGATCTGGA CCCTGCCCGC CACCGCGGCG CTCACTGCCG CTGCGGACAT CCCGCCACCT TGCCCGGCGC TGGTACAACT CGGCACGACC GCCGCGGGCG CGGCCCTGTA CGCCGACCTC GAAGCCCTCG GCACCCTCGG CATCGCCACG AGCCCGACAG ACCCCAACCG GCTGGCCGAC TTCTGCCGGG CGATCCTTGC CGCGATCGTC GCCTCCCCTT GGGCCGATCT CACCACCGTG CGCACCGTCG GCCTCGACCC GCATGCCTTC GCCGCAGAGG AACGAGTCCA GGCAGCCAAG GACGTCACCG AGCTGACGGA CGACGCTCGC GCCGAGGCCG CCGCCATCGA CACCGTCCTG CGGGAGCGGG GCTACCCCAG CACGCTCAAC GCCCGGATCG CCGAGCCGGG CGAGGAATTC GACCCGACCA TCGCCCTCCT CGCCACTCAC CTGGACACCG ACGCCGGCCG TACCGAAGCG GCCGACCTTG CCGCCGCGGC AGGCCATGGC CGGCGGGGGC TGGCCGTCGT CCTACCGGCC CAGCCCGACA TCCTTACCGC GTGGATGCTG CGCCCCGACC CGGCCGGCCG CGGATCGCGC CTGGACCCCC TCGGCGTGCT CCTGACCCCG GTCGGGCTCA CCGACACCGA CGAAGCCGCC GTCGCCGCCT TCATCGCGGA CGCGGAGGCA CCTCCCATCG ACCTGCCTCC ACCAGATCCG CCCGCCGAGA CTCCTGCGGT GTTCGTCTCG CCGCCCTGGG AGGTGATGGT GCGCCTGCTC GGCCCCATCG ACATCACCAC CCGCGACGGC CGCACCCCGC CGCGGGCCGA GACCCGCGAA CGCACCAACG AGGTCCTCGC CTGGCTGGTC ACCCACCGGC ACGGCACCCG CACCGATCTG GAATCCGCGC TCTGGCCGCG GGGAGCAAAA TCCAAAACCC TGGCGAACGT GCTTGCCCGG GCACGGCGCC TGCTCATCAA CCTCGTCGGT GAGGATGCGA AGAACTGGGT GCCCCGATAT GACCGCGGCC CCCTCACCCT CGACCCCCGG GTGGTCTCCG ACCTGGACAT TCTCCAAGCC CTCCTGCGCC ACGCCATCGA CCAGCGTGAT CACCACGAGA CGGCGATCGC CACGCTGCAG GAGGCCCTCA AGCTGGTCAG AGGTGTGCCC GTCGGCTATC CGTGGCTGGA TGCCCAGATG GGCTCGATCC TGACCACCGC CCCAGTCAAC GCCGCGATCC TGCTCGCCGA ACACCAGCTC GCCGCCGGCG ACACCGCCGG CGTACTGGCG ACCACCGCCC GAGGCATGGA AATCCTCCCG GCCCACACCA CCCTGTTCGC GCTGCGAATG CGCGCCCACG CCGCCGCCGG CGACCCCGAC GCGGTCAAAG CCGAATACCG CTCCTACCTC CGCGCCGAAA AGGCCGAACC CCTCTGGGAC GGCGAAACCG ACCGCGACCT CGAAGCCCTC CACTACAAGC TCACCCGCCG CCGCACAGCC GGATCCGGCT GA
|
Protein sequence | MRGHRPTGRH DLIPPRCTRR ALRAGPAWLA VRVVGRAVLA LAALTAFSMI PLVLWMWRDV PLPSTWEPAR WLTLARRGYL HPDIVPNTVA VLIWIAWGQV ALALLREIAA QFQRGASATR TVLLPGVVQH LAGSWIASLS ILFSLLAGRS AMVAVSLDLP ASHPAAGSVQ TIAANAAEPM GGPAPSTSAG NALPANGMPT EGTDPGGTTW RRHVVAPGET LWGIVGGEYD DLEPDAFPPA VAEVFETNRG TTDSLGRALD RPDLINPGME LRLPQLSTGQ LPGGTYTAPQ PISPPAPAPA PAPPAPTPAP ASPASPASPR ASASPPVTVP HDVPAVPPGD EPEVPLSTWI GGAGLLATTL VGLWAARRRR RDSTVTPAQE IPDPDPATAA LHAALLDADD PDLLDRLDAA LRSIGAAHRD QPDGPSPQIL LVQPDQSIEV LLHPDGAQTL PPPWDAGPDP RIWTLPATAA LTAAADIPPP CPALVQLGTT AAGAALYADL EALGTLGIAT SPTDPNRLAD FCRAILAAIV ASPWADLTTV RTVGLDPHAF AAEERVQAAK DVTELTDDAR AEAAAIDTVL RERGYPSTLN ARIAEPGEEF DPTIALLATH LDTDAGRTEA ADLAAAAGHG RRGLAVVLPA QPDILTAWML RPDPAGRGSR LDPLGVLLTP VGLTDTDEAA VAAFIADAEA PPIDLPPPDP PAETPAVFVS PPWEVMVRLL GPIDITTRDG RTPPRAETRE RTNEVLAWLV THRHGTRTDL ESALWPRGAK SKTLANVLAR ARRLLINLVG EDAKNWVPRY DRGPLTLDPR VVSDLDILQA LLRHAIDQRD HHETAIATLQ EALKLVRGVP VGYPWLDAQM GSILTTAPVN AAILLAEHQL AAGDTAGVLA TTARGMEILP AHTTLFALRM RAHAAAGDPD AVKAEYRSYL RAEKAEPLWD GETDRDLEAL HYKLTRRRTA GSG
|
| |