Gene Information |
Locus tag | Franean1_5136 |
Symbol | |
ID | 5673470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6153647 |
End bp | 6155113 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243986 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001509400 |
Protein GI | 158316892 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
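The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 6153647
end_bp = 6155113

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is one contiguous run of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

With these inputs the computed lengths agree with the listed Gene Length (1467 bp) and Protein Length (488 aa).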
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.033803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCT CGACAGGACC GTCGGCTGGT CGTGCCACGC CGAGGACTGC GCAGGCGAAC GACCACAGGC CGGCATCGGC CGGGATCACC CGCACTCCGA ACGTGCGGTT GCGTACCTGC CGGGAGGAAC GCGGCTGGTC GCAGGAACGA CTGGCCAGTG AGATCCGGCG TTTTTCCGTC ATACACGAGG GCCGCGAGGC CGGCGTGACG GGAAATATGA TCTGCAAATG GGAGAAGGGC GATAAAAAGC CCAGCCTTCG TTACCAGCGG CTCCTGCGGG CCCTGTTCGA GCGGTCGTCG GCGGAGCTCG GCTTCGTGGA CGACGACCCC AACACAGGGC TTCCGGCGCA CGCGTCCGGC GCGGACAATG CCTCCCGGGA GATCGTGCCG GCCGGCACGC TGCTCCTGCA GGCCGCCGAG GCCGACGCCG GCCTGCACAA CGCTCTCCCG GTGGAGCGGC GTGGGTTCCT GCGGCTGTTC GCGGCGGCCG GCGGTGTCGC GGTCGTCCCG CTGGGCATGG GTGGCGACGA CGCGCCCTGG GAGCGGCTGT CCGCCGCGCT GCGCCGGCGC ACCACGGTCA CCCCCGAGCT CGCGGACGAG CTGAGCCGCT GCACCGCGGG CCTGTACGGC CTCGAGGAGC GGGTTCCGGC CCGGGCCCTG TTCTCCCGGG TCACCGGGCA TCTGGGGACG CTCACGCAGC TCCTGGAGTC CAGCGGCCGC TCGCCGGTCC GCCGGGACCT CGCCTCCACG GCCGGCGAGA CCGCCGCCCT GGCCGGTTGG CTCGCCTTCG ACATGAACGA CGTCCCCTCG GCCCTGGCCT ACTACCGGGT CGCCATCGAG GCGGCGCGGG AGGCCGACGA CAGCGCGCTG TGGGCCTGCG TGCTGGGCTA CGAGAGCTAT CACAGCGCGG GCATCGGCCG TCACGACCAG GCCTGCGCGC TGCTGGCCGA GGCGCAGCGC CGTGCCGCGA CGGGCAGCAC CGTCATGACG AAGGCCTGGC TGGCCGGGCG GGAGGCCGAG GAGCAGGCGG CCCGCGGTGA GGGGCGGGCC GCGCTGGCCG CCCTCGACCG CGCCCAGGAC GCGTTCGACC GCGCCGACGA CGGCGACCGG GTCTGGACGC AGTTCTTCGA CCGCGGCCGT CTGGACGGCA TCAAGGTCAC GACCTACACC CGGCTGCGGC GCCCGGCCGC GGCCCACGCG GCGGCGACCG AGGCACTGCG CGCCACCACC CCGCACAGCG GCACCAAGAA GCGGTCCCTG CTGCTCGGTG ACATCGCCGA GGTCCACATC CAGCGCCGGG AGATCGAGGC CGCCACCCAG TACGCGACCG AGTCGCTCGC CATCGTCGCG GCGACTGACT TCTCCCTCGG GCTGACCAGG GTCCGCCGCG TCCGGGAGCA TCTGCGGCCC TGGCAGCAGA CGCAGGCCGT CCGCGACCTC GACGAGCAGC TCCGCGCGCT CACCTGA
|
Protein sequence | MQRSTGPSAG RATPRTAQAN DHRPASAGIT RTPNVRLRTC REERGWSQER LASEIRRFSV IHEGREAGVT GNMICKWEKG DKKPSLRYQR LLRALFERSS AELGFVDDDP NTGLPAHASG ADNASREIVP AGTLLLQAAE ADAGLHNALP VERRGFLRLF AAAGGVAVVP LGMGGDDAPW ERLSAALRRR TTVTPELADE LSRCTAGLYG LEERVPARAL FSRVTGHLGT LTQLLESSGR SPVRRDLAST AGETAALAGW LAFDMNDVPS ALAYYRVAIE AAREADDSAL WACVLGYESY HSAGIGRHDQ ACALLAEAQR RAATGSTVMT KAWLAGREAE EQAARGEGRA ALAALDRAQD AFDRADDGDR VWTQFFDRGR LDGIKVTTYT RLRRPAAAHA AATEALRATT PHSGTKKRSL LLGDIAEVHI QRREIEAATQ YATESLAIVA ATDFSLGLTR VRRVREHLRP WQQTQAVRDL DEQLRALT
|
| |
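The GC content and Protein sequence fields can, in principle, be recomputed from the Gene sequence field. The sketch below is one way to do that in plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon assignments are those of the standard genetic code, which translation table 11 shares for codon-to-amino-acid mapping (table 11 differs in its permitted start codons, which this sketch does not model). The example input is only the first three 10-base groups of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCAGCGCT CGACAGGACC GTCGGCTGGT"
print(translate(prefix))  # MQRSTGPSAG
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence field to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be compared against the nucleotide data.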