Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3682 |
Symbol | |
ID | 5672048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4359147 |
End bp | 4360769 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242565 |
Product | TetR family transcriptional regulator |
Protein accession | YP_001507985 |
Protein GI | 158315477 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0191871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCCA GGGCGGGTGG CGAGCGTTCG CGGGCCGGGC GCCCGACGGC GACGGAGGCA GCCAGGCTGA CCGAGCGGCT CCGGCGGGCC GCCGTCGACA CGTTCCTGGA GTCGGGTTAC GACGGCACCA CGATGGAGGC CGTCGCGCAG GCCGCGGGGA TCACCAAGAG CACCTTGTAT GCCCGCTATC CCGACAAGCG AACTCTGTTC ATCGCGGTGA GCTCGTGGGC GCTCACCCGC CAGGAGCGCG ACGAGCGCGT CCTCGAACCG CTGCCCGACG ACCTCGCCGA GAGCCTGACC GTGATCGCGC GGGCGATCCT GGCCCGCTCG GTCGACCCCG ACATCGTCCG CCTGAGCCGG ATGGCGATCG CCGAGTCGGC GCGCTTCCCC GAGTTCGCGG CGAGCTCGCA GGCCGTCACC TGGTCACCGC GGGTGCAACT GATCATTGAT CTGCTACGCA GGCACGAGAG CGCGGGCACC GTGGTCGTCG GGGACGTCGA CCTCGCCGCC GAGCAGTTCT TCGCCATGGT CGGCGCGATG CCGGCGTGGC TGGCCGCCTA CGGCATCTAC CGGACCCCCG AGGTCGAGGA GGAGCACCTC CACCACGCGG TGAGCCTCTT CCTCAACGGC GTCCTGGCCA GGCCGGAGAC CATGCCGGCC CCGCACCCGA CCGAGGCCGC CAGGCCCGCC GGGGAGGGAC CGGCCGTCCC TCGTGCGAGC TGGCCGATGG AGTACCGGCG GCTCGGCCGC AGCGGGCTGA ACGTCTCGCG GCTCGCGCTG GGAGTCGCGG CTCTCGGCGC CGGGCCCGCG GACCATGACG AGCGGGCCGC GATCGGCGTC ATCCACCGCT TCCTCGACGC CGGCGGCAAC CTCCTCGACA CCACCGCCGC GGGCGACATC GGCCGGGGTG GCACGGACCC GGACGGCGCC GCGGCGGAGC TCTGCGGCCG TGCCGTTCGG GACAGGCGCT CGAGCGTCGT CCTGGCCTCG AGCGTCGGGC GGCCGAGCGG GCAGGGCCCC CACGACGGCG GGAACAGCCG CCGTCACATC CGGGCCGCCT GCGAGGCGAC CCTCCGCCGG CTCCGGACGG ACTATCTCGA CCTCCTCCAA CTCGACGCCG ACGACCCGAC GACCCCGCTG GAGGAGACGA TCGACGCGCT GGACGACCTG GTGCGCGCCG GGAAGGTCCT CTACGTCGGC GTCGCCAACC TGCACGTCTA CCGGGTGACG AAGGCGCTGT CGGTCAGCGA CCGGCTCGGC CGGGCCCGTT TCATCTCGTT CCGCGGCCCG TACGGCCTGC TCTCGCGGGA GCTCGAACAC GAGCACCTCC CGCTGCTGGC GGAGGAAGGC CTCGGCCTGA TCAGCACGAG CTCACTCCGC TCCCCCGGGC ACGGGCACGG GCACGGGCAC GCCACTGTCG CCGCCACGGA GGCCGCGGCG GCGGAGCTCG GGTGCACGAC CACGCAGCTG TCGCTGGCCT GGCAACTGAC GAGATCCGTC ACCTCGATCA CGCTCGACGT CGCCTCCGCG GCCCAGCTGG ACGAGCACCT CGCGGCCCTG GGCATCGAGA TCCCCACCGA GATCGCGGCA GCACTGGAAC AGGTCTCCCG TCCCCAGGGG TAA
|
Protein sequence | MESRAGGERS RAGRPTATEA ARLTERLRRA AVDTFLESGY DGTTMEAVAQ AAGITKSTLY ARYPDKRTLF IAVSSWALTR QERDERVLEP LPDDLAESLT VIARAILARS VDPDIVRLSR MAIAESARFP EFAASSQAVT WSPRVQLIID LLRRHESAGT VVVGDVDLAA EQFFAMVGAM PAWLAAYGIY RTPEVEEEHL HHAVSLFLNG VLARPETMPA PHPTEAARPA GEGPAVPRAS WPMEYRRLGR SGLNVSRLAL GVAALGAGPA DHDERAAIGV IHRFLDAGGN LLDTTAAGDI GRGGTDPDGA AAELCGRAVR DRRSSVVLAS SVGRPSGQGP HDGGNSRRHI RAACEATLRR LRTDYLDLLQ LDADDPTTPL EETIDALDDL VRAGKVLYVG VANLHVYRVT KALSVSDRLG RARFISFRGP YGLLSRELEH EHLPLLAEEG LGLISTSSLR SPGHGHGHGH ATVAATEAAA AELGCTTTQL SLAWQLTRSV TSITLDVASA AQLDEHLAAL GIEIPTEIAA ALEQVSRPQG
|
| |