Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3605 |
Symbol | |
ID | 5671974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4266218 |
End bp | 4267654 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242491 |
Product | transcriptional regulator, TrmB |
Protein accession | YP_001507911 |
Protein GI | 158315403 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCAGA TGATCGCCTT CCTGCGCGCC GCTGGAGGCG ACACCACCGA GGTGGAGGTC AAGTCTGCCG CCGGCGGGTT ACCCGCCTCG TTGACCTCTA CGATGAGCGC GCTGGCGAAT CAGCCCGGGG GCGGGACCAT CATCCTGGGG CTCGACGAGC GGGCCGGGTT CCGCCCCGTC GAGCTCAGCG ACCCGCAGAT GCTCAAGCAG GGCCTGGCTG CCAGGGCCCG GGCATTCACA CCGCCGGTCC GTCTCACGAT CGACGACGGG GAGGTCGACG GGGCCACGGT CGTGGTTGCG CAGGTCCACG AGTGCGACCG CTCGACCAAG CCCTGTCGCG TCACCGCGAC CGGCAGGGCG TACCTACGTG GCTACGACGG CGACTACGCC CTGTCCGACA TGGAGGAACA GGGGTTTCTG GCCGCTCGCC AGCCACCGCT GTTCGACCGT TCACCTGTCG AGGACGCCAC CATCGACGAC CTGGACACCG AACTCGTCGA TACTTTTCTA CTCGCCGTCC GCGAACGCGA CCCCGCCGGG CTCGGCCGTT TTCCCGACGA CACGGAGCTC CTGCGCCGGG CTGGAGTCAC GATGGAAGGC GGGCAGCCGA CCGTCGCGGG ACTGCTCGCT CTCGGGGTCC ATCCCCAACA ATGGTTCCCT CGCTACGTCA TCCAAGCCGC CGCGCAGCCC TTGCCCACAG ACTCCGCCGC AACGCGGGCC CGCAACCAGG TCACCATCAG CGGGCCGGTC CCGCGGATGC TTGACGCGGC GCTACTCTGG GCCCGACGTA CCTTCGACAC CGCCATCGTC GCCGAGATGG ACGGTAGCGT TCGTGACCGT CCGATCTACC CACTCGTCGC CTTCCGTGAG TTGGTCGCCA ACGCACTGAT CCACCGCGAC CTCGACCACT GGTCCGCCGG GCTGGCCGTC GAAGTGCGGC TTCTGCGGGA CCGCCTGGTA GTGGCCAATC CCGGAGGCCT GTACGGCATC ACCGTCGACC GACTCGGGCG CGACGCGGTG ACCTCCGCCC GCAACGCCAG ACTGGTCGCG ATCTGCCAGC ACGTCCGCTC CCCGCAAACC GAAGCTCGGG TCATCGAAGC CCTCGCCAGC GGAATTCCCA CCGTCACCGA GGCCCTCGCC GACCATGGCC TGCCGCCAGC CCACTACGTG GACAGCGGCA TCCGGTTCAC CGTCGTCCTC CACCAGTTCG CGACCGCCCC GCCCGCGGCG ACCGCCGAGC CCCCAATGGG CGCCACAGAG CGTCGCGTCT ACCAGACCCT GACCCGTCCG GGAAGAACAG TCAGCGACCT CGCCGAGGAG CTCGGGCTGT CCGCTCCGAA CATCCGCAAG GCACTGCGAA GCCTGCGCGG TCGCGGGCTG ATCCTTCAAC TCGGCGGCAG AGGCAAAGCC ACCACCTACC AGCGGACGGA CTCATAG
|
Protein sequence | MSQMIAFLRA AGGDTTEVEV KSAAGGLPAS LTSTMSALAN QPGGGTIILG LDERAGFRPV ELSDPQMLKQ GLAARARAFT PPVRLTIDDG EVDGATVVVA QVHECDRSTK PCRVTATGRA YLRGYDGDYA LSDMEEQGFL AARQPPLFDR SPVEDATIDD LDTELVDTFL LAVRERDPAG LGRFPDDTEL LRRAGVTMEG GQPTVAGLLA LGVHPQQWFP RYVIQAAAQP LPTDSAATRA RNQVTISGPV PRMLDAALLW ARRTFDTAIV AEMDGSVRDR PIYPLVAFRE LVANALIHRD LDHWSAGLAV EVRLLRDRLV VANPGGLYGI TVDRLGRDAV TSARNARLVA ICQHVRSPQT EARVIEALAS GIPTVTEALA DHGLPPAHYV DSGIRFTVVL HQFATAPPAA TAEPPMGATE RRVYQTLTRP GRTVSDLAEE LGLSAPNIRK ALRSLRGRGL ILQLGGRGKA TTYQRTDS
|
| |