Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1604 |
Symbol | |
ID | 5670007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1920069 |
End bp | 1921133 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240523 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001505949 |
Protein GI | 158313441 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACGG TTCTTGATCT CGACCACCTG CCGGCGCACC GGCGCGCCGA GGTCGCCCGG GACGCCATCG TCGCGCTCAC CGTCACCAGC CGGTTCAGCG TGCACGCCCA GCACGACATC CGGGGCCGGA TCGAGTCGTG GATCTTCGGC GAGGTCGGCC TGATCCGCAC GACCGTCAAC GCCGGCCAGC ACATGATCCG CACCGCCCGG CACGTGCGCC AGGACAGCGC TCCCCCGGTG CTGTCCTTCG CGCAGCGCCG GCACGGGCGC GCCGTCCAGG AGCAGTTCGG GGTCCGCCGC GAGCTGGCCG CCGGCGAGCT GTACTGCACC GACCTGAGCT CCGCGTTCGA GTACGCCAAC GCCGAGGGCG GATGCGGCCA GGGCCTGCAG GTACCGCTGG CCTCGCTCGG CCTGCCCGGC GAGGTCGTCC GGCGCGGCGC CCCGAACCTC GCCCGCAGCC CGCTCTACCC GCTGGTCACC GACCATCTCA CCGCGCTGGC CCGCGACGCC GCGGCCCTCG AGCGGGACGA GTCCGCCGCG GGCGTCGGCA CCGCGACCGT CGAGCTCGTC CGGGCGTTAC TGGTGAGCGC GGCCCGCGGC CCCGACGAGC CCCCGGGAAC ACCGGCGGAG ATCATGCTGG CCCGCGTGCG GCACCACGTG CTGGCCCACC TGGCCGATTC CGATCTGAAC GCGGACGGCA TCGCGGCCGC GGTCGGGGTG TCCGTGCGTC ACCTCTACCG GCTGTGCCGG GAGGCGGAGT TCAGCCTGGA ACAGTGGATC GTGCGCAACC GGCTGGAACG GGCCCGCGGC GCGCTCGCGG CCCGGGACGC GCGCGGCCGC AGCATCGCCG CCATCGCGCG CGCCAACGGC TTCGCCGACC CCTCCCACTT CAGCCGCCGC TTCCGGGCTG CCTACGGCAC CACCCCCCAG GAATGGCGCC GCCACCACAC CCCCCACCAC ACCTCCCCGC CGCAAACCCC CACCCCGGAG CCCGGGTCGG GGCCCGCCCC CAACCCGGAC TCAAGGTCAT CTCGCGGTGC CGGAAGGTGC GTGGCGGGTG AGTGA
|
Protein sequence | MATVLDLDHL PAHRRAEVAR DAIVALTVTS RFSVHAQHDI RGRIESWIFG EVGLIRTTVN AGQHMIRTAR HVRQDSAPPV LSFAQRRHGR AVQEQFGVRR ELAAGELYCT DLSSAFEYAN AEGGCGQGLQ VPLASLGLPG EVVRRGAPNL ARSPLYPLVT DHLTALARDA AALERDESAA GVGTATVELV RALLVSAARG PDEPPGTPAE IMLARVRHHV LAHLADSDLN ADGIAAAVGV SVRHLYRLCR EAEFSLEQWI VRNRLERARG ALAARDARGR SIAAIARANG FADPSHFSRR FRAAYGTTPQ EWRRHHTPHH TSPPQTPTPE PGSGPAPNPD SRSSRGAGRC VAGE
|
| |