Gene Information |
Locus tag | Franean1_3848 |
Symbol | |
ID | 5672211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4570873 |
End bp | 4571874 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242726 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001508146 |
Protein GI | 158315638 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
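The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state these conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(4570873, 4571874)
print(length)                  # 1002
print(protein_length(length))  # 333
```

Both results match the Gene Length (1002 bp) and Protein Length (333 aa) fields in the record.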
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0846255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.776114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCC TGGCCGGAAT GCTGGACGGG CCGCGGGCGC GGGGGGCGTT CACGATCCGC TCGGTCATGT CCCCGCCCTG GTCGATGCTC ATCAGGGACC AGGCGCCGCT GACCGTCGTC GCGGTCGTGC ACGGCGAGGC CTGGGTGCTC CCCCGGGACA CCCCCGCGGT GCAGCTGCGC GCGGGCGACG TGGCCGTCGC CCGCGGGCCC GACCCGTACG TCGTCGCCGA CGACCCGGCT ACCCCGCCGC GGATCGTGAT CCATCCCGGG CAGGTCTGCA CCACGCTGTC GGGTGCGAGC CTCGCCGCGC AGATGGATCT CGGCGTGCGC ACCTGGGGCA ACCAGCGCGA CGGCGCGACG GCCCCGCCGA CGACGATGCT CACCGGCACG TACCAGCAGC ACAGCGAGGT CAGCCGCCGG CTGCTCGACG CGCTGCCCGC GCTGGCCGTC ATCCGCGCGG ACGACTGGGA CTGCCCGCTG GTACCCATGC TGGCCCAGGA GATCGGCCGG GACGACCCGG GCCAGGCCGC CGTCCTCGAC CGCCTCCTGG ACCTGCTGCT CGTCACCGCG GTGCGGGCCT GGTTCGCCCG CCCCGACACC GACGCGCCGC CGTGGTGGCG GGCCAACGGC GACCCCGTCG TCGGCCACGC GCTGCGGCTG CTGCACAACC ACCCCGAGCG CCCCTGGACG ATCGCCGCCC TCGCCGCGGC CACGGGCGTC TCCCGCGCCT CGTTCGCGCG CCGGTTCGCC AGCCTGGTCG GCGAGCCGCC CATCGCGTTC CTGACCGGCT GGCGTCTCAC CCTGGCCGCC GACCTGCTCC AGGAGCCGGC GGCCACGGTC GGCGCGGTGG CCCGCCAGGT CGGCTACGGC AGCCCGTTCG CCCTCAGCAC GGCGTTCCGC CGCCGGTACG GCGTCAGTCC GCAGCAGTAC CGCGCCCGCG CCCACGGCGA CCGGGCGGAC CAGCCGCCGT CCGACGGCCG CCCGGCGGAC GCGCCCGGAT GA
|
Protein sequence | MDVLAGMLDG PRARGAFTIR SVMSPPWSML IRDQAPLTVV AVVHGEAWVL PRDTPAVQLR AGDVAVARGP DPYVVADDPA TPPRIVIHPG QVCTTLSGAS LAAQMDLGVR TWGNQRDGAT APPTTMLTGT YQQHSEVSRR LLDALPALAV IRADDWDCPL VPMLAQEIGR DDPGQAAVLD RLLDLLLVTA VRAWFARPDT DAPPWWRANG DPVVGHALRL LHNHPERPWT IAALAAATGV SRASFARRFA SLVGEPPIAF LTGWRLTLAA DLLQEPAATV GAVARQVGYG SPFALSTAFR RRYGVSPQQY RARAHGDRAD QPPSDGRPAD APG
|
| |
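The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that grouping, compute the GC fraction, and translate the gene sequence. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA, even though the gene lies on the minus strand) and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The function names are hypothetical, and only unambiguous A/C/G/T codons are handled.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGACGTCC TGGCCGGAAT GCTGGACGGG"))  # MDVLAGMLDG
```

The translated prefix agrees with the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.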