Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6420 |
Symbol | |
ID | 5674735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7795709 |
End bp | 7797331 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641245268 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001510663 |
Protein GI | 158318155 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.178925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.589237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCCA GTCCGCATCT CGACGCCGAG CTGTGCGCGC GGGTGGTGCG GTCCCGGGAC GCCCGGTTCG ACGGGTGGTT CTTCGTCGCG GTGACCTCCA CCGGCATCTA CTGCCGGCCG AGCTGCCCGG CGCGGCCACC CAGGACCGAG AACATGCGGT TCCACCCGAC CGCCGCGTCC GCCCAGCGCG CCGGGTTCCG GGCCTGCAAG CGCTGCCGCC CGGACGCGAG TCCCGGCTCA CCGGAGTGGA ACCACCGCGC CGACGTCGTG GCGCGCGCGA TGCGGCTCAT CGTGGACGGC GTCGTCGACC GTGAGGGTGT CCCCGGGCTT GCCGGTCAGC TCGGCTACAG CGCACGCCAG GTGGAGCGGC ACCTGTCAGC CGAGCTCGGT GCCGGGCCGC TCGCGCTGGC CCGCGCCCAG CGCGCCCAGA CCGCCCGGCT CCTGCTCGAG ACCACGGACC TGAGCATGGG CGACGTCGCG ACCGCCGCCG GCTTCAACAG CATCCGCGCC TTCAACGACA CCATGCGCGA GGTCTTCGCC GCCGCGCCGA GCGATCTGCG ACGCCGTCGC AACCCGGCGA GGCCCCCGTC CGTCGTGCCG GTACCGAGGG CGGCTACCGG TGCCGTGACG TTGACCGTCC GGCTGCCGTT CCGGGCCCCG CTGTATCCCG ACAACCTGTT CGGGCACCTC GTCGCGACGG CCGTTCCCGG GGTCGAGGAG TGGCGGGACG GTGCCTACCG GCGCACGATG CGCACGCTGC ACGGGCACGC GATCGTCGCC CTGCGGCCGC TGCCCGACCA CATCGGCTGC CGGCTCGCCC TCACCGACGT GCGCGACCTC GCGCCGGTCA TCGGCCGCTG CCGCCGGCTG CTCGACCTCG ACGCGGACCC GATCGCCGTC GACGGGCAGC TCGCCGCCGA CCCGGCGCTG GCGCCGCTGG TCGCGCGGGC ACCGGGCCGG CGTGTTCCGC GCACCGTCGA CCCGGCCGAG CTCGCGGTGC GCGCAGTCCT CGGACAGCAG GTCTCCGTCG CGGCGGCGCG GACCCACGCC GCGCGGCTCG TCACGGCCGT CGGCACGCCG ATCCATGATC CGGAAGGCGG CCTCACCCAC CTCTGGCCAC AGATCGCGGA CCTCGCCGAG CACATCGAGC GCACCGAGTA CGCCGAGTGC ACCGACCTCG CGGACGCTGT CCCGGCCGGC CGCCGGGCCG GGGCGCCGCG CGGGCTCGCC CTGCCGGCCG CCCGGCGGCG GACCTTCGCC GCGCTGGTCG GCGGGCTGGT GTCCGGCATG ATCGAGCTGG GTGCGGGCGG AGACTGGGAG CGGGCCCGCG CCGCGCTGGC GGCTCTGCCC GGCATCGGCC CGTGGACGCT CGAGACCATC GCGATGCGGG CCCTCGGTGA CCCGGACGCA TTCCTGCCCG GTGATCTCGG TGTCCGCCGA GGGGCCGAAC GGCTCGGTCT GCCGGCCACC CCCGCCGCGC TGTCCCGGCA TGCCGCCGCC TGGCGCCCCT GGCGGGCCTA TGCCGTCCAG CACCTGTGGG CGGTGCTCGA CCATCCAGTC AACCGGATGC CCGCGCCGGA TCATCCCGGC CCGGTCACCG CCCGCCGAGA GGAACGCCTG TGA
|
Protein sequence | MPASPHLDAE LCARVVRSRD ARFDGWFFVA VTSTGIYCRP SCPARPPRTE NMRFHPTAAS AQRAGFRACK RCRPDASPGS PEWNHRADVV ARAMRLIVDG VVDREGVPGL AGQLGYSARQ VERHLSAELG AGPLALARAQ RAQTARLLLE TTDLSMGDVA TAAGFNSIRA FNDTMREVFA AAPSDLRRRR NPARPPSVVP VPRAATGAVT LTVRLPFRAP LYPDNLFGHL VATAVPGVEE WRDGAYRRTM RTLHGHAIVA LRPLPDHIGC RLALTDVRDL APVIGRCRRL LDLDADPIAV DGQLAADPAL APLVARAPGR RVPRTVDPAE LAVRAVLGQQ VSVAAARTHA ARLVTAVGTP IHDPEGGLTH LWPQIADLAE HIERTEYAEC TDLADAVPAG RRAGAPRGLA LPAARRRTFA ALVGGLVSGM IELGAGGDWE RARAALAALP GIGPWTLETI AMRALGDPDA FLPGDLGVRR GAERLGLPAT PAALSRHAAA WRPWRAYAVQ HLWAVLDHPV NRMPAPDHPG PVTARREERL
|
| |