Gene Information |
Locus tag | Franean1_5681 |
Symbol | |
ID | 5674007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6896942 |
End bp | 6898384 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244534 |
Product | allantoinase |
Protein accession | YP_001509937 |
Protein GI | 158317429 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase |
|
|
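The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminating stop codon (both are assumptions; the record does not state its conventions). The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon
    is a stop codon that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_bp, protein_aa = cds_lengths(6896942, 6898384)
print(gene_bp, protein_aa)
```

With the coordinates listed here, this gives 1443 bp and 480 aa, which agrees with the Gene Length and Protein Length fields.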
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.642788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGTG GGCAGGCCGG TGCCGCGACG CCGCCGGACG CCGTCGAGGT GATCCGGTCG CGTCGGGTCG TACTGCCCGA CGGCGAGCGC CCCGCGGCCG TCCACGTCGT CGACGGCCGG ATCACCGCGG TCACCGCGCC GGGCGAGGTG CCATCCGGCG TCGCCGTCAC CGATCTCGGC GACCTCGTGC TCATGCCCGG CGTGGTCGAC ACGCATGTCC ACGTCAACGA GCCCGGCCGG ACCGAGTGGG AGGGCTTCGC CAGCGCCACC CGGGCCGCGG CGGCCGGCGG CGTGACGACG ATCATCGACA TGCCGCTGAA CTCGATCCCC CCCACCACCT CGCTGGACGC GCTGGCCGCC AAGCGGGCGG CGGCCGAGGG CCAGGTCGCC GTCGACGTCG GTTTCTGGGG CGGGATCATC GGCGCCGACG CCCGCCGGCT CGACGACCTA GCCGCGCTGC ACGACGCCGG CGTGTTCGGG TTCAAGGCGT TCCTGGCACC CTCAGGGGTC GAGGAGTTCC CGCACGTGAG CCTCGACGTG CTCGCCGCCG CCTCCCGGCA CACCGCCCGG ATGGACGCCC TCACCGTCGT CCACGCCGAG TCGCCCTCCG TGCTCGCCGA GGCGCCTGAG GCGGCCGGCC GCACGTTCGC CAGCTGGCTG CGCTCCCGCC CGCCGGCCGC CGAGAAGGCC GCGGTGGCCT CGCTCGCGGC GCTCACCGCC TCGACGGGCG CGCGTTTGCA CGTCCTGCAC CTGGCGGCGG CCCAGGCGCT CGACGACGTC GTCTCCGCCC GCGAGGCCGG CCTGCCCATG ACCGTCGAGA CCTGCCCGCA CTACCTGACC TTCACCGCCG AGGAGGTCCC CGACGGCGCG ACCGTCTTCA AGTGCGCGCC GCCCATCCGG GAACGCGCGA ACCTGGACCG GCTCTGGGAC GGCCTGGCCG CCGGCCTGTT CGCCGGCGTC GTCACCGACC ACTCGCCAGC CACCCCGGCG CTGAAGTCCG TCGAGACCGG TGACTTCGGG ACGGCCTGGG GTGGCATCGC CTCCGTCCAG CTGGGCCTGG CCGCGGTGTG GACCCAGGCA CGCCGCCGCG GGCACGGCCT CGTCGACGTC GTCCGGTGGA TGTGCTCCGG CCCCGCCGAC CTGGTCGGGC TGGGCGCCCC GGGACTGGCT ACCTCGGGAC TGGGCACCCC GGGACTGGGC ACCGGCGGGG CGGGCTCCGT GCCGAACGGC ACCAAGGGAC GCATCGCCGT TGGCGCCGAC GCCGACCTGG TGGTCTTCGA CCCCGACGCC ACGTTCGTCG TCGAACCGTC CCTGCTGCGC CACCGCCATC CGCTCACGCC CTACGCTGGG CGAACGCTTG ACGGCGTGGT TCTGGCGACG TACCTGCGGG GACGGCGGGC GGACGGTGAC CGGCCCGCGC GAGGCCGGCT GCTCTCCCGA TGA
|
Protein sequence | MSGGQAGAAT PPDAVEVIRS RRVVLPDGER PAAVHVVDGR ITAVTAPGEV PSGVAVTDLG DLVLMPGVVD THVHVNEPGR TEWEGFASAT RAAAAGGVTT IIDMPLNSIP PTTSLDALAA KRAAAEGQVA VDVGFWGGII GADARRLDDL AALHDAGVFG FKAFLAPSGV EEFPHVSLDV LAAASRHTAR MDALTVVHAE SPSVLAEAPE AAGRTFASWL RSRPPAAEKA AVASLAALTA STGARLHVLH LAAAQALDDV VSAREAGLPM TVETCPHYLT FTAEEVPDGA TVFKCAPPIR ERANLDRLWD GLAAGLFAGV VTDHSPATPA LKSVETGDFG TAWGGIASVQ LGLAAVWTQA RRRGHGLVDV VRWMCSGPAD LVGLGAPGLA TSGLGTPGLG TGGAGSVPNG TKGRIAVGAD ADLVVFDPDA TFVVEPSLLR HRHPLTPYAG RTLDGVVLAT YLRGRRADGD RPARGRLLSR
|
| |
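The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is illustrative only: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares for non-start codons), and the demonstration uses only the first three 10-base blocks of the gene sequence rather than the full record.

```python
BASES = "TCAG"
# Standard amino acid assignments in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAGTGGTG GGCAGGCCGG TGCCGCGACG"
print(translate(prefix))             # MSGGQAGAAT
print(round(gc_percent(prefix), 1))  # 73.3
```

The translated prefix matches the first block of the protein sequence (MSGGQAGAAT). Applying `gc_percent` to the full gene sequence should give a value comparable to the GC content field; that result is not computed here.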