Gene Information |
Locus tag | Franean1_4198 |
Symbol | |
ID | 5672553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4998404 |
End bp | 4999621 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641243071 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_001508488 |
Protein GI | 158315980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
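The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4998404, 4999621)
print(length_bp)                  # 1218
print(protein_length(length_bp))  # 405
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.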
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0589061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.780011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTTC TGGCCGGCGC CCGGGTGGTG ACGCCGCACG GTGTCCTCGA CCCCGGCCGG GTGCGCGTCG AGAACGGCCT GATCACCGAG GTCGGCACCG AGGCCGGCAC CGAGGTCGGG CCCACGGCCG GGCCCGTCGG TGGGGAGGCG GGCGGCGAGG GGACGGGCGG CGAGGACATC GTCGACCTGG GCGGGTCCTG GCTGGTGCCC GGGTTTGTCG ACCTGCACGT CCACGGCGGG GGCGGGCACG ACGTCACGGC GTCACCGGCC GATCTGGCCG CGGCGGTGGC CTTCCACCGG GCGCACGGCA CGACCCGCAC GCTGGTCTCG CTGGTGGCGG CGCCGGTGGA GCGCCTGGCC GAGCAGTTGT CCTGGGTGGC GGCGCTCACC GCGACCGGGC CGGGGCCGGA CGGCCATGTG GTCGGCGCGC ATCTGGAGGG GCCGTTCCTC GCGCCCGCGC GCCGTGGCGC CCAGCCGGGT GAGCATCTGC GCGGGCCTGA CCGCGGTGTG TTCGCCGAGC TCGTCGCGGC GGGCGCGGGC ACGCTGCGGG TGATCACCCT CGCCCCGGAG CTGCCCGGGG CCGGCGCGGT GACCGAGGCC GCGCTCGCGG CGGGGGTGAT CGCCGCCGCC GGCCACACGG ACGCCACCTA CGACGAGGCC GCCTCCGGTT TCGCGGCTGG CATGACGCTC GCCACCCACC TGTTCAACGG CATGCGCCCG CTGCATCACC GCGAGCCCGG CCCGGCCGGG GCGGCGCTCG ACGCCGGTGT GGCCTGCGAA CTGATCAACG ACGGTGTGCA CGTGCACCCG GCGCTGCTGC GCCTGGTCGC CGCCGAGCCG GCGCGCCTGG TGCTGGTCAC CGACGCGGTC GACGCGGCGG GTGTCGGCGA CGGCGACTAC CTGCTGGGCG GCCACCCGGT CCGGGTCCGG GACGGGCAGG CCCGCCTGGC CGCCACCGGC GCGCTCGCCG GCAGCACCCT GACGATGGAC CTGGCGGTGC GCCGCGCCGT CGCGGCCGGG CTTGCGCTCG AGGTGGCGGT CGCCGCCGCG GCGACGAACC CCGCCCGGGT GCTGGGCCTC GCCCACCGCT GCGGGTCGAT CGCCCCCGGG CTGGACGCCG ACCTCGTCGT GCTCGACGCC GATCTACGGG TCACGAGGGT CATGGCGGCC GGCAGGTGGG TCCCCGGTCC GGCTACCCGG CCGATCGCGG CCGGGTAG
|
Protein sequence | MIVLAGARVV TPHGVLDPGR VRVENGLITE VGTEAGTEVG PTAGPVGGEA GGEGTGGEDI VDLGGSWLVP GFVDLHVHGG GGHDVTASPA DLAAAVAFHR AHGTTRTLVS LVAAPVERLA EQLSWVAALT ATGPGPDGHV VGAHLEGPFL APARRGAQPG EHLRGPDRGV FAELVAAGAG TLRVITLAPE LPGAGAVTEA ALAAGVIAAA GHTDATYDEA ASGFAAGMTL ATHLFNGMRP LHHREPGPAG AALDAGVACE LINDGVHVHP ALLRLVAAEP ARLVLVTDAV DAAGVGDGDY LLGGHPVRVR DGQARLAATG ALAGSTLTMD LAVRRAVAAG LALEVAVAAA ATNPARVLGL AHRCGSIAPG LDADLVVLDA DLRVTRVMAA GRWVPGPATR PIAAG
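The sequence fields are printed in space-separated blocks of ten characters, so the spaces have to be removed before the sequence is used elsewhere. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the record does not state how its GC content figure was computed, so this is not presented as that method. The example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(field: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from the record above.
first_blocks = "ATGATCGTTC TGGCCGGCGC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.65
```

Applying `clean_sequence` to the full Gene sequence field should yield a string whose length matches the Gene Length field, which gives a quick check that nothing was lost when copying the record.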