Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3084 |
Symbol | |
ID | 3904210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3654389 |
End bp | 3655789 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880405 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_482170 |
Protein GI | 86741770 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACCG CTCGAGCTCG ACCTGGGCCG TACCCTGGTG TGGTGAGCGT ACTGGATCTC TGGCGGACCC TTCCGGCCAA GCAGCAGCCG TCCTGGCCCG ACCCGGCAGA GTTGAGAGCC GCCCATGACC AGCTTGCGGC CCTCCCCCCG CTGGTCACCG CGCCCGAGAT CCGGTCCCTG ACCGACCGTC TCGCGCTGGT CGCTCGGGGC GAGGCGTTCC TGTTGCAGGG CGGTGACTGC GCCGAGACCT TCGTGGCGAA CACCGCCGAC AAGATTCGGG ACAAGGTCAA GACGCTGCTG CAGATGGCGG TCGTCCTCAC CTATGGCGCG AGCACGCCGG TGGTGAAGGT CGCCCGCATC GCCGGGCAGT ACGCCAAGCC GCGGTCCGCG GACATCGAGG CCAGCACCGG GCTGCCCTCC TACCGGGGCG ACGCCGTCAA CGACATCGCG CCCACGGCCG CCGCGCGGCA GCCCGATCCG ATGCGGATGG TCGCCGCCTA CCACCAGAGC GCAGTCGCGC TGAACCTCGT GCGGGCCTTC GCCACCGGAG GTTTCGCCGA CCTGTCGAAG GTCCACGAGT GGAACAAGGC GTTCGTGCGG GATTCCGCGG CGGGTCGCCG CTACGAGGTG ATGGCGACGG ACATCGAGCG TGCCCTCGCC TTCATGGCCG CCTGCGGCAT CGACCTCGAT CGGACCGCCG CGCTGACCGG GGTCGAGCTG TTCACGAGTC ACGAGGGCCT CCTGCTGGAG TACGAGCGTG CCCTGACCCG GATCGAGGAC GCCACCGGCG ACCCGTACGA CCTGTCCGCC CACATGATCT GGATCGGCGA GCGGACCCGG GACCTCGACG GTGCCCACGT CGACCTGCTG TCCCGGGTGG GCAATCCGAT CGGCTGCAAG ATCGGGCCGA AGGCTGGCCC GGACGAGGTC CTCGAGCTTG CCGAGCGGCT CAACCCCGAC CATGTCCCCG GCCGGCTCAC CCTGATCTCC CGGATGGGGG CAAAGCGGGT CCGGGACACG CTGCCCCCGA TCATCGAGAA GGTCAACGCC GCCGGGCCGC CAGTGGTGTG GTCGTGCGAT CCGATGCACG GCAACACCCG TGACGTGGGC GGCGTCAAGA CGCGCCACTT CGACGATGTC CTCGACGAGG TCTTCGGCTT CTTCGAGGTC CACAAGGCGT TGGGCACCCA CCCCGGTGGC CTGCACATCG AGCTGACCGG GGAGAACGTC ACCGAGTGCC TCGGTGGGGC CGAACTCATC GGGGAAGACG ACCTCGGCGG CCGTTACGAG ACCGCCTGCG ACCCCCGTCT GAACACCGGG CAGGCGCTAG AACTGGCCTT CCTCGTCGCC GAGATGCTGC AGAACACGCG GGGAGAACGC GGCGCGCCGT GGCCCTCGTG A
|
Protein sequence | MVTARARPGP YPGVVSVLDL WRTLPAKQQP SWPDPAELRA AHDQLAALPP LVTAPEIRSL TDRLALVARG EAFLLQGGDC AETFVANTAD KIRDKVKTLL QMAVVLTYGA STPVVKVARI AGQYAKPRSA DIEASTGLPS YRGDAVNDIA PTAAARQPDP MRMVAAYHQS AVALNLVRAF ATGGFADLSK VHEWNKAFVR DSAAGRRYEV MATDIERALA FMAACGIDLD RTAALTGVEL FTSHEGLLLE YERALTRIED ATGDPYDLSA HMIWIGERTR DLDGAHVDLL SRVGNPIGCK IGPKAGPDEV LELAERLNPD HVPGRLTLIS RMGAKRVRDT LPPIIEKVNA AGPPVVWSCD PMHGNTRDVG GVKTRHFDDV LDEVFGFFEV HKALGTHPGG LHIELTGENV TECLGGAELI GEDDLGGRYE TACDPRLNTG QALELAFLVA EMLQNTRGER GAPWPS
|
| |