Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4205 |
Symbol | |
ID | 3907170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5021229 |
End bp | 5022065 |
Gene Length | 837 bp |
Protein Length | 278 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881533 |
Product | 2-amino-3,7-dideoxy-D-threo-hept-6-ulosonate synthase |
Protein accession | YP_483282 |
Protein GI | 86742882 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1830] DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes |
TIGRFAM ID | [TIGR01949] predicted phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.958913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAAC AGCACTTCGC CCGTGCGCTG AGGATGGCCC GGCTCCACCG CTGGCACCCC GCCAGGCTGG CGATCACACC GCTCGACCAC TCGATCAGCG ACGGACCCGT GGTGCCGCGC GGCACCACCA TCGACGGCCT CGCAGGTCAG CTGGTGGAGG CCGGCGTGGA CGCCATCGTC GTCCACAAGG GCAGCCTCCG GCACCTTCGT CCCGCCCGTC TCACCGGCAT GTCGGTGATC GTCCATCTCA ACGCAAGCAC CGCTCACGCA CCCGATCCGG ATGCCAAGTA CCTGGTCACC GGAGTGGAGG AAGCGCTCAG GCTGGGTGCG GACGCAGTCA GTCTGCACGT CAACCTCGGC TCCCTCGACG AACGCCAGCA GATCGGGGAC CTCGGCCGGG TCGCGGAGCG CTGCGAGCAG TGGAACCTGC CGCTGCTCGC GATGATGTAC CCGCGCGGGC CGCGGATCAG CGACCCGCAT GATCCGGAGC TGATCGCGCA CGCGGTCACA CTCGCCGTGG ACCTGGGCGC GGACCTGGTC AAGGCCCCCT TCCCCGGGTC CGTCCCGGCC CTGCGCGACC TGACGGACGC CTGCCCCGTC CCCCTGCTGT GCGCCGGCGG ACCCCGCCGC AGTGAGGACG ACGTTCTGGC GTACGTACGC GACGTGCTGC ACGGCGGGGC CGCCGGCGTG GCCATGGGCC GCAGCATCTT CCAGGCCGAC GACCCGCGGC GGATGGCTGC GGCGGTGGCC CAACTGGTCC ATGCGGAATC CGAGCCTCGT CTCGAACCGA CTGCAGAAGG GCAGCGAAGT GAACGCAAGG AAGCTGTGCT GGCTTGA
|
Protein sequence | MTEQHFARAL RMARLHRWHP ARLAITPLDH SISDGPVVPR GTTIDGLAGQ LVEAGVDAIV VHKGSLRHLR PARLTGMSVI VHLNASTAHA PDPDAKYLVT GVEEALRLGA DAVSLHVNLG SLDERQQIGD LGRVAERCEQ WNLPLLAMMY PRGPRISDPH DPELIAHAVT LAVDLGADLV KAPFPGSVPA LRDLTDACPV PLLCAGGPRR SEDDVLAYVR DVLHGGAAGV AMGRSIFQAD DPRRMAAAVA QLVHAESEPR LEPTAEGQRS ERKEAVLA
|
| |