Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0168 |
Symbol | |
ID | 5668593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 201871 |
End bp | 202785 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239097 |
Product | dihydropteroate synthase |
Protein accession | YP_001504541 |
Protein GI | 158312033 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00447644 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.140148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAC CCCGTGACGA CCTGTTCGCC CCGGACGCCC CAACCCGGGT GATGGGCGTC GTCAACGTCA CACCGGACTC GTTCTCCGAC GGCGGCGCCT ACCCGGATCC GCATGACGCC GTCCGGGCCG CGCAGCTCAT GATCGAACAG GGCGCGGACG TCATCGACGT CGGCGGCGAG TCGACCCGTC CCGGAGCGCC GCGCGTGGTG GCCAGCGAGG AGCTGCGCCG GGTGCTCCCG GTGGTGGCCC AGCTCGCCTC CGCCGGCGTC CCGGTAAGCG TGGACACTAG CCGCGCCTCG GTCGCCTCCG CGGCCGTGGA CGCCGGCGCC CTGCTGGTCA ACGACGTCTC CTGCGGCCGG AACGCGGAGC TGCTGCGCAT CGTCGCCGAC CGCGGCGTCC ACTACGTGCT GATGCACTCC CGCGGGTCCA GCACGGACAT GGCCGCCCAC GCGGTCTACA ACGATGTGGT GACCGACGTG GTGGGCGAGC TCGCCGCCCG GCTGGACGTC GTCCTGGCGG CCGGCGTGGC GGAGGACAAG GTGATCATCG ACCCGGGCAT CGGGTTCGCG AAGACGCCGG CGCACAACTG GGCGCTGCTG GCGAACCTCG GCGCCGTCGC CTCCCTCGGC CGGCCGCTGC TCGTCGGCAC CTCGCGCAAG TCGTTCCTCG GCGCGGCGGT GAGCGGCCCG GGCGAGCTGC CCCGCCCGGC GGACGAGCGC GACGACGCCA CCCAGGCCAC GACCGCTCTG CTGGCGGACG CCGGGGTCTG GGCGGTGCGG GTGCACGCGG TCCGCCCCGC GGTGGACGCC GTCCGCGTGG TGCGGGCCTG GCAGCAGGCC GCCCGCGCCT CCGGGGCCGG CCGGCCCCGC GCGGCCGTGT CCGGCAGCGC CCCCGCGGCT GGACCATCCC GATGA
|
Protein sequence | MTGPRDDLFA PDAPTRVMGV VNVTPDSFSD GGAYPDPHDA VRAAQLMIEQ GADVIDVGGE STRPGAPRVV ASEELRRVLP VVAQLASAGV PVSVDTSRAS VASAAVDAGA LLVNDVSCGR NAELLRIVAD RGVHYVLMHS RGSSTDMAAH AVYNDVVTDV VGELAARLDV VLAAGVAEDK VIIDPGIGFA KTPAHNWALL ANLGAVASLG RPLLVGTSRK SFLGAAVSGP GELPRPADER DDATQATTAL LADAGVWAVR VHAVRPAVDA VRVVRAWQQA ARASGAGRPR AAVSGSAPAA GPSR
|
| |