Gene Information |
Locus tag | Franean1_4292 |
Symbol | |
ID | 5672647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5129980 |
End bp | 5131188 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243165 |
Product | amidohydrolase |
Protein accession | YP_001508582 |
Protein GI | 158316074 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
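The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5129980, 5131188)
print(length)                  # 1209
print(protein_length(length))  # 402
```

Both values agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields of this record.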
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.756284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCACGA TCAGGGCTGC CGGGCTTGTC GACGTCGACC TCGGTGAGGT CGTCAGGCCC GGCATCCTCA AGATCGACGG AGACCGGATC GTCGGTGTCG GTGGCTCGCC GGAGGGCGAG ATCATCGACC TCGGCGATCT GGTCCTCCTG CCCGGCCTCA TGGACATGGA GGTCAACCTG CTGATGGGCG GCCGCGGCGA GCATGCCGTG ACCTCTCCCG TCCGGGACGA CCCCCCGCTG CGGATGATGC GCGCCGTCGG CAACGCCCGC CGGACGTTGC GGGCGGGGTT CACCACCGTC CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC GCTGATGAAG GCGATCGACG CGGGCTGGGT CGACGGCCCG CGGATCGTGC CGGCCGGGCA CGCGATCACC CCGACCGGCG GCCACCTCGA CCCGACGATG TTCGGCGCGT TCGCGCCGCA CGTCCTCGAC CTGACGGTCG AGGAGGGCAT CGCCAACGGC GTCGCCGAGG TCCGCAGGGC GGTGCGCTAC CAGATCAAGC ACGGCGCGCA GCTGATCAAG GTGTGCGCGT CGGGTGGGGT CATGTCGCAC ACCGGCCTGC CCGGGGCGCA GCACTACTCC GACGAGGAAC TGCGCGCGAT CGTCGACGAG GCGCACCGCC ACGGCCTGCG GGTCGCCGCG CACACCCACG GCGCCCAGGC GGTGCGCTCG GCCGTCGAGG CGGGTATCGA CTGCATCGAG CACGGTTTCC TCATCGACGA CGAGGCGATC GAGCTCATGG TCAAGCACGG GACGTTCCTC GTGGCGACCC AGGCCCTGAC CGAGGGCATG GACGTCTCCC ACGCGCCGCC CGAGCTGAGG GAGAAGGCGG GCCAGATCTT CCCCCGGGCC CGCAACTCGA TCCGGGAGGC GATGGCCGCC GGAGTTAAGA TCGCCGTCGG TACCGACGCC CCGGCGATCC CGCACGGCAG GAACGCGATC GAGCTGGTGA CCCTGGTCGA ACGCGGCATG ACCCCGCTCG GCGCGATCCG GGCGGCGACC ACCACCGCGG CCGATCTGCT GGCCGTCACC GACCGGGGCC GACTCGCCGA GGGCCTGCTG GCCGACGTCA TCGCCGTCGC CGGTGACCCC CTGCAGGACA TCAGCACGCT GCAGAACGTG AAATTCGTGA TGAAGGGCGG CAAGACCTTT GTCCACTGA
|
Protein sequence | MLTIRAAGLV DVDLGEVVRP GILKIDGDRI VGVGGSPEGE IIDLGDLVLL PGLMDMEVNL LMGGRGEHAV TSPVRDDPPL RMMRAVGNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK AIDAGWVDGP RIVPAGHAIT PTGGHLDPTM FGAFAPHVLD LTVEEGIANG VAEVRRAVRY QIKHGAQLIK VCASGGVMSH TGLPGAQHYS DEELRAIVDE AHRHGLRVAA HTHGAQAVRS AVEAGIDCIE HGFLIDDEAI ELMVKHGTFL VATQALTEGM DVSHAPPELR EKAGQIFPRA RNSIREAMAA GVKIAVGTDA PAIPHGRNAI ELVTLVERGM TPLGAIRAAT TTAADLLAVT DRGRLAEGLL ADVIAVAGDP LQDISTLQNV KFVMKGGKTF VH
|
| |
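The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes translation table 11 assigns amino acids as in the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position. The names `CODON_TABLE`, `START_CODONS`, `clean`, `gc_content` and `translate` are hypothetical helpers, and the start codon set is deliberately limited to three common cases rather than the full list for table 11.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Illustrative subset of start codons; an assumption, not the complete set.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a start codon as M and stopping at the first stop."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation and is not emitted
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCTCACGA TCAGGGCTGC CGGGCTTGTC"))  # MLTIRAAGLV
```

The output matches the first ten residues of the protein sequence listed above. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field.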