Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1408 |
Symbol | |
ID | 5669814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1704654 |
End bp | 1705862 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240331 |
Product | amidohydrolase |
Protein accession | YP_001505758 |
Protein GI | 158313250 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.321357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACATCG ACGCCGGTGT CGTGCGGTCG CCCGCGGTGA TCGTGGTGGA GGGTGGTCGC ATCACCGCGA TCGGCCCCAC CGAGGTCCCC GCGAACGCGA CCGAGTTCGA CCTGGGCGAT GTCACGCTCC TGCCTGGGCT CATGGACATG GAGCTGAACC TGCTCATCGG CGGTCCGGGC GGCCCGGACG GGCTACCCAA CCCGATGCAT GGCGTCCAGG ACGACCCGGT GTACCGGACC CTGCGTGGCG CGGTGAACGC CCGCGCCACG CTCCAGGCCG GCTTCACCAC CGTGCGCAAC CTGGGGCTGA TGGTCAAGAC CGGCGGGTAC ATGCTCGACG TCGCGCTGCA GCGGGCGATC GACCAGGGCT GGCACGAGGG CCCCCGCATC GTCCCCGCGG GCCACGCCGT CACGCCGTAC GGCGGCCACC TCGACCCGAC GGTGTTCCAG CGCCTCGCAC CCGGGATCAT GCCGCTCAGC GTGGCCGAGG GCATCGCCAA CGGCGTGGCC GAGGTGCGCG CCTGCGTCCG ATACCAGATC CGCCACGGCG CCAAGGTGAT CAAGATATCG GCGTCGGGGG GCGTGATGTC GCACTCCACG GCGCCGGGCT CGCAGCAGTA CTCCGACGAG GAGTTCGTCG CGATCGCCGA CGAGGCGCAC CGGGCGGGAA TCCGGGTCGC CGCGCACGCG GTCGGCGACA GCTCGGTCCA GGCCTGCATC CGGGCCGGTG TGGACTGCAT CGAGCACGGC TTCCTCGCCA CCGACGAGTC CATCCAGATG ATGGTCGACC ACGGGACGTT TCTGGTGTCG ACCACATACC TCACCGAGGC GATGGCCATC GAGCGGGCCG CGCCCGAACT CCAGAAGAAG GCCGCTGAGA TCTTTCCTCA GGCCAAGGCG ATGCTGCCCA AGGCGATCGC CGCCGGAGTG AAGATCGCGT GCGGTACGGA CGCGCCGGCG ATCCCGCACG GGGAGAACGC CATGGAGCTC ATCGCGCTGG TCGACCGTGG CATGACTCCG ATGCAGGCGC TGCGGGCGGC GACCGCGACG AGCGCGGAGC TCATCCAACG GGAGGACGAG CTCGGCCAGC TCGCCGTCGG GTATCTCGCC GACATCATCG CGGTGCCTGG TGATCCGTCC GAGGACATCG CCGCCACACG GGACGTCCGT TTCGTTATGA AGGAGGGCCG TGTCTACAAA CACCTCTGA
|
Protein sequence | MDIDAGVVRS PAVIVVEGGR ITAIGPTEVP ANATEFDLGD VTLLPGLMDM ELNLLIGGPG GPDGLPNPMH GVQDDPVYRT LRGAVNARAT LQAGFTTVRN LGLMVKTGGY MLDVALQRAI DQGWHEGPRI VPAGHAVTPY GGHLDPTVFQ RLAPGIMPLS VAEGIANGVA EVRACVRYQI RHGAKVIKIS ASGGVMSHST APGSQQYSDE EFVAIADEAH RAGIRVAAHA VGDSSVQACI RAGVDCIEHG FLATDESIQM MVDHGTFLVS TTYLTEAMAI ERAAPELQKK AAEIFPQAKA MLPKAIAAGV KIACGTDAPA IPHGENAMEL IALVDRGMTP MQALRAATAT SAELIQREDE LGQLAVGYLA DIIAVPGDPS EDIAATRDVR FVMKEGRVYK HL
|
| |