Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3308 |
Symbol | |
ID | 5671680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3919551 |
End bp | 3920810 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242197 |
Product | amidohydrolase |
Protein accession | YP_001507617 |
Protein GI | 158315109 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.692035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.417277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGCG TCCCCGGTGC TACCGGCGCG ACCGTTCTGC GCGCGGCACG CTGGGTCGAC GTCGACGCGG GGACGGTGCG CTCGCCCGCG GTCGTGGTGG TCGAGGGGAA CCGCATCACC GCGGTGAACC CGGCCGTGCC GCCCACCGGC GCGACCGAGA TCGATCTGGG TGCTCTCACT CTGCTGCCGG GCCTGATGGA CATGGAGATC AATCTCCTCC TCGGTGGCCC CGAGAACCCG ACGGGCCTGC CGAACCCGCT GCACGGCGTC CAGGACGACC CGGTGTACCG GACCCTGCGG GCGACGGTGA ACGCCCGCAC CACGCTGCTG GCCGGTTTCA CGACGGTGCG CAACCTCGGG CTGATGGTCA AGACCGGGGG GTATCTGCTG GACGTGGACC TGGCCCGGGC GATCGAAGCG GGCTGGGTAC CGGGGCCCCG GATCGTGGCC GCCGGTCACG CGATCACGCC CACCGGCGGG CACCTCGACC CGACGATGTT CCAGCGGCTC GCGCCGCACA TCATGCCGCT GGGCGTCGAG GAGGGGATCG CCAACGGCGT CCCGCAGGTG CGCGCGGCGG TCCGCTACCA GATCAAGTAC GGCGCCGGGG TCATCAAGAT CTCGGCGTCG GGCGGGGTGA TGTCGCACAG CACCGCCGCC GGCGCGCAGC AGTACTCCGA CGAGGAGATC GCGGCCATCG TCGACGAGGC CCACCGGGCG GGGCTCAAGG TGGCTGCCCA CGCCCACGGC GACGCGGGCA TCCGGGCCTG TGTCCGGGCC GGGGTGGACT GCATCGAGCA CGGCTCACTG GCCAGCGACG ACACCATCCG GATGATGGTC GACCATGGGA CTTTCCTCGT CCCTACCAGC TATCTGTCGG AAGGCCTCGA CATCTCGAAG GCGGCGCCCG CGCTCCAGGC GAAGGCCGCG GAGGTCTTCC CCCGGGCTCG GCGGACGCTG GGTAGGGCCA TCGAGGCCGG GGTGCGGATC GCGTGTGGCA CCGACGCGCC CGCCATCCCG CACGGGCACA ACGCGAAGGA GCTGTGGGCT CTGGTCGACC GCGGCATGAC CGCGATGCAG GCGCTGCGGG CCGCCACGGT CACCAGCGCC GAGCTGATCG GTGTCGATGA CCGCGGTCGC CTGGCGGCTG GTCTGCTGGC CGACATCATC GCGGTTCCCG GAGATCCATC CGATGACATC ACGGCCACGC AGGACGTGCG GTTCGTGATG AAGGACGGCC TCGTCTACAA GAACGAGTAG
|
Protein sequence | MTGVPGATGA TVLRAARWVD VDAGTVRSPA VVVVEGNRIT AVNPAVPPTG ATEIDLGALT LLPGLMDMEI NLLLGGPENP TGLPNPLHGV QDDPVYRTLR ATVNARTTLL AGFTTVRNLG LMVKTGGYLL DVDLARAIEA GWVPGPRIVA AGHAITPTGG HLDPTMFQRL APHIMPLGVE EGIANGVPQV RAAVRYQIKY GAGVIKISAS GGVMSHSTAA GAQQYSDEEI AAIVDEAHRA GLKVAAHAHG DAGIRACVRA GVDCIEHGSL ASDDTIRMMV DHGTFLVPTS YLSEGLDISK AAPALQAKAA EVFPRARRTL GRAIEAGVRI ACGTDAPAIP HGHNAKELWA LVDRGMTAMQ ALRAATVTSA ELIGVDDRGR LAAGLLADII AVPGDPSDDI TATQDVRFVM KDGLVYKNE
|
| |