Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3313 |
Symbol | |
ID | 5671685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3924929 |
End bp | 3926215 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242202 |
Product | amidohydrolase 2 |
Protein accession | YP_001507622 |
Protein GI | 158315114 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.646816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGG ACGACTTGAT CCTTATCAGC GTGGACGACC ACGTGATAGA GCCCCCCGAC ATGTTCGAGG GCTTCATTCC GGCGAAGTAC GCCGACCGGG CGCCCCGGCT CGTCTCGGAC GAGCTGAGCG ACAAGTGGGT GTTCGGCGAA GGCGAGGCCC GCAGCTCCGG CCTGAACGCG GTGGCCGGCC GCCCGCCCGA GGAGTACGGC CTGGAGCCGA CGCGGCTGGC GGAGATCCGA CGGGGCTGCT ACGACGTCCA CGAGCGGGTC AAGGACATGA GCGCCAACGG CGTGCTCGCC TCCCTGAACT TCCCGTCGAT GGCCCGCTTC TGCGGTCAGT TCTTCGCCAG CCGGGCCGAC CAGGACCCCG ACCTGGCCCT TGCCGTCCTC ACCGCCTACA ACGACTGGCA CATCGACGCC TGGTGCGGCG CGTATCCGGA CCGGTTCATC CCCTGCTCGA TCCCGCCGCT GTGGGACCCC CAGCTGATGG CGAAGGAGAT CCGCCGGACG GCGGCCAAGG GCTCCCATGC GGTCAGCTTC TCGATGAACC CCTACGCCCT CGGCCTCCCG TCGTTGCACA GCGATCACTG GGACCCGTTC TGGGCGGCCT GCGAGGAGAC CGAGACGGTC GTGTGCGTGC ACATCGGGTC AGGCGCCATC GGCGTGGTCA CGGCACCCGA CGCCCCGATG AACGTCGAGA TCACCTGCGC CGCCATCAAG ACCTTCCCGA CCGCCGCGGA CCTCGTCTGG TCGCCCATCT TCCAGAAGTT CAAGAACCTC AAGGTGGCCC TGTCGGAGGG CGGGATCGGC TGGATCCCGT ACTTCCTGGA GCGGGCCGAC TACGCCTACA AGCAGCACCG CGCCTGGACC CGCCCCGAGC TCGGCGGCCG CCTGCCGAGC GAGATCTTCC GTGACCATGT CGTCACGTGC TTCATCGTCG ACGACTTCGG TGTGGCCAAC CTCGATCGGA TGAACGAGGA CATGGTCACC TGGGAGTGCG ACTACCCCCA CTCCGACAGC ACCTGGCCGC GTTCACCCGA GGTGGTGATC GACGCCGTGG CCGGACTGAC CGACCTGCAG GTGGACAAGA TCACTCATCG CAACGCGATG CAGGTGTACT CCTTCGCCCC CTTCTCGATC CGTCCACGCG AGCGGTGCAC CGTCGGCGCG CTGCGGAAGG AGGCCACCGG ACACGACATC TCGATCGTCT CGCAGGGCGT CTCGGAGCGC CGGTTGACCA CGGTCGGCCA GTTCGCCCAA GCGCACCAGC CCGGCAGGAC GGCGTGA
|
Protein sequence | MNMDDLILIS VDDHVIEPPD MFEGFIPAKY ADRAPRLVSD ELSDKWVFGE GEARSSGLNA VAGRPPEEYG LEPTRLAEIR RGCYDVHERV KDMSANGVLA SLNFPSMARF CGQFFASRAD QDPDLALAVL TAYNDWHIDA WCGAYPDRFI PCSIPPLWDP QLMAKEIRRT AAKGSHAVSF SMNPYALGLP SLHSDHWDPF WAACEETETV VCVHIGSGAI GVVTAPDAPM NVEITCAAIK TFPTAADLVW SPIFQKFKNL KVALSEGGIG WIPYFLERAD YAYKQHRAWT RPELGGRLPS EIFRDHVVTC FIVDDFGVAN LDRMNEDMVT WECDYPHSDS TWPRSPEVVI DAVAGLTDLQ VDKITHRNAM QVYSFAPFSI RPRERCTVGA LRKEATGHDI SIVSQGVSER RLTTVGQFAQ AHQPGRTA
|
| |