Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2859 |
Symbol | |
ID | 5671248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3374306 |
End bp | 3375382 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641241768 |
Product | amidohydrolase 2 |
Protein accession | YP_001507188 |
Protein GI | 158314680 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACG ACGCATTCGT CTTCAACGCG GTCGCGCACG CCTACGACCT TACCGACGAA AACACCCAGC CAAACCGGTA TGCCCACGGT TTACGGGACG CGCTCGTCAT GCTACACCGG GACTGGCAGC CGGGCATCGG TCTCGGCGAG TTCGAGCAGC GCACGGACTG GTCGATCGAG ACGCTGGCCA GAACGTTGTT TCTCGAGTCC GACGTGGACA TGGCGGCCAC GCATACGCTC CGACTCGACT CGTATTTCAA GGACGGTCTG TGCGCGCGAC ACAAGACCGT CGAAGCCGTG CGGCGGTGGC CGCACCGGTT CGTCGGCTAC GTGGGCGTCG ACCCGACCCT CGGACTCGAG ACCTGTCTGC GGGAGCTCGA CGAGCAGCTC GACGAGATGC CCGAAGCGGT GGGCCTGAAG CTCTACCCCG CGCAGGTCGA ACCCATCCGC AGCTGGCGTA TGGATGATCC AAAGCTCGCC TTCCCGCTGT TCGCCCGCGC CCAGGAACGC GGTCTCAGGA CGGTTGCCGT CCACAAGGCG TCGCCGCTCG GCCCAGTGCC GATGAACCCG TTCCACCTCG ATGACATCGA CAACGCCGCG GACGCCTTTC CCGATCTCTC GTTCGAGATC GTCCACGCCG GACTGGCGTT CGCCGAGGAG GCGGCACTCG CGATCGCCCG CTACCCGAAC GTCTACGCTA ACCTCGAGGT CACGTCCGTC CTCCTCACCA AGAGTCCGGG GGTGTTCGAG CAGACGCTGG CAAAGTTGAT CTTCTGGGGT GGCCCCAGTA AGCTGATCTA CTCTGACGGC AGCATGGTCT TTCACTCGCA GCCGATCATC AGGGCCCTCA GCGAGTTCTC CTTCAGCGAG GAGACCCTCG CCGCCTGGGG AATCCCGCAA CTTACCGCCG AGGACCGGGC GTTGATTCTT GGTGGCAACT ACGCCCGCAT ACTCGGGATC GACATCGAGG CCGCGAAGGC GCGGATAGCC GATGACGAGT TCGCCAGGGA AAAGGCCCGA ACGGGGATTC AGGCACCCTA CTCCAACTGG AAGGCCGCGC TCGAGGCCGC GGCATGA
|
Protein sequence | MINDAFVFNA VAHAYDLTDE NTQPNRYAHG LRDALVMLHR DWQPGIGLGE FEQRTDWSIE TLARTLFLES DVDMAATHTL RLDSYFKDGL CARHKTVEAV RRWPHRFVGY VGVDPTLGLE TCLRELDEQL DEMPEAVGLK LYPAQVEPIR SWRMDDPKLA FPLFARAQER GLRTVAVHKA SPLGPVPMNP FHLDDIDNAA DAFPDLSFEI VHAGLAFAEE AALAIARYPN VYANLEVTSV LLTKSPGVFE QTLAKLIFWG GPSKLIYSDG SMVFHSQPII RALSEFSFSE ETLAAWGIPQ LTAEDRALIL GGNYARILGI DIEAAKARIA DDEFAREKAR TGIQAPYSNW KAALEAAA
|
| |