Gene Information |
Locus tag | Franean1_0715 |
Symbol | |
ID | 5669131 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 831541 |
End bp | 832806 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239642 |
Product | amidohydrolase 2 |
Protein accession | YP_001505079 |
Protein GI | 158312571 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
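The coordinate, gene length, and protein length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions on the replicon and that the CDS ends with a single untranslated stop codon. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 831541
end_bp = 832806
stop_codon_nt = 3  # assumption: one terminal stop codon that is not translated

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Every codon except the final stop contributes one amino acid.
protein_length_aa = (gene_length_bp - stop_codon_nt) // 3

print(gene_length_bp, protein_length_aa)  # 1266 421
```

Under those assumptions the computed values agree with the listed Gene Length (1266 bp) and Protein Length (421 aa).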
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.548307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.202606 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATGG ACGACATGAT TCTGCTGAGC ATCGACGACC ACGTGATCGA GCCGCCGGAC ATGTACAAGA ACCATGTCCC GGCGAAGTGG CTCGATTCCG TGCCGAAGGT CGTCCGGAAC GAGGCCGGCG TCGACGAGTG GGTGTTCCAG GGCGAGAAGA CGTCCACACC GTTCGGTATG GCGGCGACCG TCGGCTGGCA CCGGGAGGAG TGGGGATTCA ACCCCGGCGC CTTCACCGAG TTACGTCCGG GCTGTTTCGA GGTCCACCAG CGGGTCCGCG ACATGAACGC CAACGGTGTC CTCGCCTCGA TGTGCTTCCC GACGATGGCG GGCTTCAACG CCCGCACGTT CTCCGAGGCC CTCGACAAGG ACCTCTCGCT CATCATGCTG CAGGCCTACA ACGACTGGCA CATCGACGAG TGGTGCGGCG CCTACCCGGG CCGGTTCATC CCCCTCGGCA TCGTGCCGAT GTGGGACGTC GAGCTCGCGG TGAAGGAGAT CCGGCGGATC GCCGCGAAGG GCTGCCGCTC CATCAGCTTC CTGGAGGCCC CCCACGCGCA GGGCTGGCCG AGCTTCCTCT CCGGCCACTG GGACCCGATG CTGCAGGCCC TCGTCGACGA GAACATGGTG CTCAGCCTGC ACATCGGCGG CGCCTGGGAC ATCGTCAAGC TCGCCCCCGA GGTGCCGATC GACCACATGA TCGTCATTCC GTCCCAGCTC ACCATGCTCA CCGCGCAGGA CCTGCTCTTC GGCCCGACAC TGCGGCGCTT CCCCGAGCTG AAGGTGGCCC TCTCCGAGGG TGGCATCGGC TGGATCCCGT TCTACCTGGA CCGCGTCGAC CGGCACTTCC AGAACCAGAG CTGGATCCAC AACGACTTCG GCGGCAAGCT GCCCTCCGAG GTGTTCCGGG AGCACTTCCT GGCCTGTTAC ATCACCGACC CGGCCGGGCT GCGCCTGCGC GAGCAGATCG GCATCGAGAC CATCGCCTGG GAGTGCGACT ACCCGCACAC CGACACGACC TGGCCCGAGT CACCCGAGCA CGCCTGGAAC GAGCTCCAGC AGGCCGGCTG CCGCGACGAC GAGATCCACC AGATCACCTG GGAGAACGCC AGCCGCTTCT TCGGCTGGGA CCCGTTCTCC CACACGCCGA GGGAGCAGGC CACCGTGGGC GCGCTGCGCG GGCTGGCCGC CGATGTCGAC GTCACCCGGA TGTCGCGCGA GGAGTGGCGC AAGCGCAACG AGGCCGCGGG AATCGGCGTC TTCTAA
|
Protein sequence | MHMDDMILLS IDDHVIEPPD MYKNHVPAKW LDSVPKVVRN EAGVDEWVFQ GEKTSTPFGM AATVGWHREE WGFNPGAFTE LRPGCFEVHQ RVRDMNANGV LASMCFPTMA GFNARTFSEA LDKDLSLIML QAYNDWHIDE WCGAYPGRFI PLGIVPMWDV ELAVKEIRRI AAKGCRSISF LEAPHAQGWP SFLSGHWDPM LQALVDENMV LSLHIGGAWD IVKLAPEVPI DHMIVIPSQL TMLTAQDLLF GPTLRRFPEL KVALSEGGIG WIPFYLDRVD RHFQNQSWIH NDFGGKLPSE VFREHFLACY ITDPAGLRLR EQIGIETIAW ECDYPHTDTT WPESPEHAWN ELQQAGCRDD EIHQITWENA SRFFGWDPFS HTPREQATVG ALRGLAADVD VTRMSREEWR KRNEAAGIGV F
|
| |
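The relationship between the Gene sequence and Protein sequence rows can be sketched with a small translation routine. This is a minimal illustration, not the pipeline that produced the record. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG and ending with TAA even though the gene lies on the - strand. The function names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino acid assignments for
# translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the Gene sequence row.
first_blocks = "ATGCACATGG ACGACATGAT TCTGCTGAGC"
print(translate(first_blocks))  # MHMDDMILLS
```

The printed residues match the first block of the Protein sequence row. Passing the full Gene sequence string to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.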