Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4731 |
Symbol | |
ID | 5673073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5647311 |
End bp | 5648612 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243588 |
Product | amidohydrolase 2 |
Protein accession | YP_001509004 |
Protein GI | 158316496 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGT CGGACATGAT CCTGATCAGC ATTGACGATC ATTCGATCGA GCCTCCGGAC ATGTACAAGC GGCATGTTCC GGCGAAGTGG CTGGATCAGG CTCCGAAGGT CGTGCGCGGT TCGGAAGGCG CGGATCTCTG GGTGTTCCAG GGCCAGGCGA CCGCGACGCC GTTCGGCATG GCGGCGACCG TCGGCTGGCC TCGGGAGGAG TGGGGATTCC ATCCCGGCTC GATGTCCGAG ATGCGGCCGG GCTGTTTCGA CGTCCACGAG CGCGTCCGTG ACATGAACGT GAACGGCGTG CTGGCGTCGA TGTGCTTCCC GACAATGGCC GGCTTCAACG CCCGGACCTT CAGCGAGGCC GCCGACAAGC AGCTGTCCTT CATCATGCTC CAGGCGTACA ACGACTGGCA CATCGACGAG TGGTGCGCCA CCTACCCCGG CCGGTTCATC CCGCTGGGAA TCGTCCCCAT GTGGGACGTC GACCTGGCGG TCGCGGAGGT CCGGAGAATC GCTCGGAAGG GGTGCCGCGC GATCAGTTTC CTGGAGGCGC CGCACGGGCA GGGCTGGCCG AGCTTCCTGT CGGGCTACTG GGACCCGATG CTGAAGGCGA TCGTCGACGA GGACATGGTG CTCTGCCTGC ACATCGGCGG CGCCTGGGAC ATCGTGCAGC TGGCTCCCGA GGCGCCAGTC GACCATTCCA TCGTGATACC GTCGCAGCTG ACGATGCTCA CCGCGCAGGA CCTGCTGTTC GGTCCCACGC TGCGCCGGTT CCCGAAGCTG CGGGTCGCCC TCTCCGAGGG TGGAATCGGC TGGATCCCGT TCTATCTCGA CCGTATCGAC CGGCATTTTC AGAACCAGAC CTGGATCGAC GGCGACTTCG GCGGGAAGCT TCCGTCCGAG GTCTTCCGCG AGCACTTCCT GGCGTGTTTC ATCACCGACC CCGCCGGCCT CAAGATGCGG CACGATATCG GCGTCGACGT GATCGCCTGG GAGTGCGACT ACCCGCACAC CGACACGACC TGGCCGGAGT CGCCGGAGTT CGTGTGGAAC GAGCTGCAGA CCGCCGGCGT CCCGGATGCC GAGATTCACC AGATCACCTG GGAGAACAGC GCCCGCTTCT TCAACTGGGA CCCTTTCAGT CACGTCCCGA AGGAGCAGGC GACGGTGGGG GCGCTGCGCG CGCAGGCGGC TGACGTGGAC GTGACGCGGA TGTCGCGCGC GGAATGGCGC CGGCGCAACG AGGCCGCCGG AGTGGGCCGG GTGGAGGCCG ACGGGCTCCA GGACGTGGGC GGGCCTCGAT GA
|
Protein sequence | MQQSDMILIS IDDHSIEPPD MYKRHVPAKW LDQAPKVVRG SEGADLWVFQ GQATATPFGM AATVGWPREE WGFHPGSMSE MRPGCFDVHE RVRDMNVNGV LASMCFPTMA GFNARTFSEA ADKQLSFIML QAYNDWHIDE WCATYPGRFI PLGIVPMWDV DLAVAEVRRI ARKGCRAISF LEAPHGQGWP SFLSGYWDPM LKAIVDEDMV LCLHIGGAWD IVQLAPEAPV DHSIVIPSQL TMLTAQDLLF GPTLRRFPKL RVALSEGGIG WIPFYLDRID RHFQNQTWID GDFGGKLPSE VFREHFLACF ITDPAGLKMR HDIGVDVIAW ECDYPHTDTT WPESPEFVWN ELQTAGVPDA EIHQITWENS ARFFNWDPFS HVPKEQATVG ALRAQAADVD VTRMSRAEWR RRNEAAGVGR VEADGLQDVG GPR
|
| |