Gene Information |
Locus tag | Franean1_0583 |
Symbol | |
ID | 5669000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 674962 |
End bp | 676218 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239510 |
Product | amidohydrolase 2 |
Protein accession | YP_001504948 |
Protein GI | 158312440 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
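The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(674962, 676218)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both results agree with the Gene Length (1257 bp) and Protein Length (418 aa) rows of the record.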
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.972073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCG ACGACTTGAT TCTGGTGAGC ATCGACGACC ACGTGGTCGA ACCAGCGGAC ATGTTCAAGA ACCACCTGCC GGCGAACCTG GCCGACCAGG CGCCGCACGT CGAGACCGAC GAGTCCGGCG TGGACCGGTG GATCTACCAG GGCCGGGTCA CCGGGGTCAG CGGCCTCAAC GCCGTGATCA CCTGGCCGCC GGAGGAATGG GCCAAGGACC CGGCCGGCTT CGCCGAGATG CGCCCCGCCG TCTACGACAT CCACGACCGG GTCCGGGACA TGGACCGTAA CGGGATCCTG GCGTCGATGT GCTTCCCGAC GTTCGCCGGG TTCAGCGCCG GGCATCTCAA CCATTTCAAG GATCCCCTCA CCGTCATAAT GATCCAGGCG TACAACGACT GGCACATCGA CGAGTGGGCC GGCACCTACC CGGGCCGGTT CATCCCGCTG GCGCTGCTCC CGACCTGGGA CCCGCAGCTG ATGGTGAACG AGATCCGCCG GGTGGCGGCG AAGGGCTGCC GGGCGGTCAC CATGCCCGAG CTGCCGCACC TCGAGGGCCT GCCCAGCTAC CACAACCTCG ACTTCTGGGC TCCGGTGTTC GAGGCGCTGT CCGACACCGG GATGGTGATG TGCCTGCACA TCGGAACCGG GTTCGGCGCG CTCAAGCTCG CCCCGGACGC GCCGATCGAC AACCTGATCA TCCTGGCGTG CCAGATCTCC TCGCTGGCCG TGCAGGACCT GTTGTGGGGC CCGGCGATGC GGACCTACCC GGACCTGAAG TTCGCCTTCT CCGAGGGCGG CATCGGCTGG ATCCCGTTCT ACCTGGACCG CTGCGACCGG CACTACACCA ACCAGCGCTG GCTGCGCCGC GACTTCGGCG GCAAGCTGCC CAGCGAGGTG TTCCGCGACC ACTCACTCGC CTGCTACGTC ACCGACCCGA CGTCGCTGAA GCTGCGCCGT GAGATCGGGA TCGACATCAT CGCCTGGGAG TGCGACTACC CGCACGCCGA CTCGATCTGG CCCGAGGCGC CGGAGTTCGT GCTCAACGAG CTGAACAACG CCGGTGCGAC CGACGAGGAG ATCGACAAGA TCACCTGGCG GAACGCCTGC CGGTTCTTCA ACTGGGACCC GTTCTCCGAG ATCCCCAAGG AGCGCGCGAC CGTCGGCGCC CGTCGCGCGA TCGCGACCGA CGTCGACACC ACCATCCGCT CCCGCAAGGA ATGGGCCCGC CTCTACGCGC AGCGACAGAC CACCTGA
|
Protein sequence | MNVDDLILVS IDDHVVEPAD MFKNHLPANL ADQAPHVETD ESGVDRWIYQ GRVTGVSGLN AVITWPPEEW AKDPAGFAEM RPAVYDIHDR VRDMDRNGIL ASMCFPTFAG FSAGHLNHFK DPLTVIMIQA YNDWHIDEWA GTYPGRFIPL ALLPTWDPQL MVNEIRRVAA KGCRAVTMPE LPHLEGLPSY HNLDFWAPVF EALSDTGMVM CLHIGTGFGA LKLAPDAPID NLIILACQIS SLAVQDLLWG PAMRTYPDLK FAFSEGGIGW IPFYLDRCDR HYTNQRWLRR DFGGKLPSEV FRDHSLACYV TDPTSLKLRR EIGIDIIAWE CDYPHADSIW PEAPEFVLNE LNNAGATDEE IDKITWRNAC RFFNWDPFSE IPKERATVGA RRAIATDVDT TIRSRKEWAR LYAQRQTT
|
| |
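The sequence fields are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content row and the basic shape of the CDS could be re-derived from the Gene sequence field. The helper names (`clean`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons. The stop codons TAA, TAG and TGA are those of table 11.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: whole codons, ATG start, and a terminal stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in stops


# Toy input for illustration only; it is not taken from this record.
toy = "ATGGCC GGCTGA"
print(round(gc_fraction(toy), 2))    # 0.67
print(looks_like_complete_cds(toy))  # True
```

To apply this to the record, pass the full Gene sequence value as the argument; the result of `gc_fraction` can then be compared with the reported GC content.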