Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0094 |
Symbol | |
ID | 5668519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 111434 |
End bp | 112696 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239022 |
Product | amidohydrolase 2 |
Protein accession | YP_001504467 |
Protein GI | 158311959 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.986217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCG ACGACATGGT CTTGGTGAGC ATCGACGACC ACGTGGTCGA GCCGCCCGAC ATGTTCAAGA ACCACGTCCC GGCGAACCTG GTGGACCAGG CGCCGCACGT CGTGCGCAAC GACAAGGGCG TGGACCAGTG GATCTACCAG GGCCGGGTGA CGGGCGTCAG TGGCCTGAAC GCGGTCGTGT CGTGGCCGGC GGAGGAGTGG GGCAAGGACC CGGCCGGCTT CGCCGAGATG CGCCCGGGGG TGTACGACAT CCACGACCGG GTCCGGGACA TGGACCGTAA CGGGATCCTG GCGTCGATGT GCTTCCCGAC GTTCGCCGGG TTCTCCGCCG GGCATCTCAA CCACTACAAG ACCGACACCA CGGTCACGAT GGTCCAGGCG TACAACAACT GGCACATCGA CGAGTGGGCC GGCACCTACC CGGGCCGGTT CATCCCGCTG GCGCTGCTCC CGACCTGGGA CCCGCAGCTG ATGGTGAACG AGATCCGCCG GGTGGCGGCG AAGGGCTGCC GGGCGGTCAC CATGCCCGAG CTGCCGCACC TCGAGGGCCT GCCCAGCTAC CACAACCTCG ACTTCTGGGC TCCGGTGTTC GAGGCGCTGT CCGACACCGG GATGGTGATG TGCCTGCACA TCGGAACCGG GTTCGGCGCG CTCAAGCTCG CCCCGGACGC GCCGATCGAC AACCTGATCA TCCTGGCGTG CCAGATCTCC TCGCTGGCCG TGCAGGACCT GTTGTGGGGC CCGGCGATGC GGACCTACCC GGACCTGAAG TTCGCCTTCT CCGAGGGCGG CATCGGCTGG ATCCCGTTCT ACCTGGACCG CTGCGACCGG CACTACACCA ACCAGCGCTG GCTGCGCCGC GACTTCGGCG GCAAGCTGCC CAGCGAGGTG TTCCGCGACC ACTCGCTCGC CTGCTACGTC ACCGACCCGA CGTCGCTGAA GCTGCGCCGT GAGATCGGGA TCGACATCAT CGCCTGGGAG TGCGACTACC CGCACTCGGA CTCGATCTGG CCGGACGCGC CGGAGTTCGT GCTCAACGAG CTGAACAACG CGGGTGCGAC CGACGAGGAG ATCGACAAGA TCACCTGGCA GAACGCCTGC CGGTTCTTCA ACTGGGACCC GTTCTCCGAG ATCCCCAAGG AGCGCGCGAC CGTCGGCGCC CGCCGGGCCA TCGCCACCGA CGTCGACACC GCCATCCGCT CCCGCAAGGA ATGGGCCCGC CTCTTCGCGG AGAAGCACCC CGAGACCATC TGA
|
Protein sequence | MNIDDMVLVS IDDHVVEPPD MFKNHVPANL VDQAPHVVRN DKGVDQWIYQ GRVTGVSGLN AVVSWPAEEW GKDPAGFAEM RPGVYDIHDR VRDMDRNGIL ASMCFPTFAG FSAGHLNHYK TDTTVTMVQA YNNWHIDEWA GTYPGRFIPL ALLPTWDPQL MVNEIRRVAA KGCRAVTMPE LPHLEGLPSY HNLDFWAPVF EALSDTGMVM CLHIGTGFGA LKLAPDAPID NLIILACQIS SLAVQDLLWG PAMRTYPDLK FAFSEGGIGW IPFYLDRCDR HYTNQRWLRR DFGGKLPSEV FRDHSLACYV TDPTSLKLRR EIGIDIIAWE CDYPHSDSIW PDAPEFVLNE LNNAGATDEE IDKITWQNAC RFFNWDPFSE IPKERATVGA RRAIATDVDT AIRSRKEWAR LFAEKHPETI
|
| |