Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6547 |
Symbol | |
ID | 5674862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7962932 |
End bp | 7964074 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245396 |
Product | amidohydrolase 2 |
Protein accession | YP_001510790 |
Protein GI | 158318282 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.143744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCC GACTCGCCCG CACAGCGGTC CTGGCGATGC TGCGCGCGGG AACAGCAGAC CCCCGGCTCT CCCGCTGGCA GCCGCGGCCG GAGCTGCGGG TACCGGCGAC AACGGTGGGC CGCGCCCGCT GGCCGGCGGT GGACGCGCAC AACCATCTGG GCCGCTGGCT ATGCCGGGAC GGCGGATGGA TGGTTCCGGA CGTGGGCGCC TTCCTGGCGC TGATGGACGA GCTGAACATC ATGACAGTGG TGAACCTGGA CGGCCGGTGG GGCGCCGAGC TGGCGGCTAA CCTGGATCGG TACGACCGGG CGCACCCGGG TCGGTTCCTC ACCTACTGCC ATCTGGACTG GTCCCTGCTG CGATCGCGGC GGCCCAGCGC GGCACTGGTG GCATCGCTGC GCACGTCCGC GGACGCCGGC GCGCGCGGGC TGAAGATCTG GAAGGACCTG GGGTTGCGGG TACGAGACGC CCGTGGGCGA CGGGTGCTGC CGGACGACCC TCGGCTGTCG GAGATGTTCG ACGTCGCAGG GGAGCTTGGC CTGCCGGTGA TGATCCACGT CGGTGACCCG GTGGCCTTCT TCCGCCGTCC GGACCGGCAC AATGAACGTC TCGACGAGCT GCGGCGGCAT CCCACCGCTG CGCGTGCCGG CGTTTCCCGC AGATTCCCGG GCTGCGGCGG GCCGGGCCTG CCCCGGCTGG TCGAGGCGTT GGAGTCGACG GTGGCCGCAC ACCCGCGCAC CACGTTCGTG GCCGCCCACG TCGGCTGCCT TGCGGAGGAT CTCGGGCGGG TCGAACGGAT GCTCGACGCC CACCCGAACC TGGTGGTGGA CATCTCCGCC CGGCTCGCCG AGCTTGGCCG GCAGCCGCGG GTCGCCAACC GGTTCATCAC CCGTTTCGCG GATCGGGTGT TGTTCGGGAC GGACGCGTTC CCACCATCCG CCGATGGCTT CCGCGGGTAC TTCCGACTGC TCGAGACCGA CGACGACCAC TTCCCCTACT CGGCGGAATG GCCGCCGCCG CAGGGCCGGT GGGCCGTCTA CGGCCTGGGC CTCGAGCCCG ACGTATTGCG CCGGGTGTAC GGGACCAATG CGGCCCGGCT TCTCCCGGGG GTGCCCGATC CAGAGCACGC TGTCCACGTC TAA
|
Protein sequence | MTVRLARTAV LAMLRAGTAD PRLSRWQPRP ELRVPATTVG RARWPAVDAH NHLGRWLCRD GGWMVPDVGA FLALMDELNI MTVVNLDGRW GAELAANLDR YDRAHPGRFL TYCHLDWSLL RSRRPSAALV ASLRTSADAG ARGLKIWKDL GLRVRDARGR RVLPDDPRLS EMFDVAGELG LPVMIHVGDP VAFFRRPDRH NERLDELRRH PTAARAGVSR RFPGCGGPGL PRLVEALEST VAAHPRTTFV AAHVGCLAED LGRVERMLDA HPNLVVDISA RLAELGRQPR VANRFITRFA DRVLFGTDAF PPSADGFRGY FRLLETDDDH FPYSAEWPPP QGRWAVYGLG LEPDVLRRVY GTNAARLLPG VPDPEHAVHV
|
| |