Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2692 |
Symbol | |
ID | 5671083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3185301 |
End bp | 3186479 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241604 |
Product | amidohydrolase 2 |
Protein accession | YP_001507024 |
Protein GI | 158314516 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGACC AACGCCCCAT CGACGCCGAC AACCACTACT ACGAGCCGCT GGACGCGTTC ACCCGCCACC TCGACCCGGC GTTCACCCAA CGCGGCGTGC AGGTCCTGCA GAAGGGCAAG CGCGTCGTCG TGGTGATCGG CGGCCGGGTC AACACCTTCA TCCCCAACCC CACCTTCAAT CCGGTGACCA AGCCTGGCTG CCTCGACCTG TACTTCCGCG GGGTCATGCC CGAGGGGGTG AGCCGGCGGA CCCTGATGGA GGTCGAACCC CTGGCACCGG AGTACCGCGA CCGCGACGTG CGGATAGCCC GACTCGACGA GCAGGGCCTG GCCGGCGCGG TGCTGTACCC GACGATGGGC GTCGGAGTCG AGGAAGCGCT GCGTGACGAC GTCCCGGCGA CCATGGCCAG CCTGCACGCG TTCAACCGGT GGCTGGAGGA CGACTGGGGC TACTCCTATC AGGACCGCCT GTTCGCCGTG CCGCTGATCT CCCTGGCTGA TCCGCAGGCA GCGGTCGCCG AGGTGGAGCG GGTGCTCGGC CTCGGTGCGC GCATCGTCCA CGTCCGCCCC GCACCCGTGC CCGCACCGGG GACGGGAACC AGCGGGCGGT CGCTGGGCCA TCCCGCGCAC GACCCGGTGT GGGCGCGCCT CGCGGAGGCG GACGTACCGG TGGCGTTCCA CCTGGGGGAC AGCGGCTACC ACCGGATATC GGCGATGTGG GGCGGTTCGG CGACCCTGGA GGCGTTCGGG AAGACGAACG TCCTCGCCAA GATCGTCGTC GGGGAGCGGG CCATCCAGGA CACGATGGCC AGCCTCGTCG TCGACGGCGT GTTCGCCCGC CACCCGCGGC TGCGGGCGGT GAGCATCGAG AACGGCTCGT CCTGGGTGAA GCCGCTGCTG CGGCTGATGA AGAAGTACGC CAACCAGTCG CCGGAGAGCT TCTCCGGCAA CCCGGTCGAA GCGTTCACCG AGCACGTGTG GGTGGCGCCC TACTACGAGG ACGACATCGC CGGGCTGGTC GAGCTCATCG GCGCCGACCA CGTCCTGTTC GGATCGGACT GGCCGCACGC CGAGGGCCTG GCCGAACCGC TCCAGTTCGA CAAGGAGATC GAGTGCTTCG ACGCCACCAC CAAGGCTCGG ATCATGCGGG GCAACTCCGC CGCGCTCCTC GGGCTGTGA
|
Protein sequence | MTDQRPIDAD NHYYEPLDAF TRHLDPAFTQ RGVQVLQKGK RVVVVIGGRV NTFIPNPTFN PVTKPGCLDL YFRGVMPEGV SRRTLMEVEP LAPEYRDRDV RIARLDEQGL AGAVLYPTMG VGVEEALRDD VPATMASLHA FNRWLEDDWG YSYQDRLFAV PLISLADPQA AVAEVERVLG LGARIVHVRP APVPAPGTGT SGRSLGHPAH DPVWARLAEA DVPVAFHLGD SGYHRISAMW GGSATLEAFG KTNVLAKIVV GERAIQDTMA SLVVDGVFAR HPRLRAVSIE NGSSWVKPLL RLMKKYANQS PESFSGNPVE AFTEHVWVAP YYEDDIAGLV ELIGADHVLF GSDWPHAEGL AEPLQFDKEI ECFDATTKAR IMRGNSAALL GL
|
| |