Gene Information |
Locus tag | Franean1_4034 |
Symbol | |
ID | 5672392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4809715 |
End bp | 4810905 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242910 |
Product | amidohydrolase 2 |
Protein accession | YP_001508327 |
Protein GI | 158315819 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.182386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA AACTGACCTT CCCGGTGTTC GACGCGGACA ACCATCTCTA CGAGACACGT GACGCCTTCA CCCGGCATCT CCCGGACCGT TACCGCGGCG CCGTCGACTA TGTCGAGGTC CGTGGCCGGA CGAAGATCGC CATCCGTGGC CAGATCAGCG ATTACATCCC GAATCCGACG TTCGACGTGG TGGCGCGGCC CGGCGCTCAG GAGGAGTACT TCCGTGTCGG GAACCCCGAG GGGAAGTCCT ACCGCGAGAT CGTCGGGGAC CCGATGCGCA GCATTCCGGC GTTCCGTGAG CCGGCGCCGC GCATCCAGCT CATGGACGAG CTCGGCGTCG ACCGTGCGCT GATGTTCCCG ACGCTCGCGA GCCTCCTCGA GGAGCGCATG CGGGACGACC CGGACCTGAC CCATGCCGTG GTCCACTCGC TCAACGAGTG GATTCACGAG GTCTGGTCGT TCAACTACGA GAACCGCATC TTCACCACTC CGGTGATCAC CCTGCCCATC GTCGAGAAGG CGATCGCGGA ACTGGAGTGG GCGGCCGAGC GCGGCGCGCG GGCCGTCCTG ATCCGCCCGG CGCCGGTGCC GGGCCTGCGC GGCTCGCGCT CGTTCGCGCT CCCCGAGTTC GACCCGTTCT GGCAGGCGGT GGTGGACGCC GACATCCTGG TGGCGATGCA CTCCTCCGAC AGCGGCTACT CGCGGTACCA GAGTGACTGG ACCGGCCCGC AGGAGATGCT GCCGTTCCGC CCGGACCCGT TCCGCATGGT CACGATGGGG CACCGGCCGG CCGAGGACTC GATGACGGCG TTCGTCTGCG GCGGCGTCTT CGCCCGGTTC CCGAAGCTGC GGGTCGCCTC GATCGAGGCC GGCGGCGACT GGGTGGTGCC GCTGCTCGAG CACCTCGCCG ACGCTTACAC GAAGATGCCG CACGGTTTCG ACGAGGACCC GGTGGAGGCG TTCCGGCGCA ACATCTACAT CAGCCCGTTC CACGAGGACA ACATCGAGAA GCTCGTCGAG GCCATCGGTG TGGACCACGT CCTCTTCGGT TCTGACTATC CGCACCCGGA GGGCCTCGCC GAGCCGTGCA GCTACGTCGA CCACCTGCCG GCCGGCATGC CGGAGGAGGA CGTCAGGAAG ATCATGGGCG AGAACCTCGC CCGTCTCACC GGAATCGCCG TCCCGGTCTG A
Protein sequence | MAKKLTFPVF DADNHLYETR DAFTRHLPDR YRGAVDYVEV RGRTKIAIRG QISDYIPNPT FDVVARPGAQ EEYFRVGNPE GKSYREIVGD PMRSIPAFRE PAPRIQLMDE LGVDRALMFP TLASLLEERM RDDPDLTHAV VHSLNEWIHE VWSFNYENRI FTTPVITLPI VEKAIAELEW AAERGARAVL IRPAPVPGLR GSRSFALPEF DPFWQAVVDA DILVAMHSSD SGYSRYQSDW TGPQEMLPFR PDPFRMVTMG HRPAEDSMTA FVCGGVFARF PKLRVASIEA GGDWVVPLLE HLADAYTKMP HGFDEDPVEA FRRNIYISPF HEDNIEKLVE AIGVDHVLFG SDYPHPEGLA EPCSYVDHLP AGMPEEDVRK IMGENLARLT GIAVPV
| |
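The length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; both assumptions are consistent with the values above but are not stated in the record. The function names are hypothetical helpers written for illustration, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring spaces and any non-ACGT characters."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table above.
gene_len = feature_length(4809715, 4810905)
print(gene_len)                  # 1191
print(protein_length(gene_len))  # 396
```

Passing the Gene sequence field to `gc_percent` (the spaces between 10-base blocks are ignored) gives a value that can be compared with the rounded GC content reported in the record.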