Gene Information |
Locus tag | Franean1_3688 |
Symbol | |
ID | 5672054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4365126 |
End bp | 4366310 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242571 |
Product | amidohydrolase 2 |
Protein accession | YP_001507991 |
Protein GI | 158315483 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
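The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4365126
END_BP = 4366310


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results match the Gene Length (1185 bp) and Protein Length (394 aa) fields in the record.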
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCC ACGACATTCC GGTGTTCGAC GCCGACAACC ACCTGTACGA GACCCAGGAC GCGCTCACCA AGTTCCTGCC GGCGCGCTAC CGGGGCGCGA TCGACTACGT CGACGTGCAC GGCCGGACGA AGATCGTCGT GCGCGGGCAG ATCAGCCAGT ACATCCCGAA CCCGACCTTC GAGGTCGTCG CCCGCCCGGG GGCGCAGGAG GACTACTACC GGCACGGCAA CCCCGAGGGG AAGACGTACC GGGAGATCTT CGGCAAGCCG GTGCGGTCCA TCGACGCCTG GCGGGAGCCG GCCGCGCGCA TCAAGGTCAT GGACGAGCAG GGGCTCGACC GCACCCTGAT GTTCCCCACG CTCGCCAGCC TGATCGAGGA GCGGATGCGC GACGACGCGG ACCTCGTCCA CGCCGTCATC CACTCGCTCA ACGAGTGGCT GTACGAGACC TGGCAGTTCA ACTACCAGGA CCGGATCTTC ACCACGCCGG TGATCACCCT GCCGATCGTG GAGAGGGCCG TCGAGGAGCT GGAGTGGGTC CTCGAGCGGG GCGCCCGGGT CATCCTCGTC CGGCCGGCGC CCGTGCCGGG CCTGCGCGGG CCGCGCTCGT TCGGCCTGCC GGAGTTCGAC CCGTTCTGGG CCCGCGTGCA GGAGGCCGAC ATCCTCGTCG CACTGCACTC GTCCGACAGC GGGTACGCCC GCTACAGCGG CGAGTGGATG GGCGCCAACC GCGAGATGCT GCCGTTCCAG CCGAACCCGT TCCAGATGCT GCAGGCATGG CGGCCGGTCG AGGACGCGGT TTCGGCGCTC GTCTGCCACG GCGCGCTCTC CCGCTTCCCC CGGCTGAAGG TGGCCGTCGT CGAGAACGGG ATGAGCTGGG TCGCCCCGCT GATGGACGCC ATGAAGAACC TGTACAAGAA GATGCCGCAC GACTTTCCCG AGAACCCGCT CGACGTGATC CGGCGCAACG TCTACGTCAG CCCGTTCTGG GAGGAGGACC TCGGCGCGCT GACCAAGGTC CTCGGTGAGG ACCACGTGCT GTTCGGGTCC GACTATCCGC ATCCGGAGGG GCTGGCGAAC CCGGTCAGCT ACATCGACGA GCTCGCCCAC CTGCCGGAAC CGCTCGTGCG CAAGCTCATG GGCGGCAACC TCGCCCAGCT CATGAAGGTC CCGGCCGCGG TCTGA
|
Protein sequence | MPTHDIPVFD ADNHLYETQD ALTKFLPARY RGAIDYVDVH GRTKIVVRGQ ISQYIPNPTF EVVARPGAQE DYYRHGNPEG KTYREIFGKP VRSIDAWREP AARIKVMDEQ GLDRTLMFPT LASLIEERMR DDADLVHAVI HSLNEWLYET WQFNYQDRIF TTPVITLPIV ERAVEELEWV LERGARVILV RPAPVPGLRG PRSFGLPEFD PFWARVQEAD ILVALHSSDS GYARYSGEWM GANREMLPFQ PNPFQMLQAW RPVEDAVSAL VCHGALSRFP RLKVAVVENG MSWVAPLMDA MKNLYKKMPH DFPENPLDVI RRNVYVSPFW EEDLGALTKV LGEDHVLFGS DYPHPEGLAN PVSYIDELAH LPEPLVRKLM GGNLAQLMKV PAAV
|
| |
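The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any analysis. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and the start-codon set is a deliberate simplification: it lists only ATG, GTG and TTG, not every initiation codon that translation table 11 permits.

```python
# Simplified assumption: common bacterial start codons only.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check length divisible by 3, a start codon, and a terminal stop codon."""
    return (
        len(dna) >= 6
        and len(dna) % 3 == 0
        and dna[:3] in START_CODONS
        and dna[-3:] in STOP_CODONS
    )


# Usage with the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGCCCACCC ACGACATTCC")
print(len(fragment))
print(gc_fraction(fragment))
```

Applied to the full Gene sequence field, `gc_fraction` gives a value that can be compared against the GC content reported in the record, and `looks_like_complete_cds` checks that the sequence starts with ATG and ends with the TGA stop codon.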