Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6839 |
Symbol | |
ID | 5675152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8338922 |
End bp | 8340235 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245688 |
Product | amidohydrolase |
Protein accession | YP_001511079 |
Protein GI | 158318571 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00614522 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC GCATCTTGAT CAGGAACGCG AAGATACTCA CCTGCTCCGC GCCGACTCCG CGGGCCGTGC CCGGCGTGCC CGGCGGCGGT TCCGGCGCCG GGGCCAACGT CGGCACCGGG CCGGACGTCA TCGCCGACGG CGACCTCCTG ATCGAGGGCG ACCGGATCGC GCGGGTGCGG GCGGGGCGGA TCGAGGTCGA CTCCGGCTCG GCCCGCGTCA TCGACCTGCA CGGGGCGGCC GTCTTACCCG GCCTGGGAGA CGCGCACGTG CACATGAGCT GGCCGCTCGA CTTCGTCTTC GACCACGTCT CCGTCGCGAA CGCGCCGGCC GCGCCGCACG CGCTCGACGT CGCCGCCGTG GCCCGGACGT TCCTGGAGAG CGGCTACACG CTCGTCGTCG GGGCAGGGGT CTCCCAGCCG TTCGACGACG TGCGCACCAG GGACGCGATC GAGCGGGGCC TCATCCCCGG CCCGCGGGTC ATCCCCAGCG GCACGATGAT CACCGAGCGG GGTGCGATCA GCGCGGACAC CGGGATGACC TCGGTCGTCT CCGACGCCCG GGACCTCCGC GAGGTCGTCG CCCGCCAGTG CGACACCGGC GTCCGGGCGT TGAAACTGTT CGTCTCCGGG GACGGCATCG TCCCCGAGTA CCCCTCCGAC GACCTCTACA TGAACGACGA GATGCTGTCC GCGGCCGTCG ACGAGGCCGA CCGGTACGGC GCGTTCATCA CCGTGCACGC CCGCGGGTCG GACAGCGTCG CGATGGCGGC GCGCAACGGG GCCCGGGTCA TCCACCACGC CTGCTTCCTC GACGACAAGG CGGTGCACGA GCTGGAGGCC CGCCGGGACG ACGTCTGGGT GTGCCCCGGC CTGCACTACC TGTACGCGAT GGTCAGCGGC CACGCCGAGC CCTGGGGCGT CACCCCGGAG AAGATCGAGC GGTCGGGCTA CGAGAAGGAG TTCCGCGCCC AGGTCGAGGG CATCGGCATG CTGCGCGAGG CGGGCATCCG CATCCTGGCC GGCGGCGACT TCGGCCACCA GTGGACAAAA CACGGCACCT ACGCGGCCGA GCTGCAGCGT TACGTGGAGC TGGTCCACAT GTCGCCACAG GAGGCGATCA ACACGGCGAC CCGGAACATG GGCCCGCTGG TGGGCCTGGA CGTCGGCCAG ATCCGCGCGG GCTACCTCGC CGACCTGCTG ATCGTCGACG GCGACCCGCT CACCGACATC ACCGTGCTGC AGGACCCCGA CCGCCGTCGC GCGGTCGTCA AGGGCGGGCG GTTCGCCTAC GTCAACCCGC GGATGTTCCC ATGA
|
Protein sequence | MTERILIRNA KILTCSAPTP RAVPGVPGGG SGAGANVGTG PDVIADGDLL IEGDRIARVR AGRIEVDSGS ARVIDLHGAA VLPGLGDAHV HMSWPLDFVF DHVSVANAPA APHALDVAAV ARTFLESGYT LVVGAGVSQP FDDVRTRDAI ERGLIPGPRV IPSGTMITER GAISADTGMT SVVSDARDLR EVVARQCDTG VRALKLFVSG DGIVPEYPSD DLYMNDEMLS AAVDEADRYG AFITVHARGS DSVAMAARNG ARVIHHACFL DDKAVHELEA RRDDVWVCPG LHYLYAMVSG HAEPWGVTPE KIERSGYEKE FRAQVEGIGM LREAGIRILA GGDFGHQWTK HGTYAAELQR YVELVHMSPQ EAINTATRNM GPLVGLDVGQ IRAGYLADLL IVDGDPLTDI TVLQDPDRRR AVVKGGRFAY VNPRMFP
|
| |