Gene Information |
Locus tag | Franean1_2359 |
Symbol | |
ID | 5670755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2803704 |
End bp | 2804918 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241276 |
Product | amidohydrolase 2 |
Protein accession | YP_001506697 |
Protein GI | 158314189 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
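The coordinate and length fields above are mutually consistent, and a few lines of arithmetic can check this. The sketch below is an illustration only. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2803704, 2804918)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1215 404
```

Under these assumptions the result agrees with the Gene Length (1215 bp) and Protein Length (404 aa) fields.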
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.140148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTATCC AGAACCCCGA GGCTCGTCCG GATCTCGGTT ATCCGTTGAT CGACGCCGAC AATCATTACT ATGAGCCGTA CGACTGCTTC ACCCGCCTCA TCGAACCGGC CTTCGCCGAC CGGGCCATCA ACGTTCGGGT CGATGCCAAG GGCCGCGGGA AGCTTTATTT CGGCGACCGG CAGTTCCGTT TCATGCGGGT GATCCAGACC GACTACATCG GCGCGCCGGG CTCGCTGCGC CAGATGCTGG ACGACCCCGA CAGCAAGGAC GGCTTCGTCC ACCGCGAGAT CATCCGCGGC TGGGACCACC CCGACATGAT GCAGCGCGAC GCGCGCATAG CGAAGATGGA CGAGCAGGGT GTCCAGGCGG GCCTGATGCT CGGCACGGCG ATGCTGCAGG CCGAGAACGA GCTGCACGAC GACGTCCCCG CGCTCTACGC GAACATCCGC GCCTACAACC GCTGGCTGGA CGAGGAGTGG GGCTTCAACC GGGACAACCG GATCATCACA GCGCCGATGA TCTCCCTGGT GGACGCCGAG CTGGCCACCG CCGAGATCGA GCGGGTCATC GCCGCCGGCG CCCGCGTCGT CGTCATGAAG CCCGGTCCGC TGTGGGGCCG CTCCCCGGTC GACCCCATGT ACGACGGGTT CTGGTCGCGG TTGCAGGAGG CGGACGTCAA GCTCGTCTTC CACAGCACCG ACCCGCGCTA CCTGGCCACG CTCGGCGTGC AGTTCGGCGA GTCGCCGACG CCGCCCCTGC AGGGCCAGAC GCCGTTCCAG TGGTACCTCG TCTCCGGCAA GCCGGTCGCC GACACACTCG CGTCCTACGT CCTGAACAAC CTGTTCGGAC GGTTCCCGCG GCTGACCGTC GTCGCGCTCG AGTGCGGCGT CAACTGGGTC GTCCCGCTGC TGCACGACGT CGACCACGCC GCGCACATGG GGCGCAAGGG GCACTGGCCG GGCGGTGAGG TGGTCGGCCG GCCGAGCGAG ATCCTCCTGG AGCACCTCTA CGTCTCGCCG TTCTACGAGG AGGACGTGGT CGGCCTGGTC GAGGCGATCG GCCCGGAGCG TGTGCTCTTC GGGTCCGACT ACCCGCACCC GGAGGGGGTC CTGTGGCCGG TGGAGTTCGC GGCCAAGCTG GACGGCCTCG ACGAGCGTTC GGTGCGCATG ATCATGCGCG GTAACGCCGC GCGCCTGCTC GGCATCGAGG ACTGA
|
Protein sequence | MSIQNPEARP DLGYPLIDAD NHYYEPYDCF TRLIEPAFAD RAINVRVDAK GRGKLYFGDR QFRFMRVIQT DYIGAPGSLR QMLDDPDSKD GFVHREIIRG WDHPDMMQRD ARIAKMDEQG VQAGLMLGTA MLQAENELHD DVPALYANIR AYNRWLDEEW GFNRDNRIIT APMISLVDAE LATAEIERVI AAGARVVVMK PGPLWGRSPV DPMYDGFWSR LQEADVKLVF HSTDPRYLAT LGVQFGESPT PPLQGQTPFQ WYLVSGKPVA DTLASYVLNN LFGRFPRLTV VALECGVNWV VPLLHDVDHA AHMGRKGHWP GGEVVGRPSE ILLEHLYVSP FYEEDVVGLV EAIGPERVLF GSDYPHPEGV LWPVEFAAKL DGLDERSVRM IMRGNAARLL GIED
|
| |
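The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the method used to produce the record. It assumes translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in which codons may initiate translation, so the first codon (GTG here) is read as M. It also assumes the input is a complete CDS whose first codon is a valid initiator. The helper names `translate_cds` and `gc_percent` are hypothetical.

```python
# Codon table in NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; the first codon is assumed to be an initiator (M)."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        residues[0] = "M"   # initiator codon is read as methionine
    if residues and residues[-1] == "*":
        residues.pop()      # drop the terminal stop codon
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGTATCC AGAACCCCGA GGCTCGTCCG"))  # MSIQNPEARP
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.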