Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6853 |
Symbol | |
ID | 5675166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8353720 |
End bp | 8355423 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245702 |
Product | amidohydrolase 3 |
Protein accession | YP_001511093 |
Protein GI | 158318585 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.130761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGACA TCGTGATCTA TGGCGGTGAG GTGGTCGACG GGACGGGTTC GCCGCCCCGT CGCGCGGACG TGGCGATCGA CGGGGAGCGC ATCACCGCGG TCGGTGAGGT CGGCGAGCGC GGCCGCCGCC AGATCGACGC CCGCGGGCGC CTGGTGACCC CGGGATTCGT CGACATCCAC ACCCACCTCG ACGCGCAGCT CTTCTGGGAC CCGGTGGCCA GCCCGTCCTC GTGGCACGGC GTGACCAGCA TCGTCCTGGG TAACTGCGGC GTCACGTTCG CGCCCGTGCG GCCCGGCGAG CAGCGTTACC TCGCCGAGAT GATGGAGTCC GTCGAGGACA TCCCGGCCGA CACGATCATG GACGGGATCG ACTGGAGCTG GGAGAGCTAC GGCGACTACC TGAAGGCGCT CGGCCGCCGG CAGCTCGGCG TCAACGTCGG CGGCATGATC GGCCACTGCG CGCTGCGCTA CTACGTCATG GGCGCGCGCA GCCTGGACGA GGCCCCCGCC ACCGACGAGG ACATCGCCAG GATGGCGGCG GTCGTCGGCG AGGCGATCGA CGGCGGCGCC CTCGGCTTCT CCACCTCCCG CAGCTTCATG CACACCGTGC CGGACGGCCG CGCCGTGCCG GGCACGTACG CCCTCGAGGC GGAGCTCGCC GCGATCGCCG GGGCCCTCGC CTCGCGCGGG CGCGGCACGA TCGAGGTCGT CCCCCGCATC GGCGAGCGGG ACGGACCGGA GCGGCAGAAC TCCGTCGCCG AGCTGGCCTG GATGGAGGAG GTGAGCCGCG CCTCGGGGCG CCCGCTCACC TTCGCGATCA TGCAGAGCGA CCGCCGGCCC GGCCTGTGGT CCTGGGTGAT GGACGAGGTG GCCGCCGCCC GCGGCCGCGG CGCCGACCTG CGTCCGCAGA CCGCGGCCCG GGGCAGCGGC ATCCTCTACG GGCTCGTCGG CCGCACGCCG TACGACGCGC TGCCGGGCTG GGCGAAGTTC ATGGAGCAGC CGTGGGCGGA GCGGCTGGCG GCGCTCCGCG ACGCCGAGGT GCGCCGCGCG CTGGTCGAGG AGGCGGAGAA CCCGGTCGAG CTGTCGGGCC CGCTGGCGCC GAAGGACCCG TCGAAGCTGT ATCTGCTCCC GCCCGGCCCG GCGCGGTACG ACGTCGACCC GGGTAACAGC CTGGCGGCCG AGGCGGCTCG CCGCGGGGTC AGCCCGGCGG CGGCGTTCCT CGCCTGCACC CTGGAGACCG ACGGCCGCGG GCTGCTCTAC TACCCGGTGC TCAACCAGGA CCTCGACGCC GTCGCCGCGA TGATCACGAA TCCGGACGTC GTCGTCGGCG TCGGGGACGC CGGCGCGCAC GTGGCGCTCA CCATGGACGC GGGCCAGCCC ACCTTCCTGC TGCGGCACTG GGTGCGTGAC AGGGGCCTGC TCGACGTCGG CACGGCGGTG CGCAAGCTGA GCTCCGAGGG GGCCGAGCTG TTCGGGCTCG CCGACCGGGG TGTCCTGAAG CCCGGCGCCT TCGCCGACGT CAACGTGATC GATCTTGATC GCCTCGACCT GGACACCCCC GAGATGCTCG CGGACTTCCC GCACGGGGCG AACCGGTTCG TCCAGCGTGC GCGGGGTTAC GACTACACCC TCGTCAACGG CTGTGTCCTG ATCGAGGGCG ATGAGTTGAC CGAGGAGCGT CCGGGACGGA TCGTGACCGC ATGA
|
Protein sequence | MHDIVIYGGE VVDGTGSPPR RADVAIDGER ITAVGEVGER GRRQIDARGR LVTPGFVDIH THLDAQLFWD PVASPSSWHG VTSIVLGNCG VTFAPVRPGE QRYLAEMMES VEDIPADTIM DGIDWSWESY GDYLKALGRR QLGVNVGGMI GHCALRYYVM GARSLDEAPA TDEDIARMAA VVGEAIDGGA LGFSTSRSFM HTVPDGRAVP GTYALEAELA AIAGALASRG RGTIEVVPRI GERDGPERQN SVAELAWMEE VSRASGRPLT FAIMQSDRRP GLWSWVMDEV AAARGRGADL RPQTAARGSG ILYGLVGRTP YDALPGWAKF MEQPWAERLA ALRDAEVRRA LVEEAENPVE LSGPLAPKDP SKLYLLPPGP ARYDVDPGNS LAAEAARRGV SPAAAFLACT LETDGRGLLY YPVLNQDLDA VAAMITNPDV VVGVGDAGAH VALTMDAGQP TFLLRHWVRD RGLLDVGTAV RKLSSEGAEL FGLADRGVLK PGAFADVNVI DLDRLDLDTP EMLADFPHGA NRFVQRARGY DYTLVNGCVL IEGDELTEER PGRIVTA
|
| |