Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6837 |
Symbol | |
ID | 5675150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8333395 |
End bp | 8335065 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245686 |
Product | amidohydrolase 3 |
Protein accession | YP_001511077 |
Protein GI | 158318569 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0874953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTGC TGCTGCGCCA CGCCTCGTTG ATCGACGGGA CGGGTGCCCC GGCCCGCGCC GCGGAGGTCG CCGTCAGCGG TGGCCGCGTG GTCGCCGTCG GTGCCCCCGG CGAGCTCACC CCGACGCCCG ACTGTGAGGT CGTCGACCTC GAGGGGCTCA CCCTCACCCC CGGCTTCATC GACGTGCACA CCCACTACGA CGCCCAGATT CTCTGGGACG GCGACCTCAC GCCGTCGAGC TGGCACGGCG TGACCAGCGT GATCATGGGC AACTGCGGCT TCGGGGTGGC GCCGACCAGG CCCGAGCACC GCGACATCAT CATGCGCCTC CTGGAGAACG TCGAGGGGAT GTCTCTCAGC GCACTTGACG CCGGCATCTC CTGGTCCTTC GAGACCTTCC CGGAGTACCT CTCCGCCCTC GACCAGCGGC CCAAGCGTCT CAACGTCGGC GCCTTCATCG GCCACTCGCC GCTGCGCGCG TTCGTGGTCG GTGGTGAGGA GCGCCCGGCG ACACCGGCCG AGCTCGAGCG GATGCGCGAG ATCGTGCGCG AGGCCCTCGA GGCCGGCGCC ATCGGGTTCT CCACCTCGCG CCAGCCCGCT CACCAGGGCG CCTACGGCCG TCCGGTGCCG AGCCGCTTCG CCGAGGTCGA CGAGGTCTAC TCGATGGTCT CGGTGCTCAG CGAGCTCGGG CGCGGCGTCG TCCAGGTCTC CATCGGCCCG GGCCTGTTCG TCGACCAGTT CTCCGAGCTG GCCACCCGCT TCGGCGTCCC GGTGACGTGG ACGGCGCTGG TCGCCCGCGC GGACAAGCCA GGCTCGGCGA TGCGCACCGT CGAGCGGGCC GGCGCACTGC CCGGCGAGGT CTACCCGCAG ATCGCCTGCC GCCCCATCGT CATGCAGATC ACGATGGACG ACCCGGTGCC GCTCGCCGAG ATCGACGAGT GGAAGGAAGC GCTGGCGCGG CCCCGCGAGG AGCGTGCCGA CCTCTACCGC GACGCCTCCT GGCGGGAGCG GGCCCGCCCC GCGACGCTGA GCGCGTGGAG CCACCGCTGG TCCAAGATCG ATGTCGAGGA GACCGGCGCC CATCACGACG TCGTGGGCAT CCCGCTGGAC CGGCTGGCGC AGCAGCGGGG TACCACCCCC TTCGACCTCA TGCTCGACCT CGCGCTCTCG GACAGCGTCC CCACCCGCTT CCGCGTGGTG CTAGAGAACG ACGGTGACGC CGAGATCGCC CAGCTGCTGG CGGACAAGCG CACCCTGCTC GGGCTCTCCG ACGCCGGCGC CCACGCCAAC CAGCTCTGCG ACGCCTGCTA CTCCACCCAC CTGCTCGGCC ACTGGGTGCG CGAGCGCGGC GCGATCTCGC TGGAGGACGC GGTCTGGCGC CTCACCGGTC ATCCCCACCA GGCCTTCCGG GTGGACGGCC GCGGGCTGGT CAAGGAGGGC TTCCACGCCG ACCTCGTCGC CTTCGACCCG GCCACGGTGG GCACGACCCC GGTCGAGCGG GTGTACGACC AGCCCGGCGG CGCCGACCGC CTGGTGGTGC GCAGCACCGG CATCGAGCAC GTGTGGGTCA ACGGCGTAGC CACCCGTTCG TACGGTAAGG ACATCCCCGG CGCCACGCCC GGCCGCCTGC TGCGCGCCCG GGGCACCGGG GAGGACACCC CGCAGTCCTG A
|
Protein sequence | MDLLLRHASL IDGTGAPARA AEVAVSGGRV VAVGAPGELT PTPDCEVVDL EGLTLTPGFI DVHTHYDAQI LWDGDLTPSS WHGVTSVIMG NCGFGVAPTR PEHRDIIMRL LENVEGMSLS ALDAGISWSF ETFPEYLSAL DQRPKRLNVG AFIGHSPLRA FVVGGEERPA TPAELERMRE IVREALEAGA IGFSTSRQPA HQGAYGRPVP SRFAEVDEVY SMVSVLSELG RGVVQVSIGP GLFVDQFSEL ATRFGVPVTW TALVARADKP GSAMRTVERA GALPGEVYPQ IACRPIVMQI TMDDPVPLAE IDEWKEALAR PREERADLYR DASWRERARP ATLSAWSHRW SKIDVEETGA HHDVVGIPLD RLAQQRGTTP FDLMLDLALS DSVPTRFRVV LENDGDAEIA QLLADKRTLL GLSDAGAHAN QLCDACYSTH LLGHWVRERG AISLEDAVWR LTGHPHQAFR VDGRGLVKEG FHADLVAFDP ATVGTTPVER VYDQPGGADR LVVRSTGIEH VWVNGVATRS YGKDIPGATP GRLLRARGTG EDTPQS
|
| |