Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3331 |
Symbol | |
ID | 5671703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3943078 |
End bp | 3944253 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242220 |
Product | amidohydrolase 2 |
Protein accession | YP_001507640 |
Protein GI | 158315132 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.550336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCAAGC TGGCTGACGG AATTCGAGTG GTCGACGCCG ACGCGCACAT GACCGAGCGC CACGACCTGT TCACCGAGAG GGCGCCGAAG GGCTACGAGG ACAGGGTCCC GCACGTCGAG CGGATCGACG GCGTCGACAT GTGGATCGTC GAGGGCAAGG CGTTCGGCAA GGCAGGCTCC GGCGGCACCG TCGACCACGA CGGGAAGAAG CACCCGTTCC GGGACTCCCA AGGCGGGTCC TGGGGCATCA ACGACGTGCA CCCCGCGGCG TGGGACCCGA AGGAGCGCCT GCGCCTGATG GATGAGCTCG GCATCCACAC GCAGGTCCTC TACCCCAACG CGATCGGCAT CGGCGGCCAG AACCTGCGGA ATTCGGTCCA GGACCCGATC GTTCTCCGGC TCTGCGTCGA GCTCTACAAC GACGCGATGG CGGAGGTCCA GGCGGAGTCG GGCAACCGGC TGCTCCCGAT GCCGATCATG CCCGCGTGGG ACGTCGAGGC CTGCGTCCGG GAGGCCGAGC GCTGCGCCGC CCTGGGCTAC CGCGGGGTCA ACATGACCGC CGACCCGCAG GACTCCGGCT CACCCGACCT GGGCGACACC GCCTGGGACC CGTTCTGGGA GGTCTGCGCC GGGAACAAGC TCCCCGTGCA CTTCCACATC GGGGCGAGCC AGACGGCGCT GTCCTACTTC GGCACGACCT ACTGGCCCAG CCAGGACGAC TACGTGAAGC CGGCGATCGG CGGCGCGTCG CTGTTCCAGA ACAACTCCCG GGTACTGCTC AACAGCGCCT ACTCCGGGAT GTTCGACCGT CACCCCGACC TGAAGATGGT CTCGGTCGAA AGCGGCATCG GCTGGGTGCC GTTCATGCTC GAGGCGATGG ACTACGAGCT TGAGGAGAAC GCACCGGAGT ACTTTCACAA GCTGCAGAAG CGGCCGTCGG AGTACTTCGC GTCGAACTGG TACGCGACCT TCTGGTTCGA GAAGGGCCGC GGCGACCTCC AGCACCTCAT CGACACCGTC GGCGAGGACA ACATCATGTT CGAGACGGAC TTCCCGCACC CGACCTGCCT GCACCCGGAC CCCCTCGGAA TCGTTGGCGA GACGATCGCC TCGCTGCGTC CCGAGACGCA GCGGAAGGTC ATGGGCGGCA ACGCGGTCAA GCTCTACCGC GTCTGA
|
Protein sequence | MVKLADGIRV VDADAHMTER HDLFTERAPK GYEDRVPHVE RIDGVDMWIV EGKAFGKAGS GGTVDHDGKK HPFRDSQGGS WGINDVHPAA WDPKERLRLM DELGIHTQVL YPNAIGIGGQ NLRNSVQDPI VLRLCVELYN DAMAEVQAES GNRLLPMPIM PAWDVEACVR EAERCAALGY RGVNMTADPQ DSGSPDLGDT AWDPFWEVCA GNKLPVHFHI GASQTALSYF GTTYWPSQDD YVKPAIGGAS LFQNNSRVLL NSAYSGMFDR HPDLKMVSVE SGIGWVPFML EAMDYELEEN APEYFHKLQK RPSEYFASNW YATFWFEKGR GDLQHLIDTV GEDNIMFETD FPHPTCLHPD PLGIVGETIA SLRPETQRKV MGGNAVKLYR V
|
| |