Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3915 |
Symbol | |
ID | 5672276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4681202 |
End bp | 4682377 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641242794 |
Product | amidohydrolase 2 |
Protein accession | YP_001508211 |
Protein GI | 158315703 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.130175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0967044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATAC CTTTAAAAGA GAGCGGCCTG TTCATCGTCG ATGCGGACTC GCACTGGTCA GAGCCCCCGG ATCTGTTCAC CAGGCGCGCA CCGGCTGCAT ACCGTGACCG GGTGCCCCGC GTGGAGGATG TCGACGGCAC CCCGATGTGG GTCTTTGACG GTAAGCCTGT CGGGCGTTTC AGCGCAGCCG GTGTCATCGG CCGCGACGGC CGCAAGGAGA GCGCGGACAC CGCGCTGCAC CACTGGACGA TCGACCAGGT CCACGTCGGC GCCTACGACC CGGCCGTCCG CCTGGGCGTG CTCGACGAGT GCGGCATCGA CGCGCAGATC ATCTTTCCGA GCACGATCGG CCTCGGCGGC CAGGACCTGG GCGCGGCCAG TGACCCCGCG CTGACCCGGC TCTCGGTCGA GATCTACAAC GACGCCATGG CCGAGATCCA GAGCGACTCG GGGAACCGGT TGTTACCCCT GCCGCTCATG CCGGCCTGGG ACGTCGACCT GTGCGTCGCA GAGGCCCGCC GGGTCCACGC CCTCGGCGCA CGCGGGGTCA ACATGACCTC GGACCCGCAG GACCTGGGCG CCCCCGACCT CGCCAACCCG GCCTGGGACC CGTTCTGGGA GGTGTGCACA GAGCTGCAGC TGCCGGTCCA CTTCCACATC GGGGCCAGCG TGACCACGAT GACGTTCTAC GGTAAGTACC CGTGGCCGTC ACACGACAAC AACACCAAGC TCGCCATCGG CGGCACACTG CTGTTCATCG GGAACGCACG GGTGGTGACG AATCTGATCC TCTCCGGGAT CTTCGACCGG CACCCCGACC TGAAGACGGT GTCGGTGGAG AGCGGTGTCG GCTGGATCCC GTTCATCCTC GAGGCGTTGG ACTACGAGAT GTCCGAGAAT GCGCCCGAGG AGCTCGGCCG GATGCGCAAG CCGCCGTCCG AGTACTTCCG GAGCAACATC TACGCCACGT TCTGGTTCGA GAAGAACAGG AACAAGCTCC CGGCACTGAT CGACGCGGTC GGCGAGGACA ACATCCTCTT CGAGACGGAC TTCCCCCACC CAACCTGCCT CTACCCTGAC CCGCTGGGCA CCGTCGAGCC GAAGCTGGCC ACGTTGTCTC CGCAGGCCCG GGCGAAGATC CTGGGAGAGA ACGCCCGCAG GCTGTACCGC CTCTGA
|
Protein sequence | MSIPLKESGL FIVDADSHWS EPPDLFTRRA PAAYRDRVPR VEDVDGTPMW VFDGKPVGRF SAAGVIGRDG RKESADTALH HWTIDQVHVG AYDPAVRLGV LDECGIDAQI IFPSTIGLGG QDLGAASDPA LTRLSVEIYN DAMAEIQSDS GNRLLPLPLM PAWDVDLCVA EARRVHALGA RGVNMTSDPQ DLGAPDLANP AWDPFWEVCT ELQLPVHFHI GASVTTMTFY GKYPWPSHDN NTKLAIGGTL LFIGNARVVT NLILSGIFDR HPDLKTVSVE SGVGWIPFIL EALDYEMSEN APEELGRMRK PPSEYFRSNI YATFWFEKNR NKLPALIDAV GEDNILFETD FPHPTCLYPD PLGTVEPKLA TLSPQARAKI LGENARRLYR L
|
| |