Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4629 |
Symbol | |
ID | 5675741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5521926 |
End bp | 5523125 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243487 |
Product | amidohydrolase 2 |
Protein accession | YP_001508903 |
Protein GI | 158316395 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGGCCCG TCGTCGACGA GGAGGAGGTA GGAACCGTGG CAAAGATCTG GGCCAATTCG GGTGACTCGC ACCTTGTCGA GCCGGACGAT CTTTTCGACA GGAGCTTGCC CGGGTCCCTC GCGGCACGGA TGCCGCGCAG CGTCAAGGAC CTCGATGGCG GCTGGGAGAC GATCCACGTC GACGGACAGG AGTTCCGCCG CCGCCTGCCC CGGCCCGGCA GGAAACTCAC CGACGAGAAC GGCCACACCG TTCCCGAGCG GGCGCCCGGC GCCAACGACA GGAAGATGCG CCTGCTCGAC CTGGACTCCG AAGGCATCTG GGCCGAGCTC ATCTACCCCT CGCTGGGGAT GTGGACCTCA TCGATCCGTG ACCCGGAGCT GCTCGCGGCC GGCGCCCGAG CGATCAACGA CTGGGCCATT GAGTACCAGC GGTTCTCACC CCGCTACGTC AGCGCGGCGA CGATCCCCTT CCTCACCGTC TCCACCGCCG TCGCCGAGGT CACGCGGGCC GCGAGTCTCG GCTTCCATGC CGCGTCCCTG CCGGTCGTGC CACTCATCGA CGGGGACGAC TGGCATCGCG AGTCGTGGGA GCCGCTGTGG ACCGCCCTCG CGGAGACGGG TCTCGTGATC GCGTTCCACA TCGGCAGCGA GCCCCATGAG GCGTCCAGCC GTAACGGCAC CTACTACCGT GGTCCCGGCG GGGCGCTCCT GAACTACCTG GAGACGACCT ACGGCGGCCA GCGGGCGGTG GCCAAACTGA TCGCCGGCGG TGTGTTCGAC CGGCACCCGT CCTTGCGTGC AATCGTGTCC GAGGGCGGAG CGACCTGGGG GCCGTTCGTC GCCGACCGGC TCGACGAGGC GTACCGGCAG CACGCGTCGG CCGTCCACCC CCGTCTGCGT CGGCTCCCCA GCGCGTACCT GTACGAGAAC GTCTACGCGT CCTTCCAACA CGACCGCTCG GCCGTCGCCG CGGTGACGGC CATGGGCTGG CGCAATGTGT GCTGGGGCAG CGACTACCCG CACATCGAAG GCACCTTCGG TCACACCCAG AAGACACTCC AGGAGCTCGT CGGCGACCTC GACCCCGCCA CCCGGCACCG CATCACCCAG GGCGCCTTCC AGGAGCTGTT CCCCCACGTG CCGCCCGCTC CAGACGGGGA GTCATCGCCT GACCCCCTGC TCGATGCAGC GGCGAGGTGA
|
Protein sequence | MRPVVDEEEV GTVAKIWANS GDSHLVEPDD LFDRSLPGSL AARMPRSVKD LDGGWETIHV DGQEFRRRLP RPGRKLTDEN GHTVPERAPG ANDRKMRLLD LDSEGIWAEL IYPSLGMWTS SIRDPELLAA GARAINDWAI EYQRFSPRYV SAATIPFLTV STAVAEVTRA ASLGFHAASL PVVPLIDGDD WHRESWEPLW TALAETGLVI AFHIGSEPHE ASSRNGTYYR GPGGALLNYL ETTYGGQRAV AKLIAGGVFD RHPSLRAIVS EGGATWGPFV ADRLDEAYRQ HASAVHPRLR RLPSAYLYEN VYASFQHDRS AVAAVTAMGW RNVCWGSDYP HIEGTFGHTQ KTLQELVGDL DPATRHRITQ GAFQELFPHV PPAPDGESSP DPLLDAAAR
|
| |