Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4454 |
Symbol | |
ID | 5672805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5320990 |
End bp | 5322171 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243322 |
Product | amidohydrolase 2 |
Protein accession | YP_001508738 |
Protein GI | 158316230 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACTGA TGGATCCGGC GCTGTTAGAC GAAATCAAGA TCATTGACAC GGACACCCAT GTGGTGGAGC CACCGGATCT GTGGACCTCA CGGGTGTCGG TGCGCAAGTG GGGTTCGCTG GTGCCGCACA TCCGGCCGGA CGCGTCGGGT GACCCGGCGT GGTTCGTCGG GGACCAGCGG ATGTTGGGAG TGGCGGCGGC GGCGATGGCG GGCTGGCACG AGTACCAGCC GGATCATCCG CTGCGGCTGG AGGACGCGGA CCGGGCGGCG TGGGATCCGG CGGCGCGACT CAAGCGGATG GACGAGGACG GCATCCACGC GCAGGTCCTC TATCCAAACG TGGCGGGCTT CGGCGGGGGC AACTTCACGA AGGTCGAAAA TCCGGATCTG ATGCTGGATC TGGTGCGCGC CTACAACGAC TTCCTGACAG ATTTCGCCGG TGTCGCGCCG GGCCGTTACA TCCCGATCAG CGCCGTTCCG TTCTGGGACC TGGAGCTGGC GGCCAAGGAG ATCGACCGGG TCGCGGCGGC CGGGCACAAG GGTCTGATCA TGACGGCGGC GCCCGAGAAC TGGGGCCAGC CGTTCCTGGA GGACCCTCAC TGGGACCCGC TCTGGGCGAA GGCCCAGGAG GTCGGCCTGC CGATCAACTT CCACATCGGG TCGGGGGACA TCTCCGCGTA CCCGACGCAC CCCGGTGGGA AGCACGCCAA CTCGGCTTCG CTCGCGGTCC TCAACTTCAT GGGTAACGCG GCGGCCATCG TCCGGGTGAT CTGTGGCGGT ATCTGCCACC GGTTCCCGGA GCTGAACATC GTCTCGGTGG AAAGCGGCGT GGGCTGGATC CCGTTCGCGC TCGAGGCGCT GGACTGGCAG TGGTACAACT GCGGTGTTCC GCAGGAGCAC CCGGAGTACG AGCTGTCGCC GAAGGAGTAC TTCCTGCGGC AGGTCTACGG GTGTTTCTGG TTCGAGCGGG ACACCGCGAT GAGCGCGATC AGCCAGGTCG GGGCGCGGAA CTTCATGTAC GAGACGGATT TCCCGCACCC GACGAGCATG ACGCCCGGCC CGGCGTCGAT CGCGACGACG CCGCGGGAGT ACCTGCTCGC CGCGATGGCC GATCTGCCGG ACGAGACCGT GCGACTGCTG TTGCAGGACA ACGCCGCCCG CATCTACCAT CTCGATCTCT GA
|
Protein sequence | MRLMDPALLD EIKIIDTDTH VVEPPDLWTS RVSVRKWGSL VPHIRPDASG DPAWFVGDQR MLGVAAAAMA GWHEYQPDHP LRLEDADRAA WDPAARLKRM DEDGIHAQVL YPNVAGFGGG NFTKVENPDL MLDLVRAYND FLTDFAGVAP GRYIPISAVP FWDLELAAKE IDRVAAAGHK GLIMTAAPEN WGQPFLEDPH WDPLWAKAQE VGLPINFHIG SGDISAYPTH PGGKHANSAS LAVLNFMGNA AAIVRVICGG ICHRFPELNI VSVESGVGWI PFALEALDWQ WYNCGVPQEH PEYELSPKEY FLRQVYGCFW FERDTAMSAI SQVGARNFMY ETDFPHPTSM TPGPASIATT PREYLLAAMA DLPDETVRLL LQDNAARIYH LDL
|
| |