Gene Information |
Locus tag | Franean1_2987 |
Symbol | |
ID | 5671371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3514229 |
End bp | 3515176 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241891 |
Product | amidohydrolase 2 |
Protein accession | YP_001507311 |
Protein GI | 158314803 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
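The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3514229, 3515176)
print(length)                     # 948
print(protein_length_aa(length))  # 315
```

Under these assumptions the results agree with the Gene Length (948 bp) and Protein Length (315 aa) fields.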
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.390637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTC CCGGGCGCAT CGACGTCCAC CAGCACTTCC TGCCACCTGG CTATGCGCGA TGGCTTGAAT CAGGCGGCAT CGCGCAGGTC GGGGGGCGGG CCATACCTGA CTGGGACGCA GACAAGGCAA TAGGCCTGAT GGACCAGTTC TCGATCTCGA CGGGCATCCT GTCGCTGTCG GCGCCGGGGG TGTATCTCGG TGACGGCACG GATGCTCGGA CGATGGCCAC GCAGGTCAAC GAGGCTGCCG CCGAGTGTGT CTGGGACCGG CCAGACCGGT TCGGATTCTT CGCCACGGTC CCGCTCCCCG ACGTCGACGG GGCGCTGGAG GCGGCTGCAC ACGCCTTCGA CTCCCTGAAC GCCGACGGCG TGTGCCTGCT AGCGAACTAT CAGGGGATTT ACCTCGGCGA GCCGGTGTTC GACCCGCTGA TGGCCGAACT CGACCGCCGA CACGCCGTGG TGCTTGTCCA CCCCGCCGAA CTACCCGGGC CGGCCGTGCC CAGCATTCCT CCGTTCGCCA CAGACTTCCT GCTCGACACC AGCCGAGCCG CGTTTAACCT TGTCCGGCAC GAGGTGCCTC GGCGCTACCC GAACATCACC TTTATCCTCG CCCACGCCGG CGGGTTCGTC CCCTACGCCT CGCACCGCGT CGCGGTCGGC GTGACCGCCG AGACGGGCCG AGATCCGTTC GAGGTACTTG AGGACCTGCA GGGCTTCTAC TTCGACACGG CGCTGTCGTC CAGCCCGGCT GCGCTGCCCA CCCTGCTCGC CTTCGCCAAA CCCGGCCACG TCCTGTTCGG CACCGACTGG CCCTTCGCGC CCGACATCGC CGTTGCTTAC TTCACCGGAC AGCTCGACAG CTACGACGAC CTCGACAGCG ACGCGCGCGT CGCGATCAAC CACGGCAACG TCCAAGCGCT GTTCCCCCGA CTCATAGACC AGGAGTGA
|
Protein sequence | MIRPGRIDVH QHFLPPGYAR WLESGGIAQV GGRAIPDWDA DKAIGLMDQF SISTGILSLS APGVYLGDGT DARTMATQVN EAAAECVWDR PDRFGFFATV PLPDVDGALE AAAHAFDSLN ADGVCLLANY QGIYLGEPVF DPLMAELDRR HAVVLVHPAE LPGPAVPSIP PFATDFLLDT SRAAFNLVRH EVPRRYPNIT FILAHAGGFV PYASHRVAVG VTAETGRDPF EVLEDLQGFY FDTALSSSPA ALPTLLAFAK PGHVLFGTDW PFAPDIAVAY FTGQLDSYDD LDSDARVAIN HGNVQALFPR LIDQE
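The gene and protein sequences above can be related to each other, and to the GC content field, with a small script. This is a minimal sketch, not the tool that produced the record. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it uses the codon assignments that translation tables 1 and 11 share. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Tables 1 and 11 share these
# assignments; they differ only in which codons may act as start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGATTCGTC CCGGGCGCAT CGACGTCCAC"
print(translate(first_blocks))  # MIRPGRIDVH
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.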