Gene Information |
Locus tag | Franean1_4312 |
Symbol | |
ID | 5672667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5151241 |
End bp | 5152419 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243185 |
Product | amidohydrolase 2 |
Protein accession | YP_001508602 |
Protein GI | 158316094 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.537977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACGC GAGTGCCGCC GTACCCGATC TTCGACGCGG ACAACCACCT GTACGAGACC CAGGACGCGT TCACCAAGTA CCTGCCCGAG GAGTTCAAGG GCGCGATCCG GTACGTCGAG GTGGACGGCC GGACCAAGAT CGCCGTCCGG GGTCACATCA GCGAGTACAT CCCCAATCCG ACCTTCGAGG TCGTCGCCCG GCCCGGTGCG CAGGAGGACT ACTTCCGCAT CGGAAACCCG GAGGGCAAGT CCTACCGGGA GATCATCGGT AAGCCGATGC GCTCGATCCC GGCCTTCCGC GAGCCGAAGT CCCGCCTCGA GCTCATGGAC GAGCAGGGCA TCGAGCGGAC GCTGATGTTC CCGACGCTCG CGAGTCTGCT CGAGGAGCGG ATGAGCGACG ACCCGGAGCT GACCCACATC GTCATCCACG CGCTCAACGA GTGGCTGTAC GAGACGTGGC AGTTCAACTA CGAGAACCGG ATCTTCACCA CCCCGGTGAT CACGCTGCCG ATCGTGGAGA AGGCGATCGA GGAGCTGGAG TGGGTCGTCG AGCGCGGTGC GCGGGCGATC CTGATCCGTC CGGCGCCGGT CCCCGGGTTC AGCGGCTCGC GGTCGTTCGG CCTGCCGGAG TTCGACCCGT TCTGGGAGAA GGTCGTCGAG CACGACCTGC TGGTCACGCT GCACTCCTCG GACAGCGGCT ACAGCCGTTA CACCGACGAC TGGGAGAGCA ACAAGCGCGA GTTCCTGCCG TTCCAGCCGG ACGTGTTCCG GATGATCGGC CAGTGGCGGC CGATCGAGGA CTCGGTGGCC GCGCTGATCT GCCACGGCGC GCTGTCCCGG TTCCCGAGCC TGAAGATCGC CGTAGTCGAG AACGGCGCCT CCTGGGTCGG CCCGCTGATC AAGACCCTGA AGGACGTCTA CAAGAAGTAC CCGCAGCAGT TCGCCGAGGA GCCGGTGTCG GTGCTGCGGC GCAACGTGCA CATCAGCCCG TTCTGGGAAG AGAACATGGC CGACCTCGCC GAGCTGCTCG GCGTCGAGCG GGTGCTGTTC GGCTCGGACT TCCCGCACCC CGAGGGCCTC GGTGACCCGG CCAGCTTCGT CGACGAGCTC AAGAACCTCA CCGACGAGGA GAAGGCCCTG ATCATGGGCG GAAACCTGGC CAGGCTGATC TCGGGATGA
|
Protein sequence | MSTRVPPYPI FDADNHLYET QDAFTKYLPE EFKGAIRYVE VDGRTKIAVR GHISEYIPNP TFEVVARPGA QEDYFRIGNP EGKSYREIIG KPMRSIPAFR EPKSRLELMD EQGIERTLMF PTLASLLEER MSDDPELTHI VIHALNEWLY ETWQFNYENR IFTTPVITLP IVEKAIEELE WVVERGARAI LIRPAPVPGF SGSRSFGLPE FDPFWEKVVE HDLLVTLHSS DSGYSRYTDD WESNKREFLP FQPDVFRMIG QWRPIEDSVA ALICHGALSR FPSLKIAVVE NGASWVGPLI KTLKDVYKKY PQQFAEEPVS VLRRNVHISP FWEENMADLA ELLGVERVLF GSDFPHPEGL GDPASFVDEL KNLTDEEKAL IMGGNLARLI SG
|
| |
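The fields in this record can be cross-checked against one another. The following is a minimal sketch, not part of the source database: the helper names (`clean`, `span_length`, `gc_fraction`, `translate`) are hypothetical, and it assumes the gene sequence is listed 5' to 3' in the coding sense (the record gives the strand as "-", yet the listed sequence begins with ATG). It uses the codon assignments of translation table 11, which are the same as those of the standard code, and only the first 30 bases of the gene sequence for brevity.

```python
# Values copied from the record above.
START_BP = 5151241
END_BP = 5152419
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as listed in the record.
GENE_PREFIX = "ATGTCTACGC GAGTGCCGCC GTACCCGATC"

if __name__ == "__main__":
    # Coordinates and stated gene length should agree.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Codon count minus the stop codon should equal the protein length.
    print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)
    # The translated prefix should match the start of the protein sequence.
    print(translate(GENE_PREFIX))
```

Passing the full gene sequence from the record to `gc_fraction` and `translate` extends the same check to the GC content and protein sequence fields.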