Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0619 |
Symbol | |
ID | 5669036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 720249 |
End bp | 721457 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239546 |
Product | amidohydrolase 2 |
Protein accession | YP_001504984 |
Protein GI | 158312476 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.6071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCAT CACAGTCGCT GGACTGGCTG ATCTCGGTCG ACGACCACGT CCTGGAGCCG CCGAACCTGT GGACCGACCG GCTCCCGGCC AAGGACCACG ACCGGGCTCC CCACATGGTG ATCGACGACA CGGGAATGGA CTGCTGGGTC TACGACGGCA AGCGTTTCCC GAGCTCCGGG CTGAGCGCCG TCGCCGGGAA GGAGAAGGAG GAGTTCAGCC CCGAGCCCCT CTCCTATGCC GACATGCGGC CCGGTTGCTA CGACCCGCAG GCCCGCCTGG AGGACATGAA CCGGGCCGGC ATCCTGGCCT CGCTGTGCTT CCCGACGGTG ACCCGGTTCT GCGGGCAGAT GTTCTCCGAG GCGAGCGACC GCGAGTTCGG CCTGGTGTGC CTGAAGATCT ACAACGACTG GATGATCGAG GAGTGGTGCG GCAGCGCTCC CGGCCGCTAC ATCCCGCTCA CCCTCATCCC GCTGTGGGAC CCGCAGCTCG CGGTGAAGGA GCTCGAGCGC TGCGCGGCGA AGGGGTCCAC CACCTTCGCC TTCTCGGAGA ACCCGGCCCC GCTGGGCCTG CCGACCATCC ACGACCGCGA CGGGTACTGG GAGCCGGTGA TGGCTGCCGC GAACGACCTG GAGATGGTCG CGTCGATGCA CGTCGGCTCC TCGTCGCAGG TGCCGAAGAT CGCTCCCGAC GCGCCGTTCA TGGCGAACCT GACCTGGGGC GCGATGCGTA CCTCGGGCGC CATGCTCTCC TGGCTGTTCA GCGGGATGTT CCAGCGGTAC CCGAAGCTGA AGATCGCGCT CTCGGAGGGC GAGATCGGCT GGATGCCGTA CTACCTGGAG CGCGCCGAGC AGGTGATCGA CAAGCAGCGC CACTGGGTCA AGCGTGGTGT CCGCTTCAAC GAGCACGCCG GGGCGGACGC CCTCGACCTG GACACCCTCG ACATCCGCGC CAGCTTCCGT GAGCACGTCT TCGGTTGCTT CATCGACGAC GCGCACGGCA TCGCCAGCAT CGACGAGATC GGCGAGGACA ACATCATGTG CGAGACGGAC TACCCGCACT CCGACTCGAC CTGGCCGAAC TGCATCGACG TCGTCAGGAA CCGGATCGGC CACCTGTCGG AGGAGGTCCA GTACAAGATC CTGCGCGGCA ACGCCGAGCG GCTGTACCGG TTCACCCCGG CCGAGCCGCC TGTGCTCGCG AAGGCCTGA
|
Protein sequence | MTSSQSLDWL ISVDDHVLEP PNLWTDRLPA KDHDRAPHMV IDDTGMDCWV YDGKRFPSSG LSAVAGKEKE EFSPEPLSYA DMRPGCYDPQ ARLEDMNRAG ILASLCFPTV TRFCGQMFSE ASDREFGLVC LKIYNDWMIE EWCGSAPGRY IPLTLIPLWD PQLAVKELER CAAKGSTTFA FSENPAPLGL PTIHDRDGYW EPVMAAANDL EMVASMHVGS SSQVPKIAPD APFMANLTWG AMRTSGAMLS WLFSGMFQRY PKLKIALSEG EIGWMPYYLE RAEQVIDKQR HWVKRGVRFN EHAGADALDL DTLDIRASFR EHVFGCFIDD AHGIASIDEI GEDNIMCETD YPHSDSTWPN CIDVVRNRIG HLSEEVQYKI LRGNAERLYR FTPAEPPVLA KA
|
| |