Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6846 |
Symbol | |
ID | 5675159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8345936 |
End bp | 8347141 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245695 |
Product | amidohydrolase 2 |
Protein accession | YP_001511086 |
Protein GI | 158318578 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.206422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGTTCA GCTGGTTCAT TTCGGTCGAC GACCACCTCA TCGAGCCGGC GCGGCTGTGG CAGGAGCGCC TGCCACAGCG CTGGCGCGAC ACCGGGCCCC GCATCGTGCG GGACGGGAAG TCGGAGTTCT GGGTCTACGA GGACCGCCAG ATCGTCACCA CCGGCCTGAA CGCCGTGGCG GGCAAGACCC GCGAGGAGTT CTCGCCGGAG CCGATCTCGT ACGACGACAT GCGCCCCGGC TGCTACGAGC CCGCGGCCCG GGTGGCCGAC ATGAACCAGG GCAACGTGCT GTCGTCGATC CTGTTCCCGT CGTTCCCGCG GTACTGCGGC CAGGTCTTCC ACGAGGCCAA GGACAAGGAG CTCGGGCTGC TCTGCGTCCA GGCGTGGAAC GACTTCATCC TGGAGGAGTT CGGCGCGGCC TACCCCGGCC GCTTCATCCC CATGATGATC ATTCCGTTGT GGGACCCGGT GGCAGCGGCG GCCGAGATCC GGCGGACGGC GGCCCGCGGC GGCCGGTCGA TCGCCTTCTC GGAGAACCCG ACCAAGCTCG GTCTCCCGTC GATCCACACC GACTTCTGGG AGCCGATGTT CGAGGCCTGC AACGAGACCG GCTACGTGAT CTCGATGCAC GTCGGGTCGT CGTCCAACCT GATCCGCACC TCGCCGGACA TGCCGACGCT GGCCTTCATG GCCTACTCGG CGGCGGCGAA CCAGGCCGGC ACGTTGCTGG ACTGGCTGTT CAGTGGCATT TTCGACCGGT TCCCGAACCT CAAGATCGCT CTTTCCGAGG GCTCGATCGG CTGGATTCCG TACTTCCTGG AGCGGGCTGA GCAGGTCATC GACAAGCAGC GGTTCTGGGC GTCGCGGTTC GATATCGACA TGAACGCCTC CCACGAGCGC GGTGAGGCCA AGGGCGAGGC GAAGTTCAAC CTCGACACCA ACATTCGCCA GCTCTTCGCC GACCACGTTT TCGGCACCTT CATCGAGGAC CAGGCCGGCG TCCGCCTGCT CGACATCATC GGTGAGGACA ATGTGATGCT CGAGTGCGAC TACCCGCACT CGGACTCCAC CTGGCCGGAC ACCGTGAAGC TGGCCGGCGG CTGGCTCGGG CACCTTTCCG ACGAGGTCCA GCACAAGATC ACGATCGGGA ACGCGGCCCG CGTCTACAAC TTCACGCCTG CTGACCCGGC GACCATCACG CTGTGA
|
Protein sequence | MSFSWFISVD DHLIEPARLW QERLPQRWRD TGPRIVRDGK SEFWVYEDRQ IVTTGLNAVA GKTREEFSPE PISYDDMRPG CYEPAARVAD MNQGNVLSSI LFPSFPRYCG QVFHEAKDKE LGLLCVQAWN DFILEEFGAA YPGRFIPMMI IPLWDPVAAA AEIRRTAARG GRSIAFSENP TKLGLPSIHT DFWEPMFEAC NETGYVISMH VGSSSNLIRT SPDMPTLAFM AYSAAANQAG TLLDWLFSGI FDRFPNLKIA LSEGSIGWIP YFLERAEQVI DKQRFWASRF DIDMNASHER GEAKGEAKFN LDTNIRQLFA DHVFGTFIED QAGVRLLDII GEDNVMLECD YPHSDSTWPD TVKLAGGWLG HLSDEVQHKI TIGNAARVYN FTPADPATIT L
|
| |