Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7100 |
Symbol | |
ID | 5675409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8668598 |
End bp | 8669887 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245943 |
Product | amidohydrolase 2 |
Protein accession | YP_001511334 |
Protein GI | 158318826 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACG ACGACATGAT CCTTGTCAGC ATCGACGATC ACATCATCGA ACCGCCCGAC ATGTTCGCGG ACCACCTCCC CGAGCGGTAC AAGCAGGACG CGCCGCACGT GGTGCGCCTG CCTGGTGGCG CCGACGCGTG GAAGTTCCGG GACACGGTGA TCCCCAACGT CGCCCTCAAC GCGGTGGCCG GCCGACCCAA GGAGGAGTAC GGACTCGAAC CACAGGGCTT AGACGAGATC CGTCCGGGGT GCTACCAGGT GGACGAGCGG ATCAAGGACA TGAACGCGGG CGGCATCCTC GCGTCGATGA ACTTCCCGTC GTTCCCGGGG TTCGCGGCCC GGCTGTTCGC CACCGAGGAC CCAGACTTCT CCCTCGCGCT GGTGCGGGCG TACAACGACT GGCACCTCGA CGAATGGTGC GGCGCGCACC CAGGCCGGTT CATCCCGATG GCACTGCCCG TCATCTGGGA CGCCGAGCTG TGCGCGGCCG AGGTACGCCG GGTCGCCGCG AAGGGATGCC ACTCGCTGAC CTTCACCGAG AATCCCGCGG CGCTGGGATA CCCCAGCTTT CACGACGCCT ACTGGAACCC GTTGTGGCAG GCGGTGTGCG ACACGAACAC CGTGCTGTCG ATCCACATCG GCTCGTCAGG CCAGCTGACC ATCCCCGCTC CCGACTCGCC GCCCGACGTG CTGATCACAC TCCAACCGAT GAACATCGTC TCGGCCGCCG CCGACCTGCT GTGGTCACCG GTACTCAAGA ACTTCCCCGG CATCAGGATC GCGCTGTCCG AGGGCGGTAC AGGGTGGATC CCCTACTTTC TGGAGCGGGT CGACCGCACC TTCCACATGC ACGCCACCTG GACGATGCAG GACTTCGGCG GCCGGCTCCC GTCCGAGGTG TTCCGCGAGC ACTTCCTGAC CTGCTTCATC AGCGACCCGC TCGGGGTCGA ACTTCGCCAC CGGATCGGCA TCGACAACAT CGCCTGGGAG TGCGACTACC CGCACAGCGA CTCGATGTGG CCGGGCGCTC CAGAGGAGCT CGGCGAAGTG ATGAACCGAC TCGCCGTGCC GGAACCCGAG ATCAACAAAA TGACCTACGA GAACGCCCTG CGCTGGTACT CCTTTGACCC CTTCGCCCAT GTTCCGCGGG AGCACGCCAC GGTGGGGGCG CTGCGGGCCC AGGCCGCCGG GCACGACGTC TCCATCCGGG CGCTGAGCCA CCGCACCTAC GAGCGCGGCG AGAAGCTTGC GGCCTACCAG GCGATGGCAG CCTCCGCACC TGAGCGGTAG
|
Protein sequence | MRYDDMILVS IDDHIIEPPD MFADHLPERY KQDAPHVVRL PGGADAWKFR DTVIPNVALN AVAGRPKEEY GLEPQGLDEI RPGCYQVDER IKDMNAGGIL ASMNFPSFPG FAARLFATED PDFSLALVRA YNDWHLDEWC GAHPGRFIPM ALPVIWDAEL CAAEVRRVAA KGCHSLTFTE NPAALGYPSF HDAYWNPLWQ AVCDTNTVLS IHIGSSGQLT IPAPDSPPDV LITLQPMNIV SAAADLLWSP VLKNFPGIRI ALSEGGTGWI PYFLERVDRT FHMHATWTMQ DFGGRLPSEV FREHFLTCFI SDPLGVELRH RIGIDNIAWE CDYPHSDSMW PGAPEELGEV MNRLAVPEPE INKMTYENAL RWYSFDPFAH VPREHATVGA LRAQAAGHDV SIRALSHRTY ERGEKLAAYQ AMAASAPER
|
| |