Gene Information |
Locus tag | Franean1_3364 |
Symbol | |
ID | 5671735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3987893 |
End bp | 3989848 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242252 |
Product | endothelin-converting protein 1 |
Protein accession | YP_001507672 |
Protein GI | 158315164 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3590] Predicted metalloendopeptidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACATCC TCGACGACGC CCGCGAGGGC ATGGACCTCG ACGTCCGACC GCAGGACGAC CTGTTCGGCC ACGTGAACGG CCGGTGGCTC GCCGAGACGG AGATCCCGTC CGACCGGTCG AGCTGGGGCC CGTTCGTGCA GCTGGCCGAT GACGCCGAGC GGCAGGTCCG CGACATCATC ACCGACCTCG CCGCGCGGGA CCAGGCCACC CAGGGCGAGG ACGCGCGGAA GATCGGCGAC CTCTACAACT CCTTCATGGA CACCGAGGCG CTCGAGGCGC TCGGCCTGGG CCCGGTGCGG CCGCTGCTGG ACGGCGCGCG CGGGCTGAGC GACGTCCGCG GCCTCGCCGC GTTCCTCGGC GAGCTCGAGC GGATCGGCGG CGCCGGACTG TTCGGCTCCT ACGTCGACAC CGACGACCGC AACTCGGACC GCTACCTGTT CCACCTGCGC CAGGGCGGCC TCGGCCTGCC GGACGAGTCG TACTACCACG ACGACAAGTT CGCCGCGACC CGCCAGAAGT ACGTCGACTA CCTGACCCGG ATGCTCGGGC TGGGCGGGCA CCCCGACCCC GAGGGCGCCG CGCAGCGGAT CCTCGACGTG GAAACCCTGC TCGCGAAGGG CCACTGGGAG CGGGCCGAGA CCCGCGACGT CCAGAAGACC TACAACCTGA TGACGGGCGG ACAGCTGGCC GCGCTCTGCC CGGCGTTCGA CTGGGACGCC TACGTCACCG GTCTCGGTGG GTCGCTGACC GGCCCGCACG CGACGCTGGC GGAGGCGTGC GTGCGGCAGC CGTCGTTCTT CGAGCACCTG TCGACCGTGC TGACCGACAC GCCGGTCGAC GTGTGGCGCG ACTGGCTGGT CAGCCGCGTG CTGCGCTCGG CCGCGGCCTA CCTGCCCGAC GTGTTCACCG AGACCCACTT CGACTTCTAC GGCCGCACGC TCAGCGGCAC GCCCGAGCTG CGGGCCCGCT GGAAGCGCGC GGTGGCGTTC GTCGAGGGCG CGATCGGGGA GTCTGTCGGC AGGGAGTACG TCGCCCGGCA CTTCCCGCCG CACGCCAAGG CGCAGATGGA CGACCTCGTC GCGAACCTGC TCGCGGCCTA CCGCTCGTCG ATCTCCCAGC TGGACTGGAT GACGGAGGAG ACCAAGCAGC GGGCGTACGA GAAGCTCGAG ACGTTCCGGC CGAAGATCGG CTACCCCGAC CGGTTCCGGG ACTACTCGGC GCTGCCGGTC CGCCGCGGCG ACCTGATGGG CAACGCCCGC GCAGCCGCCG CGTTCGAGAC CGACCGGGAG CTGGCCAAGA TCGGCTCGCC GGTGGACCGC GACGAGTGGT TCATGCTCCC GCAGACCGTC AACGCCTACT ACAACCCGGG CACCAACGAG ATCTGCTTCC CGGCCGCCAT TCTCCAGAAG CCGTTCTTCA GCCCGGACGG CCACCCGGCC GAGAACTACG GCGGCATCGG CGCGGTGATC GGCCACGAGG TCGGTCACGG CTTCGACGAC CAGGGCGCGC AGTACGACGG CGCCGGCAAC CTCAACGACT GGTGGACGCC CGCCGACAAG GCGGCCTTCG AGGTGAAGTC GAAGACGCTG GTCGAGCAGT ACAACGGGTT CGAGTCGCGC AACCTGCCGG GCGAGAAGGT GAACGGCGCG CTCACTGTCG GGGAGAACAT CGGCGACCTC GGCGGCCTGA CCATCGCCCA CCAGGCCTAC GTCATCTCCC AGGACGGCGA GCCGTCGCGG GAGGACCGAC GTCGGCTGTT CATGAACTGG GCCTACGTGT GGCGCTCCAA GCGCCGGCTC 
GAGCTGGAGC GGCAGTACCT GACCACCGAC CCGCACAGCC CGCCGGACCT GCGCGCCAAC ATCGTGCGCA ACCTCGACGA GTTCCACGAC GTCTTCGGCA CCGAGCCCGG CGACGGGCTG TGGCTGGACC CGGCCGACCG GGTCCGCATC TGGTAG
|
Protein sequence | MNILDDAREG MDLDVRPQDD LFGHVNGRWL AETEIPSDRS SWGPFVQLAD DAERQVRDII TDLAARDQAT QGEDARKIGD LYNSFMDTEA LEALGLGPVR PLLDGARGLS DVRGLAAFLG ELERIGGAGL FGSYVDTDDR NSDRYLFHLR QGGLGLPDES YYHDDKFAAT RQKYVDYLTR MLGLGGHPDP EGAAQRILDV ETLLAKGHWE RAETRDVQKT YNLMTGGQLA ALCPAFDWDA YVTGLGGSLT GPHATLAEAC VRQPSFFEHL STVLTDTPVD VWRDWLVSRV LRSAAAYLPD VFTETHFDFY GRTLSGTPEL RARWKRAVAF VEGAIGESVG REYVARHFPP HAKAQMDDLV ANLLAAYRSS ISQLDWMTEE TKQRAYEKLE TFRPKIGYPD RFRDYSALPV RRGDLMGNAR AAAAFETDRE LAKIGSPVDR DEWFMLPQTV NAYYNPGTNE ICFPAAILQK PFFSPDGHPA ENYGGIGAVI GHEVGHGFDD QGAQYDGAGN LNDWWTPADK AAFEVKSKTL VEQYNGFESR NLPGEKVNGA LTVGENIGDL GGLTIAHQAY VISQDGEPSR EDRRRLFMNW AYVWRSKRRL ELERQYLTTD PHSPPDLRAN IVRNLDEFHD VFGTEPGDGL WLDPADRVRI W
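The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the stop codon is included in the gene length but
    contributes no amino acid to the protein.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(3987893, 3989848)
print(length)                  # 1956
print(protein_length(length))  # 651
```

Both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields. The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as an initiation codon and is read as methionine at the start of translation.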