Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0131 |
Symbol | |
ID | 4597580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 143575 |
End bp | 145524 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639774741 |
Product | endothelin-converting protein 1 |
Protein accession | YP_921363 |
Protein GI | 119714398 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3590] Predicted metalloendopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.029376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCATCC TCGACGAAGC CCGGGCAGGC ATGAGCCCCG AGATCCGGCC CCAGGACGAC CTCTTCGGCC ACGTGAACGG CCGCTGGCTG GACGAGACCG AGATCCCGGC GGACCGCTCG AGCTGGGGGC CCTTCATCCA GCTCGCCGAC ACCGCCGAGA CCCAGGTCCA CGAGATCATC GAGGACCTCG CGCGCCGGGT CGCGGCCGGC GAGCCGGTCG ACGAGGACGC CCACAAGATC GGCGACCTGT TCGCCTCGTT CATGGACACC GAGACGATCG CGCGCAACGG CCTGCGGCCG GTGCGCCCCC TGATCGAGGC CGTCGCGGGG CTGCGCGACG TCCGCGACCT CGCTGCGTTC CTCGGCGAGT TCGAGCGGAT CGGCGGCCAC GGCCTGTTCG GCTCCTACGT CGACACCGAC TCCAAGAACT CCGACCGCTA CCTGTTCAAC CTGGTGCAGG GCGGGCTCGG CCTGCCCGAC GAGTCCTACT ACCGCGACGA GAAGTTCGCG GAGATCCGCG AGAAGTACGT CGCCTACCTC ACCACCCTGT TCGGCCTGGG GGAGCACCCC GATCCTGCTG CCGCGGCCGC GACGGTGCTC GCCATCGACA CCCGGATGGC CGCGGGGCAC TGGGAGCGCG CCGAGACCCG CGACGTGCAG AAGACCTACA ACCTGATGAC CAGGGCCGAG CTGATCGAGC TCAGCCCGGG CTTCGACTGG GACGCCTACG TCACCAACCT CGGCGGCAAC GAGGAGACGC TCGCGGAGGT GTGCGTGCGG CAGCCGTCGT ACTTCACCCA CCTCTCGGTG CTCCTCGACG AGATCTCCCT CGAGGACTGG CGCGAGTGGC TGCTGGCGCA CGTGCTGCGG TCGGCGGCGG CGTACCTCAC CGACGACTTC GTCGAGACGA ACTTCGACTT CTACGGCCGG ACCCTCAGCG GCACGCCCGA GCTGCGGGCG CGGTGGAAGC GGGGGGTCGC GCTGGTCGAG GGCGCGATCG GCGAGGCGGT CGGCAAGGAG TACGTCGCAC GGCACTTCCC GCCCCGGTCG AAGGCGATGA TGGACGAGCT GGTCGCGAAC CTGCTCGCCG CCTACCGCCA GTCCATCTCC CGGCTCGACT GGATGACCGA GGAGACCAAG CAGCGCGCGT ACGACAAGCT CGACAGGTTC CGGCCCAAGA TCGGCTACCC GGAGAAGTTC CGCGACTACT CCGCGCTCCG GGTGACCCGC GACGACCTGC TCGGCAACGT CGCCGCCGCG TCGGCGTTCG AGACCGACCG GCAGCTCGCG AAGATCGGCT CGCCGGTGGA CCGCGACGAG TGGTTCATGC TCCCCCAGAC CGTCAACGCC TACTACAACC CCGGCACCAA CGAGATCTGC TTCCCCGCCG GCATCCTGCA GAAGCCGTTC TTCTCCCCGG ACGCCGAGGA GGCCGAGAAC TACGGCGGCA TCGGCGCGGT CATCGGCCAC GAGATCGGGC ACGGCTTCGA CGACCAGGGC GCGCAGTACG ACGGCAGCGG CAACCTGCAC GACTGGTGGA CCCCCGACGA CAAGGCCGCG TTCGAGGTGA AGTCGAAGGC CCTCATCGAG CAGTACGACG GCTTCGAGCC CCGCACGCTG CCCGGCGAGC GCGTCAACGG CGCGCTCACC GTCGGTGAGA ACATCGGCGA CCTCGGCGGG CTGACCATCG GCCACACCGC CTACCTGATC GCCCGCGGCG GGAGCGCGTC CGTCGAGGAC CGGCAGAAGG TGTTCCTGAA CTGGGCCTAC TGCTGGCGGA CCAAGCGGCG CAAGGAGCAG GAGCAGCAGT ACCTCACCAT CGACCCGCAC TCCCCGGCGG AGTTCCGTGC GAACATCGTG CGCAACCTCG ACGAGTTCCA CGAGGTGTTC GGCACCGTCG AGGGGGACGG GCTCTGGCTG GACCCCGACC AGCGGGTGCG CATCTGGTGA
|
Protein sequence | MSILDEARAG MSPEIRPQDD LFGHVNGRWL DETEIPADRS SWGPFIQLAD TAETQVHEII EDLARRVAAG EPVDEDAHKI GDLFASFMDT ETIARNGLRP VRPLIEAVAG LRDVRDLAAF LGEFERIGGH GLFGSYVDTD SKNSDRYLFN LVQGGLGLPD ESYYRDEKFA EIREKYVAYL TTLFGLGEHP DPAAAAATVL AIDTRMAAGH WERAETRDVQ KTYNLMTRAE LIELSPGFDW DAYVTNLGGN EETLAEVCVR QPSYFTHLSV LLDEISLEDW REWLLAHVLR SAAAYLTDDF VETNFDFYGR TLSGTPELRA RWKRGVALVE GAIGEAVGKE YVARHFPPRS KAMMDELVAN LLAAYRQSIS RLDWMTEETK QRAYDKLDRF RPKIGYPEKF RDYSALRVTR DDLLGNVAAA SAFETDRQLA KIGSPVDRDE WFMLPQTVNA YYNPGTNEIC FPAGILQKPF FSPDAEEAEN YGGIGAVIGH EIGHGFDDQG AQYDGSGNLH DWWTPDDKAA FEVKSKALIE QYDGFEPRTL PGERVNGALT VGENIGDLGG LTIGHTAYLI ARGGSASVED RQKVFLNWAY CWRTKRRKEQ EQQYLTIDPH SPAEFRANIV RNLDEFHEVF GTVEGDGLWL DPDQRVRIW
|
| |