Gene Information |
Locus tag | Cfla_1374 |
Symbol | |
ID | 9145258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 1525642 |
End bp | 1527630 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Neprilysin |
Protein accession | YP_003636471 |
Protein GI | 296129221 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.367096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00589485 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCGCA GCGGAGTGCC CCTCGACGAC CTCGACCCGT CCGTCCGGCC GCAGGACGAC CTCGACCTCT TCGTCAACGG CCGGTGGGCC GCCTCGTACG TGATCCCACC GGACCGGTCG ATGGACGGCC CGTTCCGGGC GCTGTACGAC GAGGCGGAGC GCCAGGTCCT GGACATCATC ACCGACGCCG CGCAGGCGGC GGGCGAGGGC GACGGCGTCG AGGCCAAGAT CGGGGCGCTG TACGCGAGCT TCATGGACAC CGACGCGGTT CGGGCGGCGG GCGTCGAGCC GCTGCGCGAG GACCTCGCGC TGGTCGACGC CGCCACGACG CCGGCGGAGC TGACGGTCGC GGTGGGCCGG CTGCAGCGCA CCGGCGCGCT GTCCGCGGTC GACCTGTACG TCGACAACGA CGCCAAGGAC CCCGACTCGT ACGTCGTGCA CCTCGTGCAG GGCGGGCTGG GCCTGCCCGA CGAGGCGTAC TACCGCGAGG AGCAGCACGC GGCCGTGCGC GAGAAGTACC TGCCGCACGT CGCCCGCATG CTGCGGCTCG CCGCGCCCGT CTCCGGCGTC GTCGCCGCGG GCGACGCGGA CGACCTCGCG GCGCGCGTCG TCGCGCTGGA GTCGCGCATC GCGGCGCACC ACTGGGACGT CGTCAAGGAC CGCGACGCCG AGCTGACGTA CAACGCGCTC ACGCTCGCCG AGCTCGCCGC GCGGGCGCCG GGGTTCGACT GGCGCGCGTG GGCCGAGGCG CTCGGCGCGC CGGCCGGCGC GCTCGACCGC CTCGTGGTCC GCGAGCCGTC GTTCGCCGAG GGGCTGGCGG CGCTGTGGAC CGAGGTGCCG GTCGCGGACT GGCAGGCGTG GGCCACCTAC CACGTGGTGT CGTCGCGCGC GCCGTACCTC ACGGACGAGC TCGTCGAGGC GAACTTCGAC TTCTACGGGC GCACGCTGTC CGGCGCGCCG GAGCTCCGTG ACCGCTGGAA GCGGGGCGTG TCCCTGGTCC AGGGGGCGCT CGGCGAGGCC GTGGGCAAGG TGTACGTCGA ACGGCACTTC CCGCCGTCGC ACAAGGAGCG CATGGACGAG CTCGTCGCGA ACCTCGTCGA GGCGTACCGC CGGTCGATCA CCGAGCTCGA GTGGATGGGC GAGGAGACGC GGCAGCGCGC GCTGGAGAAG CTGGCGAGGT TCACGCCCAA GATCGGGTAC CCCGCGAGGT GGCGGGACTA CTCGGCGCTC GAGGTGCGTG CCGACGACCT GGTGGGCAAC GTGCGGCGGT CGAACGCGTT CGACCTCGAC CGCGAGCTCG GCAAGATCGG GAGGCCGATC GACCGCGACG AGTGGTTCAT GACGCCGCAG ACCGTCAACG CCTACTACAA CCCCGGCATG AACGAGATCG TCTTCCCCGC GGCGATCCTG CAGCCGCCGT TCTTCGACGC CGAGGCGGAC GACGCCGCCA ACTACGGCGG CATCGGCGCG GTCATCGGCC ACGAGATCGG GCACGGGTTC GACGACCAGG GCTCGAAGTA CGACGGCGAC GGCCGCCTCG TCGACTGGTG GACGGCCGAG GACCGCGCGG AGTTCGAGCG CCGCACGAAG TCGCTCGTCG ACCAGTACGC CCAGTACTCG CCCCGGCAGC TGGGCGGCAG CCACCGCGTC AACGGCGAGC TGACGATCGG CGAGAACATC GGCGACCTCG GCGGCCTGTC GATCGCGGTG CGTGCGTACG AGATCGCGCT GGGCCACCCC CTGGACGAGG CACCCGTGCT CGACGGGTAC ACGGGCCTGC AGCGCCTGTT CATGGGCTGG 
GCGCACTCGT GGCGCACCAA GGGCCGCGAC GAGGAGGTGA TCCGCCGGCT CGCGACGGAC CCGCACTCCC CCGACGAGTT CCGCTGCAAC GGCGTCGTGC GGAACATCGA CGAGTTCTAC ACGGCGTTCG ACGTGCAGCC GGACGACGCC CTGTGGCTCG ACCCGGAGCA GCGCGTCCGC ATCTGGTGA
|
Protein sequence | MTRSGVPLDD LDPSVRPQDD LDLFVNGRWA ASYVIPPDRS MDGPFRALYD EAERQVLDII TDAAQAAGEG DGVEAKIGAL YASFMDTDAV RAAGVEPLRE DLALVDAATT PAELTVAVGR LQRTGALSAV DLYVDNDAKD PDSYVVHLVQ GGLGLPDEAY YREEQHAAVR EKYLPHVARM LRLAAPVSGV VAAGDADDLA ARVVALESRI AAHHWDVVKD RDAELTYNAL TLAELAARAP GFDWRAWAEA LGAPAGALDR LVVREPSFAE GLAALWTEVP VADWQAWATY HVVSSRAPYL TDELVEANFD FYGRTLSGAP ELRDRWKRGV SLVQGALGEA VGKVYVERHF PPSHKERMDE LVANLVEAYR RSITELEWMG EETRQRALEK LARFTPKIGY PARWRDYSAL EVRADDLVGN VRRSNAFDLD RELGKIGRPI DRDEWFMTPQ TVNAYYNPGM NEIVFPAAIL QPPFFDAEAD DAANYGGIGA VIGHEIGHGF DDQGSKYDGD GRLVDWWTAE DRAEFERRTK SLVDQYAQYS PRQLGGSHRV NGELTIGENI GDLGGLSIAV RAYEIALGHP LDEAPVLDGY TGLQRLFMGW AHSWRTKGRD EEVIRRLATD PHSPDEFRCN GVVRNIDEFY TAFDVQPDDA LWLDPEQRVR IW
|
| |
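The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TGA). The function name `expected_lengths` is hypothetical.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    contains exactly one terminal stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record for Cfla_1374.
gene_len, protein_len = expected_lengths(1525642, 1527630)
print(gene_len, protein_len)  # 1989 662
```

Under these assumptions the result agrees with the Gene Length (1989 bp) and Protein Length (662 aa) fields. Strand does not affect the length calculation; it only determines which strand of the replicon is read.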