Gene Mflv_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0472 
Symbol 
ID4971548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp498488 
End bp500488 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content67% 
IMG OID640454677 
Productneprilysin 
Protein accessionYP_001131754 
Protein GI145221076 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0927032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0519696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTAG AAGCTACATC TGACACCTCT GCGAAGTCCG GCATCGATCT GCGCTACGTC 
GACGCCGACG CCCGCCCCCA GGACGACCTG TTCGGTCACG TCAACGGTCG CTGGCTGGCC
GAGTACCAGA TACCCGGCGA CCGGGCGACC GACGGCGCCT TCCGCACGCT CTACGACCGG
GCCGAGGAGC AGATCCGCGA TCTGATCACC GAAGCCGCGT CGGCACACGC CGCCGAGGGC
ACCGACCAGC AGCGGATAGG GGACCTGTAC GCGAGCTTCA TGGACGAGCA GACGGTCCGC
GACCGCGGCC TGGCGCCGTT GCTCGACGAG CTCGGGGCCA TCGACGCGGC CGGGTCACCC
GACGCGCTGG CCGAGGTCCT GGGCTCATTG CAACGCACCG GGATCGGCGG TGCGACCGGT
CTCTACGTCG ACACCGACTC GAAGAACTCC ACTCGTTACC TGCTGCATCT GACCCAGTCG
GGCATCGGGC TGCCCGACGA GTCGTACTTC CGCGAGGAGC AGCACGCCGA GATCCTGGCC
GCCTACCCCG GCCACATCGC GGCGATGTTC GGGCTCGTGC TGGGCGGGGA TCCCGGGGAG
CACGCCGCGA CGGCCCAACG GATCGTCGCG CTGGAGACCA AACTCGCCGC CGCGCACTGG
GATGTGGTCA AGCGGCGCGA CGCCGACCTG ACCTACAACC TGCGCACGTT CGCCGAACTG
ACCGACGAGT CACCGGGTTT CGACTGGACA CGCTGGCTCG GCGGGCTCGG TGCCGATCGG
GACAAGGCCG CCGACGTCGT GGTGCGCCAG CCGGACTATC TGACGGCGTT CGCATCGCTG
TGGAGCGGCT CAAGCCTTGA GGACTGGAAG GACTGGCTGA GGTGGCGCGT CATCCACGGC
CGGGCCTTCC TGCTCACCGA CGAGCTGATC GCGGAGGACT TCTCGTTCTA CGGCAAGCGC
CTCTCGGGCA CCGAGGAGAT CCGGGATCGC TGGAAGCGCG GCGTGTCGGT CGTCGAGGCC
CTGATGGGCG AGGCGCTGGG CAAGCTGTAT GTGGAGCGTC ATTTCCCGCC GCAGGCCAAG
GCCCGGATGG ACGAGTTGGT CGCCAACCTG CGCGAGGCCT ACCGGGTCAG CATCAACACG
CTGGACTGGA TGACGCCGCA GACACGCGAG AAGGCCCTGG TCAAGCTCGA CAAGTTCACG
CCGAAGATCG GCTACCCGAA CACGTGGCGC GACTATTCGG CACTGGTCAT CGAGCGTGAC
GACCTGTACG GCAACTACCG GCGCGGGTAT GCGCTGGAGT ACGACCGCGA TCTGGCGAAG
CTGGGCGGGC CGGTGGACCG CGACGAGTGG TTCATGACGC CGCAGACGGT CAACGCGTAC
TACAACCCGG GGATGAACGA GATCGTGTTC CCCGCGGCGA TCCTGCAGCC GCCGTTCTTC
GACGCCGATG CCGACGACGC GGCCAATTAC GGCGGTATCG GCGCGGTGAT CGGGCACGAG
ATCGGGCACG GGTTCGACGA CCAGGGCGCC AAGTACGACG GCGACGGCAA CCTGGTGGAC
TGGTGGACCG ACGAGGACCG CGCGGAATTC GGCAAGCGCA CAACGGCGTT GATCGAGCAG
TACGAGCAGT TCACCCCGCG CGGGCTGGAG CCCTCGCACC ACGTGAACGG CGCGTTCACC
GTCGGCGAGA ACATCGGTGA CCTCGGCGGG CTCTCGATAG CACTGCTCGC CTACCGGCTC
TCGCTCAAGG GTGAACCGGC GCCCGTCATC GACGGATTGA CGGGTGTGCA ACGAGTCTTC
TATGGGTGGG CGCAGGTGTG GCGCACGAAA TCCCGTGAGG CCGAGGCGAT CCGGCGGCTG
GCGGTGGACC CGCATTCACC ACCGGAGTTC CGGTGCAACG GCGTGATCCG CAACATGGAC
GCGTTCTACG ACGCGTTCGA CGTCGATCCC GAGGACGCCC TGTATCTGGA ACCTCAACGG
CGCGTGCACA TCTGGAACTG A
 
Protein sequence
MTVEATSDTS AKSGIDLRYV DADARPQDDL FGHVNGRWLA EYQIPGDRAT DGAFRTLYDR 
AEEQIRDLIT EAASAHAAEG TDQQRIGDLY ASFMDEQTVR DRGLAPLLDE LGAIDAAGSP
DALAEVLGSL QRTGIGGATG LYVDTDSKNS TRYLLHLTQS GIGLPDESYF REEQHAEILA
AYPGHIAAMF GLVLGGDPGE HAATAQRIVA LETKLAAAHW DVVKRRDADL TYNLRTFAEL
TDESPGFDWT RWLGGLGADR DKAADVVVRQ PDYLTAFASL WSGSSLEDWK DWLRWRVIHG
RAFLLTDELI AEDFSFYGKR LSGTEEIRDR WKRGVSVVEA LMGEALGKLY VERHFPPQAK
ARMDELVANL REAYRVSINT LDWMTPQTRE KALVKLDKFT PKIGYPNTWR DYSALVIERD
DLYGNYRRGY ALEYDRDLAK LGGPVDRDEW FMTPQTVNAY YNPGMNEIVF PAAILQPPFF
DADADDAANY GGIGAVIGHE IGHGFDDQGA KYDGDGNLVD WWTDEDRAEF GKRTTALIEQ
YEQFTPRGLE PSHHVNGAFT VGENIGDLGG LSIALLAYRL SLKGEPAPVI DGLTGVQRVF
YGWAQVWRTK SREAEAIRRL AVDPHSPPEF RCNGVIRNMD AFYDAFDVDP EDALYLEPQR
RVHIWN