Gene Cfla_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1374 
Symbol 
ID9145258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1525642 
End bp1527630 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content72% 
IMG OID 
ProductNeprilysin 
Protein accessionYP_003636471 
Protein GI296129221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.367096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00589485 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGCA GCGGAGTGCC CCTCGACGAC CTCGACCCGT CCGTCCGGCC GCAGGACGAC 
CTCGACCTCT TCGTCAACGG CCGGTGGGCC GCCTCGTACG TGATCCCACC GGACCGGTCG
ATGGACGGCC CGTTCCGGGC GCTGTACGAC GAGGCGGAGC GCCAGGTCCT GGACATCATC
ACCGACGCCG CGCAGGCGGC GGGCGAGGGC GACGGCGTCG AGGCCAAGAT CGGGGCGCTG
TACGCGAGCT TCATGGACAC CGACGCGGTT CGGGCGGCGG GCGTCGAGCC GCTGCGCGAG
GACCTCGCGC TGGTCGACGC CGCCACGACG CCGGCGGAGC TGACGGTCGC GGTGGGCCGG
CTGCAGCGCA CCGGCGCGCT GTCCGCGGTC GACCTGTACG TCGACAACGA CGCCAAGGAC
CCCGACTCGT ACGTCGTGCA CCTCGTGCAG GGCGGGCTGG GCCTGCCCGA CGAGGCGTAC
TACCGCGAGG AGCAGCACGC GGCCGTGCGC GAGAAGTACC TGCCGCACGT CGCCCGCATG
CTGCGGCTCG CCGCGCCCGT CTCCGGCGTC GTCGCCGCGG GCGACGCGGA CGACCTCGCG
GCGCGCGTCG TCGCGCTGGA GTCGCGCATC GCGGCGCACC ACTGGGACGT CGTCAAGGAC
CGCGACGCCG AGCTGACGTA CAACGCGCTC ACGCTCGCCG AGCTCGCCGC GCGGGCGCCG
GGGTTCGACT GGCGCGCGTG GGCCGAGGCG CTCGGCGCGC CGGCCGGCGC GCTCGACCGC
CTCGTGGTCC GCGAGCCGTC GTTCGCCGAG GGGCTGGCGG CGCTGTGGAC CGAGGTGCCG
GTCGCGGACT GGCAGGCGTG GGCCACCTAC CACGTGGTGT CGTCGCGCGC GCCGTACCTC
ACGGACGAGC TCGTCGAGGC GAACTTCGAC TTCTACGGGC GCACGCTGTC CGGCGCGCCG
GAGCTCCGTG ACCGCTGGAA GCGGGGCGTG TCCCTGGTCC AGGGGGCGCT CGGCGAGGCC
GTGGGCAAGG TGTACGTCGA ACGGCACTTC CCGCCGTCGC ACAAGGAGCG CATGGACGAG
CTCGTCGCGA ACCTCGTCGA GGCGTACCGC CGGTCGATCA CCGAGCTCGA GTGGATGGGC
GAGGAGACGC GGCAGCGCGC GCTGGAGAAG CTGGCGAGGT TCACGCCCAA GATCGGGTAC
CCCGCGAGGT GGCGGGACTA CTCGGCGCTC GAGGTGCGTG CCGACGACCT GGTGGGCAAC
GTGCGGCGGT CGAACGCGTT CGACCTCGAC CGCGAGCTCG GCAAGATCGG GAGGCCGATC
GACCGCGACG AGTGGTTCAT GACGCCGCAG ACCGTCAACG CCTACTACAA CCCCGGCATG
AACGAGATCG TCTTCCCCGC GGCGATCCTG CAGCCGCCGT TCTTCGACGC CGAGGCGGAC
GACGCCGCCA ACTACGGCGG CATCGGCGCG GTCATCGGCC ACGAGATCGG GCACGGGTTC
GACGACCAGG GCTCGAAGTA CGACGGCGAC GGCCGCCTCG TCGACTGGTG GACGGCCGAG
GACCGCGCGG AGTTCGAGCG CCGCACGAAG TCGCTCGTCG ACCAGTACGC CCAGTACTCG
CCCCGGCAGC TGGGCGGCAG CCACCGCGTC AACGGCGAGC TGACGATCGG CGAGAACATC
GGCGACCTCG GCGGCCTGTC GATCGCGGTG CGTGCGTACG AGATCGCGCT GGGCCACCCC
CTGGACGAGG CACCCGTGCT CGACGGGTAC ACGGGCCTGC AGCGCCTGTT CATGGGCTGG
GCGCACTCGT GGCGCACCAA GGGCCGCGAC GAGGAGGTGA TCCGCCGGCT CGCGACGGAC
CCGCACTCCC CCGACGAGTT CCGCTGCAAC GGCGTCGTGC GGAACATCGA CGAGTTCTAC
ACGGCGTTCG ACGTGCAGCC GGACGACGCC CTGTGGCTCG ACCCGGAGCA GCGCGTCCGC
ATCTGGTGA
 
Protein sequence
MTRSGVPLDD LDPSVRPQDD LDLFVNGRWA ASYVIPPDRS MDGPFRALYD EAERQVLDII 
TDAAQAAGEG DGVEAKIGAL YASFMDTDAV RAAGVEPLRE DLALVDAATT PAELTVAVGR
LQRTGALSAV DLYVDNDAKD PDSYVVHLVQ GGLGLPDEAY YREEQHAAVR EKYLPHVARM
LRLAAPVSGV VAAGDADDLA ARVVALESRI AAHHWDVVKD RDAELTYNAL TLAELAARAP
GFDWRAWAEA LGAPAGALDR LVVREPSFAE GLAALWTEVP VADWQAWATY HVVSSRAPYL
TDELVEANFD FYGRTLSGAP ELRDRWKRGV SLVQGALGEA VGKVYVERHF PPSHKERMDE
LVANLVEAYR RSITELEWMG EETRQRALEK LARFTPKIGY PARWRDYSAL EVRADDLVGN
VRRSNAFDLD RELGKIGRPI DRDEWFMTPQ TVNAYYNPGM NEIVFPAAIL QPPFFDAEAD
DAANYGGIGA VIGHEIGHGF DDQGSKYDGD GRLVDWWTAE DRAEFERRTK SLVDQYAQYS
PRQLGGSHRV NGELTIGENI GDLGGLSIAV RAYEIALGHP LDEAPVLDGY TGLQRLFMGW
AHSWRTKGRD EEVIRRLATD PHSPDEFRCN GVVRNIDEFY TAFDVQPDDA LWLDPEQRVR
IW