Gene Ndas_4970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4970 
Symbol 
ID9248859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp110902 
End bp112959 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content68% 
IMG OID 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_003682858 
Protein GI297563885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.259213 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGA AGCGTTTTTT CCGCGGTCCG TGGATCTGGA TCCTGGCCGT CGGCCTCATG 
CTCATCATCG GCTTCCGCGT GTTCGACGTC GGTGGCGGTC CCCAACCCAC GGAGACCGAC
ATCTCGACCG TCCACCAGCT CATCGAGCGG GACGAGGTGG ACAACGCCGA GATCGTCGAC
CGCGACCAGC GCATCGTGCT GACCACCGTG GACGACGAGA TCTACGAGGC GTACTGGGTC
GAGGGGCAGG GCCTCCAGCT CGCGGAGATG CTCCAGACCT CCCTGGACAG CGAGGGCTCC
AACCTGGAGT CCTACGAGGT GTCGGTCGCC ACCACACCGG TGTGGACCAC CGCGCTGTTC
AGCCTGCTGC CGCTGGTCAT CATCATCGTC ATCTTCTGGT TCATCTTCAG CCAGATGCAG
GGCGGCGGCT CGCGCGTGAT GAACTTCGGC AAGTCCAAGG CCAAGCTCAT CACCAAGGAC
ACGCCCAAGA ACACCTTCGC CGACGTCGCC GGGGCCGACG AGGCCATCGA GGAGCTCCAC
GAGATCAAGG AGTTCCTCCA GAACCCGGCC AAGTTCCAGG CGATGGGCGC CAAGATCCCC
AAGGGCGTGC TCCTGATGGG CCCGCCCGGA ACCGGTAAGA CCCTCCTGGC CCGCGCCGTC
GCCGGCGAGG CCGGGGTCCC GTTCTACTCC ATCTCCGGCT CGGACTTCGT CGAGATGTTC
GTCGGTGTGG GTGCCTCCCG CGTGCGCGAC CTGTTCGAGC AGGCCAAGGC CAACGCCCCC
GCGATCATCT TCATCGACGA GATCGACGCC GTCGGCCGCC ACCGCGGCGC CGGCATGGGC
GGCGGGCACG ACGAGCGCGA GCAGACCCTC AACCAGATGC TGGTCGAGAT GGACGGCTTC
GACGTCAAGG GCGGCGTCAT CCTCATCGCC GCCACCAACC GGCCCGACAT CCTCGACCCG
GCGCTGCTGC GCCCCGGCCG CTTCGACCGG CAGGTCGTCG TGGACCGCCC GGACATGGAC
GGCCGCCGCG ACATCCTCAA GGTCCACGCC AAGGGCAAGC CCATGGCGGA CGACGTCGAC
TTCAACGTCA TCGCCCGCCA GACCGCCGGG ATGACCGGCG CCGACCTGGC CAACGTCATC
AACGAGGGCG CCCTCCTGTC CGCGCGCGCC GACCGGAACG TCATCACCCA CGCCGTCCTG
GAGGAGGCGA TCGAACGGGT CATGGCCGGT CCCGAGCGCA AGACCCGGGT GATGTCCGAC
CGGGAGAAGA AGGTCATCGC CTACCACGAG GGCGGCCACG CCCTGGTGGG CCACGCGCTG
CCCAACTCCG ACCCGGTGCA CAAGATCACC ATCCTGCCGC GCGGCCGGGC CCTGGGCTAC
ACGATGTCGG TGCCGACGGA GGACAAGTTC CTCACGTCGC GTTCGCAGAT GATGGACCAG
CTCGCGATGA TGCTCGGAGG TCGCGCCGCG GAGGAGCTCG TCTTCCACGA GCCCACCACC
GGCGCGGGCA ACGACATCGA CAAGGCCACC AGCCTGGCCC GCAACATGGT GACCGAGTAC
GGCATGAGCG AGCGCCTGGG CGCCCGCAAG TTCGGCTCCG GCAACACCGA ACCCTTCCTG
GGCCGGGAGA TGTCGCACGC CCGCGAGTAC TCCGAGGAGA TCGCCTCCAT CATCGACGAG
GAGGTGCGCC GCCTCATCGA GTCCGCGCAC GACGAGGCCT ACGAGGTCCT CGTCGAGTAC
CGGGACGTCC TGGACGACCT GGTCGTGGCG CTGCTGGAGA AGGAGACCCT GTCCAAGGCC
CAGGTGCTGG AGATCTTCGC GCCGGTGGTC AAGCGACCCT CCCGCGGCTC CTACACGGGG
TACGGCAAGC GCCAGCCCTC GGAGCGGCCC CCGGTGCGCT CCAAGAAGGA GCTGGCCGTG
CTCAACGGCG CCGAGGTCCC GGGCGCCGAC GCGGTGCAGG GCTCCACTGA CGTCCAGGAC
CCCCAGGCGG TCCGGGGCTC CTCGACCAAC GGACAGCTCC CGGGGCCGTC CGACAGAGGC
GAGGAGGAGA GCTCGTGA
 
Protein sequence
MNLKRFFRGP WIWILAVGLM LIIGFRVFDV GGGPQPTETD ISTVHQLIER DEVDNAEIVD 
RDQRIVLTTV DDEIYEAYWV EGQGLQLAEM LQTSLDSEGS NLESYEVSVA TTPVWTTALF
SLLPLVIIIV IFWFIFSQMQ GGGSRVMNFG KSKAKLITKD TPKNTFADVA GADEAIEELH
EIKEFLQNPA KFQAMGAKIP KGVLLMGPPG TGKTLLARAV AGEAGVPFYS ISGSDFVEMF
VGVGASRVRD LFEQAKANAP AIIFIDEIDA VGRHRGAGMG GGHDEREQTL NQMLVEMDGF
DVKGGVILIA ATNRPDILDP ALLRPGRFDR QVVVDRPDMD GRRDILKVHA KGKPMADDVD
FNVIARQTAG MTGADLANVI NEGALLSARA DRNVITHAVL EEAIERVMAG PERKTRVMSD
REKKVIAYHE GGHALVGHAL PNSDPVHKIT ILPRGRALGY TMSVPTEDKF LTSRSQMMDQ
LAMMLGGRAA EELVFHEPTT GAGNDIDKAT SLARNMVTEY GMSERLGARK FGSGNTEPFL
GREMSHAREY SEEIASIIDE EVRRLIESAH DEAYEVLVEY RDVLDDLVVA LLEKETLSKA
QVLEIFAPVV KRPSRGSYTG YGKRQPSERP PVRSKKELAV LNGAEVPGAD AVQGSTDVQD
PQAVRGSSTN GQLPGPSDRG EEESS