Gene Namu_2374 details

Gene Information

Locus tag: Namu_2374
Symbol:
ID: 8447985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2614358
End bp: 2616058
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 67%
IMG OID: 645041494
Product: M6 family metalloprotease domain protein
Protein accession: YP_003201738
Protein GI: 258652582
COG category: [S] Function unknown
COG ID: [COG4412] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03296] M6 family metalloprotease domain
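
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names (span_length, coding_length_aa, gc_fraction) are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2614358
END_BP = 2616058
GENE_LENGTH_BP = 1701
PROTEIN_LENGTH_AA = 566


def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


def gc_fraction(sequence):
    """Fraction of G and C bases; whitespace in the listing is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Applying gc_fraction to the full gene sequence from the Sequence section should give a value that rounds to the GC content field, assuming that field was computed over the gene sequence alone.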


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00239302
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000554149
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGATC CGCAGATCCA TGCCCGAGCC ATTCACCGCG AGGCGTGTTT CGTCGCGCCG 
AGCCCGGAAC TGGCCGAGCG TTGGAAGGCC GACCTCGCCG GACTGCGCGG GGGCGGTGTC
GGCTCCGACA TCGCCTCGGT CCTGGCGATC GCCCGCCAAC CCAGACCGCT GGGCTTCGAC
GACGGCGTCA TCCTGCCGCC CGAGGAATAC CCGGTGGACA CACCGCTGAC CGCCATCCGC
AATGCCGCCG CCGACCGCGC TCCGTTGCGT GGCACCGTCC GGGTCATCGT GGTGCTGGTG
GACTTCAGCG ACAAGAAGAT GACCGCGACA ACCGACCACT TCCGGGACCT GTTCTTCTCC
ACCGGCGTCC TGCCGCACGG CAGCGTGCGC GAGTACTACC GCGAGGTCAC CAACAATCTC
GTCGACCTGG ACGGCGAGGT GGTGGGGCCG TTCCGGATGC CGCAGACGTT GGCCTGGTAC
GCCAACGGGA ACTTCGGCAT CGGCCGGCCG ACGGGCACGA CCCGGGCCCG CGACATGGCC
ATGGACGCGT TCCTGGCGGC CAACCCCAGT GTGAACTTCG GTCCCTACGA CAACGACGGC
AACGGCTACG TCGACGCGTT CATCGTCATC CATGCCGGGA CCGGCGGCGA GGCGTCCGGC
AACAGTGGCG ACATCTGGTC GCACAAATGG ACTCTCACCT CGCAGCAGAA CGCGGACGGG
ACCAAGGTCT ACGGCTACCT GACCATCCCG GAGGACGCCA AGATCGGCGT CAGCGCGCAC
GAGCTGGGCC ATCTGCTCTT CGGCTTCCCC GACCTGTACG ACACCGACAA CACCTCGGAG
GGCATCGGGA ACTGGTGCCT GATGGCCGCC GGCTCGTGGG GCGGTGGGGG CGACGTCCCG
GTCCACCCAT CGGCCTGGTG CAAGGCCAAT CAGGGCTGGG CCGCGGTCAC CAACGTGACG
GCCAACGGGC CGGCCACGAT CCCCGACGTC AAGGCCAGCC ACACGGTGCA TCGCCTGTGG
GAGGACGGTG CGGCCGGGCA GGAGTATTTC CTGGTCGAGA ATCGGCAACA GACCGGTTAC
GACGTCAGCC TGCCGGCCGG CGGATTGCTC ATCTGGCACA TCGACGACGC GCAAACCTCC
AACACCGACG AGAACCACTA CAAGGTGGCG CTCATGCAGG CCGACGGGCG ACGCGACCTG
GAGCTCAACC ACAATCGGGG CGATGCCGGT GACCCGTACC CCGGATCCGC GGCCAACACC
AGCTTCTCGT CGTCGTCCAC CCCGAACTCA CACTCCTACG CCGGCGCGGA CACCTGTGTG
TCGGTCACCG GGATCTCGGC GGCCGGAGCC AGCATGACCG CCCAGCTCAC CGTCAGCTGC
GGCAAGTCGG TGGTCAAGGA CGCCAAGGAC CACAAGGACG GCGTGAAGGA AGCCAAGGAG
CCGGTCAAGG AACGGAAGGA CATCAAGGAC CACAAGGACG GCATCAAGGA TCGAAAGGAC
GGCAAGGACG GCAAGGAACC CGTCAAGGAA CGCAAGGACA TCAAGGACCA CAAGGACGGC
GTCAAGGACG TCAAGGAGCC GTTCAAGGAA CGCAAGGACA TCAAGGATCA CACCGAGGGC
AAGGGCCCGC TGGCCGATCG GCCGCCGCTG CCCCCGGGCC GGGCCCCCTC GCGGCGTACG
CCGACCAGGG CGGCGACATG A
 
Protein sequence
MADPQIHARA IHREACFVAP SPELAERWKA DLAGLRGGGV GSDIASVLAI ARQPRPLGFD 
DGVILPPEEY PVDTPLTAIR NAAADRAPLR GTVRVIVVLV DFSDKKMTAT TDHFRDLFFS
TGVLPHGSVR EYYREVTNNL VDLDGEVVGP FRMPQTLAWY ANGNFGIGRP TGTTRARDMA
MDAFLAANPS VNFGPYDNDG NGYVDAFIVI HAGTGGEASG NSGDIWSHKW TLTSQQNADG
TKVYGYLTIP EDAKIGVSAH ELGHLLFGFP DLYDTDNTSE GIGNWCLMAA GSWGGGGDVP
VHPSAWCKAN QGWAAVTNVT ANGPATIPDV KASHTVHRLW EDGAAGQEYF LVENRQQTGY
DVSLPAGGLL IWHIDDAQTS NTDENHYKVA LMQADGRRDL ELNHNRGDAG DPYPGSAANT
SFSSSSTPNS HSYAGADTCV SVTGISAAGA SMTAQLTVSC GKSVVKDAKD HKDGVKEAKE
PVKERKDIKD HKDGIKDRKD GKDGKEPVKE RKDIKDHKDG VKDVKEPFKE RKDIKDHTEG
KGPLADRPPL PPGRAPSRRT PTRAAT
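
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal Python sketch, not part of the original record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ only in their permitted start codons), and it assumes that the spaces in the listing are formatting only. CODON_TABLE and translate are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGCTGATC CGCAGATCCA TGCCCGAGCC"))  # MADPQIHARA
    # Last 21 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("CCGACCAGGG CGGCGACATG A"))  # PTRAAT
```

The two fragments match the first ten and last six residues of the protein sequence listed above. Passing the complete gene sequence to translate would be the way to compare against the full 566-residue listing.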