Gene Namu_3157 details


Gene Information

Locus tag: Namu_3157
Symbol:
ID: 8448771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3479268
End bp: 3480593
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 73%
IMG OID: 645042238
Product: HNH endonuclease
Protein accession: YP_003202479
Protein GI: 258653323
COG category:
COG ID:
TIGRFAM ID:
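
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (consistent with "Is gene spliced: No" and the TGA at the end of the gene sequence). The function names are hypothetical helpers, not part of the source database.

```python
# Hypothetical helpers for checking the record's length fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3479268, 3480593)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Both values agree with the Gene Length and Protein Length fields of this record.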


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000184807
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000205863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGACC TGGCGGGCGA GCTGACGGAC GCGGACATGC TGGCGATGTG GGCAAACGCT 
CCGATCAACC CTCAACCTCT AGTTGAGAGT GATCCCCCGG ATGAGGCGTT GGCCGAGCTG
GAGAGCCGGA TCACCTCGAT GGCGGCACGG TTGGCCGCGC AGACCCGGGA GTGGCTGGCC
CTGGTCGCCG AGTTCAACCG GCGCAAGGGG TGGGTGCAGT GGGGGATGCG GTCGATGGCG
CACTGGCTGT CCTGGTCATG TTCGGTCGGG CCGGGGGTGG CCCGGGAGTA CGTGCGGGTC
GCGACCGCGC TGACCGAGTT GCCGCTGGTG GACGAGGCGT TCGCGCAGGG GCAGCTGTCG
TATTCCAAGG TGCGGGCGGT GACCCGGGTC GCCGACCAGG TGGACCAGAC CACGCTGCTG
GAGCAGGCCA AGGTGCATTC CGCGGCCCAG CTGGAGAAGG TGATCCGCGG CTACCGCAAG
GCGCAGCGGC CGGACCGGCC GGTCGAGCAG CGCCGCAAGG CGCGCTGGTT CTACGACGAG
GACGGGATGC TGGTGCTGTC CGCGCGGTTG ACCGCGGACG AGGGGGCGTT GCTGGTCGCC
GCGCTGGAGC AGGCCCGGGG CACCGGGCTG GGCAAGGACG ATCCGCTGCC CGGCGACGCC
GACGCGCTGG TCGCGCTGGC GCAGACCGCG CAGGCCGCTG GCGCGGTGGA CTCCTCGGGG
GACGACCGGC ACCTGGTGGT GGTGCACGCC GACGCCGACG TGCTGATCGG CGCCGACCAG
TCGCCCGATG CGATCTGCCG GATCGAGCAC GGCCCCGGCC TGACCGCCGA CGCGGCCCGC
CGGCTGGCCT GTGACGCGGC GCTGATCGCC TGGGTGTCCT CGGCGGTCTC GCCGGGCAAG
AACCTGCGGC TGGGCCGCAA GACCCGCAAG ATCCCGCCGG CGCTGCGCCG GGCGTTGCGG
CTGCGCGACG GCGGCTGCCG GTTTCCCGGC TGCCCGCGGA TGCGGTTCCT GGACGCACAC
CACGTTATCC ACTGGGCCGA CGGAGGCCCG ACGGATCTGG AGAACCTGAT CCTGCTGTGC
GGGCGGCACC ACCGGTCGAT GCACGAGGAG GGATTCACCC TGGTCCAGGA TGGGCCACAA
CGCTGGTCGG TCCGCCGGCC CGACGGGACC ACGATCCCCG CCGCGCCGCC CCTGCCCCTG
ACGCCGCCCC CGGACGTTCC CGCGGAAACG GAGTACGACC CGGACGCCCT GCGTCCCGAC
CAGCACGGCG AGCCGTTCAG CCTGCGCGAC TCGGTCGACG TGTTCTGCCG GAACCCGCGG
CCATGA
 
Protein sequence
MTDLAGELTD ADMLAMWANA PINPQPLVES DPPDEALAEL ESRITSMAAR LAAQTREWLA 
LVAEFNRRKG WVQWGMRSMA HWLSWSCSVG PGVAREYVRV ATALTELPLV DEAFAQGQLS
YSKVRAVTRV ADQVDQTTLL EQAKVHSAAQ LEKVIRGYRK AQRPDRPVEQ RRKARWFYDE
DGMLVLSARL TADEGALLVA ALEQARGTGL GKDDPLPGDA DALVALAQTA QAAGAVDSSG
DDRHLVVVHA DADVLIGADQ SPDAICRIEH GPGLTADAAR RLACDAALIA WVSSAVSPGK
NLRLGRKTRK IPPALRRALR LRDGGCRFPG CPRMRFLDAH HVIHWADGGP TDLENLILLC
GRHHRSMHEE GFTLVQDGPQ RWSVRRPDGT TIPAAPPLPL TPPPDVPAET EYDPDALRPD
QHGEPFSLRD SVDVFCRNPR P
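
The gene and protein sequences above can be cross-checked against the GC content and translation table fields. The following is a minimal sketch using only the Python standard library. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). All names here are hypothetical, and the snippet uses only the first 30 bases of the gene as a demonstration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Table 11 (bacterial, archaeal, plant plastid) uses these same
# codon-to-amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = clean("ATGACGGACC TGGCGGGCGA GCTGACGGAC")
print(translate(first_blocks))  # MTDLAGELTD
```

Applying `clean`, `gc_content`, and `translate` to the full gene sequence allows the result to be compared with the 73% GC content field and with the protein sequence listed above.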