Gene Namu_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4047 
Symbol 
ID8449667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4462240 
End bp4463790 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content74% 
IMG OID645043093 
ProductHNH endonuclease 
Protein accessionYP_003203328 
Protein GI258654172 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0910573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.468754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGTCG ACGACCACTG GGGCGGCCGC CCGGTGCGGG TCTGGTCCGG GGCCCGGCCC 
CCGGCCCGCT GGGGCCTGCC GGAGTCGTTG ACCCACGGGG TCGTGTTGCC CTTCCGTGCC
AGCGCCGCTC GCGCGGCCGT CCGTGTGCGG TCCTGGCGGG AGACCCTGGC CACCACGGGC
CCGGGGGCCG ACCTGGCCCG GGTGCTGGAG ACCCGGCCCG ACGGCCCCGT CGACCGAGCC
GACGGCTCGA CCGACTGCTC GGCTGAGGCC ACCACGACGT CAGTGCCGCT GTGGGCGAAC
ACGTTGATCG ATTCGCTCCA GGCCCGCCAA TCCCTGATGG CGCACCTGCA GGCCCGGCAG
CAGGCCGATC TGGCCGAGCT GTCCGGCTGC TACCCGGGCC TGCACGAGTT CCTGGCCACC
GAGATCGCCC TCGCCCTGGG CATCGCCGAG GGCACCGCGA CCCGGCAACT GGCCGAGGCC
GTCGACCTGA CCACGCGGCT GCCCGAGACC TTCGCCGCGC TGGACGAGGG CCGGATCACC
CCGGTCAAGG CATCCACCAT CCGGTCCCAC ACCCAGGATC TGGACCTGGA TCAGTCCGCC
CTGGTCGAGG CCGACGTCCT GCCGGCGGCG CCCCGGGGGA CCGTGCCCGA ACTGCGACAT
GCCGTGCAGC GCGCGGTCAT CCGGCACGAC CCGCACGGCG CCGACGACCG GCATCAGGCC
CAGCGCGAGC GCCGCCGGGT CAGTGTCAAG GCCCAGCCCG ACGGGATGGG CAGCCTGTGG
CTGCTGTCCA CCGCGCAGGA CGTCGCCACC ATCCAAGCCT GCCTGGTCGC CGTGGGTGAC
GCGGCCGCCT CCCCGGACGA CGGTCGCACC GCCGACGCCC GCCGGGTCGA TGCCATGGTC
GACCTGTGCG CCGAGGTCCT TGACTCGGGG CAATGGCGGG GCACGGCCCT GCCCACCCGG
CAGCGCCGCC GCCCGCACGT CCAGGTGACC GTCCCGATCA CCGCGCTGCT CGACCCCGCC
ACGACGGCCG GCCAGAGCGC CGAGTTGCAC GGATACGGCC CCATCACTGC GGCCCAGGCC
GCCCAGATCA CCGCCGACGC CACCCTGCGC CGCCTGGTCT GCGACCCGCT GACCGGAGCG
CTGCTGGATT ACGGCCGGAC CACCTACCAC CCACCGGCCG CGCTGGCCGA CCACGTGCTG
GTCCGCGACC AGACCTGCCG GCTGCCCGGC TGCCGACAGC CCGCGCAGCG CTGCGAGCTC
GACCACGTCG AACCGTTCCG CCCGGGCCAC GACACCGGCG GAACCACCAG CGCGACCAAT
CTGTGCCTGG TCTGCAAACA CCACCACCGG GCCAAGGACG GCGGCGAGTT CCTTCTCCGG
CGCACCGCCG ACGGCTACGA CTGGACCAGC CCGCTGGGCC GCCGCTACCG GCAACCGCCG
ACCCGGCTGT GGGAACCGCC GCCGGAACAC CCACCCGAGC GGCCGGCGTC GGTCTTCACC
GCCGACCCAC CCGATCGGCC GATCCGGGAC GACGACCCGC CGCCGTTCTG A
 
Protein sequence
MFVDDHWGGR PVRVWSGARP PARWGLPESL THGVVLPFRA SAARAAVRVR SWRETLATTG 
PGADLARVLE TRPDGPVDRA DGSTDCSAEA TTTSVPLWAN TLIDSLQARQ SLMAHLQARQ
QADLAELSGC YPGLHEFLAT EIALALGIAE GTATRQLAEA VDLTTRLPET FAALDEGRIT
PVKASTIRSH TQDLDLDQSA LVEADVLPAA PRGTVPELRH AVQRAVIRHD PHGADDRHQA
QRERRRVSVK AQPDGMGSLW LLSTAQDVAT IQACLVAVGD AAASPDDGRT ADARRVDAMV
DLCAEVLDSG QWRGTALPTR QRRRPHVQVT VPITALLDPA TTAGQSAELH GYGPITAAQA
AQITADATLR RLVCDPLTGA LLDYGRTTYH PPAALADHVL VRDQTCRLPG CRQPAQRCEL
DHVEPFRPGH DTGGTTSATN LCLVCKHHHR AKDGGEFLLR RTADGYDWTS PLGRRYRQPP
TRLWEPPPEH PPERPASVFT ADPPDRPIRD DDPPPF