Gene Namu_4622 details


Gene Information

Locus tag: Namu_4622
Symbol:
ID: 8450250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5143727
End bp: 5144962
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 60%
IMG OID: 645043663
Product: Restriction endonuclease S subunits-like protein
Protein accession: YP_003203890
Protein GI: 258654734
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
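
The start, end, gene length and protein length fields are related by simple arithmetic. A minimal Python sketch of that check follows. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5143727
END_BP = 5144962


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one terminal stop codon that is not counted."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.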


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.156513
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG ATCTCACTAC GCTCGGAGCT GTTGTTAAGG CCACCGGGGG CGTCCTCCAA 
ACCGGGCCCT TCGGAAGTCA GCTTCACGCC AGTGACTACC AGTTCACCGG CAAACCACTC
GTAATGCCGG TGAATCTCGG AGACAACGAA ATCCGGGAGG CTGGCATTGC GCGCATCGGG
GTTGAGGACG CTCACAGACT GCGTCGCCAT GCTCTGCGGG AGGGCGACAT CATTTTCAGC
AGGAGAGGAG ACGTTGGTCG TCGATCCCTC GTGCGGACGA GAGAGGCGGG TTGGCTCTGC
GGGACAGGCT GCCTCGCGGC TCGATTTGGA AGTGACCGGA CGACGGTCAA TCCGGCGTAC
GTCGCTGACT ACCTTGGAGG GACGTCGGCG CAGGCATGGC TCGTCGACAA TGCCGTTGGC
GGGACCATGC CCAACCTGAA TACAAGCATT CTTTCGGCAT TACCCGTGTG GCTACCGTCG
AAATTGGAGC AGGACCGTAT TGTTGCCGCG CTTGAAGATG TCCGGAAAGT GATCGATTCC
ATCCAGCACC TTATCGCCAA GAGGCAGGCG ATCAAGCAAG GCATGATGCA GCATCTCCTT
ACGGGTCGAA CGCGGCTCCC AGGTTTCAAC GAGGCATGGA GCGAGACAAC GCTCGGAGCT
GTCGCACGTT TCAGCAAGGG TGCGGGACTT CCGAAGGCGG CTCTGACATC TTCTGGCTCG
ACCCTGTGTA TTCATTACGG TGAGCTATTC ACGTTCTACG GTCCCGAAAT CCGTCAGGTT
TTCAGCCGAA CAACGCCTAC CGGACGCGTG GTCGTGTCTG AGGACCTCGA TGTCCTGATG
CCTACGTCCG ATGTGACACC ACGCGGACTG GCTAAAGCCA GTGCGATCCA CGGCGCCGGA
GTCGTATTGG GCGGCGACAT CCTTATCATT CGACCTGACA AGGCACATGC TCATGGCCCG
TTCGTCGCTC ACGCCATTCG GCATCACGCG GACCAAGTGC TCCAGCTCGT GCGCGGGTCC
ACTGTCTACC ACCTGTATGC CACTGACATG CGAAATTTCG CGCTCTCGCT CCCGTCGGTG
AATGAGCAGC GTGCGATCGC CGGCGCACTG CTTGACGCCG ATCGACAACT CGAAGCGTTG
GAGGAGCGTC TGATGAAGGC TCGCGCCTTC AAGACCGGAA TGATGCAGCG CCTCCTGACT
GGACATACGC GCTTGCCGAC GGAGGCCGCG ACATGA
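
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in 10-base blocks. The sketch below is one possible way to do it, not the method used by the source database; `gc_content` is a hypothetical helper name. It is demonstrated on the first 30 bases only, so pasting the full gene sequence is needed for a gene-level value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence above (30 bases only).
first_blocks = "ATGAGCGGCG ATCTCACTAC GCTCGGAGCT"
print(f"{gc_content(first_blocks):.0%}")  # 60% for these 30 bases
```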
 
Protein sequence
MSGDLTTLGA VVKATGGVLQ TGPFGSQLHA SDYQFTGKPL VMPVNLGDNE IREAGIARIG 
VEDAHRLRRH ALREGDIIFS RRGDVGRRSL VRTREAGWLC GTGCLAARFG SDRTTVNPAY
VADYLGGTSA QAWLVDNAVG GTMPNLNTSI LSALPVWLPS KLEQDRIVAA LEDVRKVIDS
IQHLIAKRQA IKQGMMQHLL TGRTRLPGFN EAWSETTLGA VARFSKGAGL PKAALTSSGS
TLCIHYGELF TFYGPEIRQV FSRTTPTGRV VVSEDLDVLM PTSDVTPRGL AKASAIHGAG
VVLGGDILII RPDKAHAHGP FVAHAIRHHA DQVLQLVRGS TVYHLYATDM RNFALSLPSV
NEQRAIAGAL LDADRQLEAL EERLMKARAF KTGMMQRLLT GHTRLPTEAA T
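
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as starts. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI's T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 36 bases of the gene sequence above; both are in frame.
print(translate("ATGAGCGGCG ATCTCACTAC GCTCGGAGCT"))         # MSGDLTTLGA
print(translate("GGACATACGC GCTTGCCGAC GGAGGCCGCG ACATGA"))  # GHTRLPTEAAT
```

The two outputs match the first ten and last eleven residues of the protein sequence listed above.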