Gene Namu_4946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4946 
Symbol 
ID8450577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5522951 
End bp5524420 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content69% 
IMG OID645043984 
ProductIntegrase catalytic region 
Protein accessionYP_003204208 
Protein GI258655052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGG ACACGGTCAA ACAACGGGAA CGGGCGCAGC AGATCGCGTT GTTCCGATAT 
CAACTGATCT GCCCGGCCCT GGAACCCGGG CTCTCGACCA AGCAACGCGG CCGGATCGTC
CGGGCGATCG CCGACCGGGA CCACGACGGC CCGTTCGGCG GCCGGGTCCG GTACTCACGG
GAATCGTTGG ACCGGTGGAT CCGCCGCTAC CGGGGCGGCG GGTTCGAGGC CCTGTCCCCG
TCACCCCGGC AGCCCGGCGC CCGGATCGAC GCCCAGGTGT TCGAGCTGGC CGCCGCCCTG
AAACGGGAGA ACCCGGACCG CACGGTCGCC CAGGTCGCGC GGATCCTGCG AGCCAGCACG
GGTTGGTCAC CGTCGGAAAC GACGCTGCTA CGACATTTTC ACCGGCTGGA CCTGATGGTC
CCCGGCGGCG CCGGCCCGGC CGTGTTCGGC CGGTTCGAGG CCCCGGACTG CAACGAACGG
TGGGTCGGTG ACGCCCTGCA CGGCCCCAGG GTCGCCGGCC GGAAAACCTA CCTGTTCGCT
TTCCTTGATG ATCATTCACG GCTGGCCGTG GGGTACCGGT TCGGGTTCGC CGAGGACACC
GTCCGACTGG CCGCGGCCCT GCAACCCGCC CTGGGCAGCC GCGGCGTCCC CGGCTCGGTC
TACGTCGACA ACGGGTCCGC GTTCGTCGAC AACTGGCTGC TGCGGGCCTG CGCGGTGCTC
GGGATCCGGC TGGTCCACTC CCGTCCCGGC CAGCCGCAGG GGCGGGGCAA GATCGAACGC
TGGTTCCGCA GCGTGCGGGA CCAGTTCCTC GTCGAGATCG AAGACAGCAC CGCCGAGAAG
GTCCGCGACG CGATCATGAC CCCGGCCCAG GCGCTGCTGG AACTCAACGG ATCGTTCACC
GCGTGGGTCG AAGCGTCCTA CCACCACCGC GTCCATTCCG AGACCGGGCA AACCCCTTTG
CAACGCTGGA ACGACGGGTG GCAGCGGGCC GGCCGGTCCC CGGCCATGCC GACCCCGGCC
GATCTGACCG AGGCGTTCCT GTGGTCCGAA CAACGGGTGG TGACCAAGAC CGCGACGGTG
TCGCTGCACG GCAACACCTA CCAGGTCGAC CAGGTCCTGG CCGGCCGGAA GGTCGAGTTG
GTGTTCTCCC CGTTCGACCT GGAAACCATC CGCGTCCGCT ACGACGGCCG CGATCACGGG
CCGGCCGTGC CGCACCGCAT CCACCGGCAC ACCCACCCCA AAGCGCGACC CGAAACCCCG
GAGCCGGCCA CTACACCGCG GACCGGGATC GACTACCTCG CGCTGGTTGC GCAGGACCAC
CAACAACAGA TCACCACCGA TCAGAAGATC AACTACCACG CCCTCTACCC CGGTGAACTG
CCCGGACAGC TCAGCATCGA CGACGCCCTG GCCGACATCA ACACCCACGA CCGACCCGAC
CAGGGCGAAC GGCCGGGGGT GGCCGGATGA
 
Protein sequence
MSVDTVKQRE RAQQIALFRY QLICPALEPG LSTKQRGRIV RAIADRDHDG PFGGRVRYSR 
ESLDRWIRRY RGGGFEALSP SPRQPGARID AQVFELAAAL KRENPDRTVA QVARILRAST
GWSPSETTLL RHFHRLDLMV PGGAGPAVFG RFEAPDCNER WVGDALHGPR VAGRKTYLFA
FLDDHSRLAV GYRFGFAEDT VRLAAALQPA LGSRGVPGSV YVDNGSAFVD NWLLRACAVL
GIRLVHSRPG QPQGRGKIER WFRSVRDQFL VEIEDSTAEK VRDAIMTPAQ ALLELNGSFT
AWVEASYHHR VHSETGQTPL QRWNDGWQRA GRSPAMPTPA DLTEAFLWSE QRVVTKTATV
SLHGNTYQVD QVLAGRKVEL VFSPFDLETI RVRYDGRDHG PAVPHRIHRH THPKARPETP
EPATTPRTGI DYLALVAQDH QQQITTDQKI NYHALYPGEL PGQLSIDDAL ADINTHDRPD
QGERPGVAG