Gene Namu_4520 details

Gene Information

Locus tag: Namu_4520
Symbol:
ID: 8450147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5022745
End bp: 5024226
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 70%
IMG OID: 645043560
Product: Integrase catalytic region
Protein accession: YP_003203788
Protein GI: 258654632
COG category:
COG ID:
TIGRFAM ID:
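
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5022745, 5024226)
print(length)                  # 1482
print(protein_length(length))  # 493
```

Both results agree with the listed gene length (1482 bp) and protein length (493 aa).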


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGG ACACGGTCAA GCAGCGGGAA CGGGCGCAGC AGATCGCGTT GTTCCGATAT 
CAGTTGATCT GCCCGGCGCT GGAACCGGGC CTGTCGACCA AGCAACGCGG ACGGGTCGTT
CGGGCGATCG CCGACCGGGA ACACGACGGC CCGTTCGGCG GCCGGGTCCG ATACTCGCGG
GAGTCGTTGG ACCGGTGGAT CCGCCGGTAC CGGGCCGGCG GGTTCGAAGG TCTGTGCCCG
TCGCCCCGGG AACCCGGCAC CCGGATCGAC ACCGGCGTGT TCGAGCTGGC CGCCGGTCTG
AAACGGGAGA ACCCGGCCCG CACGGTCGCC CAGGTCGCCC GGATCCTGCG ATCCTCGACC
GGCTGGTCAC CGTCGGAAAC GACGCTGCTG CGGCATTTCC ACCGGCTGGA CCTGATGGTG
CCCGGCGGCG CCGGGCCGGC CGTGTTCGGC CGGTTCGAAG CGGCCGATTG CAACGAACGG
TGGGTCGGCG ACGCCCTGCA CGGGCCCAGG GTCGCCGGCC GGAAAACGTA CTTGTTCGCG
TTCCTGGACG ACCACAGCCG GGTGGCCGTG GGGTATCGGT TCGGGTTCGC CGAGGACACC
GTCCGGCTGG CCGCGGCCCT GCAACCCGCG TTGGGCAGCC GCGGCGTCCC CGGCTCGGTC
TACGTCGACA ACGGGTCCGC GTTCGTCGAC AACTGGCTGC TGCGGGCCTG CGCGGTGCTC
GGGATCCGGC TCGTCCACAG CCGTCCGGGG CAGCCGCAGG GGCGGGGCAA GATCGAACGC
TGGTTCCGCA GCGTGCGCGA CCAGTTCCTG GTCGAGATCG ATGACAGCAC CGCCGACCAG
ATCCGGGATA CCGGGATGAC CCCCGCCGGC GCCCTGCTGG AACTCAACGG GTTGTTCACC
GCCTGGGTCG AGGCGTCCTA TCACCACCAC GTGCATTCCG AGACCGGGCA GAGCCCCTTG
CAACGCTGGA CCGACGGGTG GCAGCGGGCC GGCCGGTCTC CGGCGATGCC GACCCCCGCG
GATCTGACCG AGGCGTTCCT GTGGTCCGAA CAACGCGTGG TCACCAGGAC CGCGACGGTG
TCGCTGCACG GCAACACCTA CCAGGTTCAG GCGGGGCTGG TCGGTCGGAA AGTCGAGTTG
GTGTTCTCCC CGTTCGATCT GGAAACCCTG CGGGTCCGCT ACGACGGCCG GGACCACGGG
CCGGCGGTGC CGCATCGGAT CACCCGGCAC ACCCATCCCA AGGCCAGACC CGAGACCCCT
GAACCGGCAA CGACACCGCG GACGGGGATC GACTACCTGG CGCTGGTCGC GCAGGACCAC
CAGCAACAGA TCAGTGCCGA CCAGAAGATC AACTATCACG CCCTCTACCC AGGTGAGCTG
CCCGGGCAGC GCAGCATCGA CGACGCCCTG GCCGACCTCA ACGGCAACGA CGGCAACGAC
GGCAACGACG GCAACGACGA TGGTCAGGCG GTGGCCCGGT GA
 
Protein sequence
MSVDTVKQRE RAQQIALFRY QLICPALEPG LSTKQRGRVV RAIADREHDG PFGGRVRYSR 
ESLDRWIRRY RAGGFEGLCP SPREPGTRID TGVFELAAGL KRENPARTVA QVARILRSST
GWSPSETTLL RHFHRLDLMV PGGAGPAVFG RFEAADCNER WVGDALHGPR VAGRKTYLFA
FLDDHSRVAV GYRFGFAEDT VRLAAALQPA LGSRGVPGSV YVDNGSAFVD NWLLRACAVL
GIRLVHSRPG QPQGRGKIER WFRSVRDQFL VEIDDSTADQ IRDTGMTPAG ALLELNGLFT
AWVEASYHHH VHSETGQSPL QRWTDGWQRA GRSPAMPTPA DLTEAFLWSE QRVVTRTATV
SLHGNTYQVQ AGLVGRKVEL VFSPFDLETL RVRYDGRDHG PAVPHRITRH THPKARPETP
EPATTPRTGI DYLALVAQDH QQQISADQKI NYHALYPGEL PGQRSIDDAL ADLNGNDGND
GNDGNDDGQA VAR