Gene Namu_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2521 
Symbol 
ID8448132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2772772 
End bp2774625 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content67% 
IMG OID645041631 
Productintegrase family protein 
Protein accessionYP_003201875 
Protein GI258652719 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0000524212 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00558917 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGGGA CCGCCGAATC GACCGCGGCC CGGGCCGAGG AACCCGGGAT CGGCCAGGCT 
CGGGCGGCCT ACCTGGCCGA CGCTGGCACC CAGCCGGCAC GACTGTCCCG GTCGCGGAGC
ATCGGGCGGT TCGTCACCGA GTTCGGTGAC GTGAGCGGTT GGCGGGCCTG CTCGGTGCCC
GAACGGCTCG CCGCCGGAGA CCAGGCGCGA GCCTTCGCGC ACTTCGCTGT CGTGCACGAC
CGGGTCCCGG TCGAGGCCGA GTACGTCGCA GCCGTGACGT CGAGGTGGGG CCATCACGTC
GCGAACCGAG ATCCGGAGCA GGCTAACCAA TTCCGGCGCC AGGCCGCGTC GTTGAGTTTC
AATCGGTTGG AGATCGACAA GATGTGGTCC AAGCTCGCCC AGATCTGCGT CATCACCGGC
AGCCGCCCGG ACGAACTGAG CAGCGACAAC TACGTGGCCG GGCGGGCCGC GTTCTGGGCC
GCGGTCGTCG CCAAACACGG CGGCACGATC CCGACAACAC TGGCGACCCC CCTGTTCGGC
CTGGACGCCG TGATGTTCCA CCGGGGCCAA GCGCCCCGAC CCACGACACG CAAGTCATGG
GCTGCCCGGT CGGTGCCGGA GACCAGCTGG GACCAGATCA CCGCCGCCGC CCCGATGATG
GCCGCCACCA TGCACCGCTA CCTGGACCAG CTGCGGGTCA GCCTGCGCGC TTCATCGGTC
GCCAGCATCG AGATCACCCT GCGGCAGATC GCCGGGCACC TGATCTCCAC CAGCAGCGTC
ACGACGGTCG CCGACATCGG CCGGCCGCAG ATCGAGGCCT ACGGGACCTG GCTGGCCGGG
CGTGGCGGCT ACCGCAAGAA CAGCACAATC AGCAAAACGA CCATCGGGAT GCGGATGTGT
CACCTCGGCG CGTTCTTCCG CCGGATCATC GAATGGGGCT ACCCCGACGC ACCCCTTCGG
CCACCGGTCT TCTCCTCCGA CCGGCCGGCC AAAGACAAGC CGCTGCCCCG GTTCCTCGAC
GACGCTGCTA CCGCCAAGTT CATGGCCGCG GCACGGGAGC TGCCCGACCC GTTCGGTCGG
CTCGCCATCG AGATCTTGGC CCGCACCGGC ATGCGCAAGG GTGAGCTACT CGGTCTGACC
ACCGACGCCG TTGTGCAGAT CGGCTCCGCC TACTGGCTGC GGATCCCTGT CGGCAAACTC
CACAACGACC GCTACGTTCC TCTCCACCCG CAACTGAAAA CCATGATCGA CAACTGGCTC
GAACAGCGAC CCGACTGGCA AAACAGCCAC CTGCTGTTTA CCGACCGGGG CCGGCCGATC
CCACCCACCC GAGTCGACCA CGCCGTTCAA ACAGCCGCCA CCGCAGCCGG TATCGGTCAC
GTGCACCCAC ACCAACTCCG GCACACGCTA GCCACCCAGG CAATCAACCG CGGCATGAGC
CTCGAAGCGA TCGCCGCGCT GCTGGGCCAC AAGACGATGA GCATGACCCT GGTCTACGCG
CGCATCGCCG ACCGCACCGT CGCCGACCAG TACTTCACCG TCACCGAGAA GGTCCAAGCT
CTCTACCAAC AGCAGCAACC CGCAATGCTA CCCGCCGCGG ACGAACCAGC CCCGATGCGC
AAACTCCGCG CCGAACACAG CAAACGCATG CTCGGCAACG GCTTCTGCAC CCGACCCGCC
GAACTCGAAT GCCACTACGA AACGATCTGC GAGTCCTGCA CCTTCTTCGT CACCACCATC
GAATTCCGAC CCACCCTGCA GGCTCAACGT GACGACGCCG CCCGCAAGGG TCAAAGCGGC
CGTCAGAAGG TCTACGACGG ACTCCTTCAG CGACTCGCCG ACACCGCTAC TTGA
 
Protein sequence
MSGTAESTAA RAEEPGIGQA RAAYLADAGT QPARLSRSRS IGRFVTEFGD VSGWRACSVP 
ERLAAGDQAR AFAHFAVVHD RVPVEAEYVA AVTSRWGHHV ANRDPEQANQ FRRQAASLSF
NRLEIDKMWS KLAQICVITG SRPDELSSDN YVAGRAAFWA AVVAKHGGTI PTTLATPLFG
LDAVMFHRGQ APRPTTRKSW AARSVPETSW DQITAAAPMM AATMHRYLDQ LRVSLRASSV
ASIEITLRQI AGHLISTSSV TTVADIGRPQ IEAYGTWLAG RGGYRKNSTI SKTTIGMRMC
HLGAFFRRII EWGYPDAPLR PPVFSSDRPA KDKPLPRFLD DAATAKFMAA ARELPDPFGR
LAIEILARTG MRKGELLGLT TDAVVQIGSA YWLRIPVGKL HNDRYVPLHP QLKTMIDNWL
EQRPDWQNSH LLFTDRGRPI PPTRVDHAVQ TAATAAGIGH VHPHQLRHTL ATQAINRGMS
LEAIAALLGH KTMSMTLVYA RIADRTVADQ YFTVTEKVQA LYQQQQPAML PAADEPAPMR
KLRAEHSKRM LGNGFCTRPA ELECHYETIC ESCTFFVTTI EFRPTLQAQR DDAARKGQSG
RQKVYDGLLQ RLADTAT