Gene Namu_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1238 
Symbol 
ID8446834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1359930 
End bp1361783 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content67% 
IMG OID645040373 
Productintegrase family protein 
Protein accessionYP_003200632 
Protein GI258651476 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.375916 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGA CCGCCGAATC GACCGCGGCC CGGGCCGAGG AACCCGGGAT CGGCCAGGCT 
CGGGCGGCCT ACCTGGCCGA CGCTGGCACC CAGCCGGCAC GACTGTCCCG GTCGCGGAGC
ATCGGGCGGT TCGTCACCGA GTTCGGTGAC GTGAGCGGTT GGCGGGCCTG CTCGGTGCCC
GAACGGCTCG CCGCCGGAGA CCAGGCGCGA GCCTTCGCGC ACTTCGCTGT CGTGCACGAC
CGGGTCCCGG TCGAGGCCGA GTACGTCGCA GCCGTGACGT CGAGGTGGGG CCATCACGTC
GCGAACCGAG ATCCGGAGCA GGCTAACCAA TTCCGGCGCC AGGCCGCGTC GTTGAGTTTC
AATCGGTTGG AGATCGACAA GATGTGGTCC AAGCTCGCCC AGATCTGCGT CATCACCGGC
AGCCGCCCGG ACGAACTGAG CAGCGACAAC TACGTGGCCG GGCGGGCCGC GTTCTGGGCC
GCGGTCGTCG CCAAACACGG CGGCACGATC CCGACAACAC TGGCGACCCC CCTGTTCGGC
CTGGACGCCG TGATGTTCCA CCGGGGCCAA GCGCCCCGAC CCACGACACG CAAGTCATGG
GCTGCCCGGT CGGTGCCGGA GACCAGCTGG GACCAGATCA CCGCCGCCGC CCCGATGATG
GCCGCCACCA TGCACCGCTA CCTGGACCAG CTGCGGGTCA GCCTGCGCGC TTCATCGGTC
GCCAGCATCG AGATCACCCT GCGGCAGATC GCCGGGCACC TGATCTCCAC CAGCAGCGTC
ACGACGGTCG CCGACATCGG CCGGCCGCAG ATCGAGGCCT ACGGGACCTG GCTGGCCGGG
CGTGGCGGCT ACCGCAAGAA CAGCACAATC AGCAAAACGA CCATCGGGAT GCGGATGTGT
CACCTCGGCG CGTTCTTCCG CCGGATCATC GAATGGGGCT ACCCCGACGC ACCCCTTCGG
CCACCGGTCT TCTCCTCCGA CCGGCCGGCC AAAGACAAGC CGCTGCCCCG GTTCCTCGAC
GACGCTGCTA CCGCCAAGTT CATGGCCGCG GCACGGGAGC TGCCCGACCC GTTCGGTCGG
CTCGCCATCG AGATCTTGGC CCGCACCGGC ATGCGCAAGG GTGAGCTACT CGGTCTGACC
ACCGACGCCG TTGTGCAGAT CGGCTCCGCC TACTGGCTGC GGATCCCTGT CGGCAAACTC
CACAACGACC GCTACGTTCC TCTCCACCCG CAACTGAAAA CCATGATCGA CAACTGGCTC
GAACAGCGAC CCGACTGGCA AAACAGCCAC CTGCTGTTTA CCGACCGGGG CCGGCCGATC
CCACCCACCC GAGTCGACCA CGCCGTTCAA ACAGCCGCCA CCGCAGCCGG TATCGGTCAC
GTGCACCCAC ACCAACTCCG GCACACGCTA GCCACCCAGG CAATCAACCG CGGCATGAGC
CTCGAAGCGA TCGCCGCGCT GCTGGGCCAC AAGACGATGA GCATGACCCT GGTCTACGCG
CGCATCGCCG ACCGCACCGT CGCCGACCAG TACTTCACCG TCACCGAGAA GGTCCAAGCT
CTCTACCAAC AGCAGCAACC CGCAATGCTA CCCGCCGCGG ACGAACCAGC CCCGATGCGC
AAACTCCGCG CCGAACACAG CAAACGCATG CTCGGCAACG GCTTCTGCAC CCGACCCGCC
GAACTCGAAT GCCACTACGA AACGATCTGC GAGTCCTGCA CCTTCTTCGT CACCACCATC
GAATTCCGAC CCACCCTGCA GGCTCAACGT GACGACGCCG CCCGCAAGGG TCAAAGCGGC
CGTCAGAAGG TCTACGACGG ACTCCTTCAG CGACTCGCCG ACACCGCTAC TTGA
 
Protein sequence
MSGTAESTAA RAEEPGIGQA RAAYLADAGT QPARLSRSRS IGRFVTEFGD VSGWRACSVP 
ERLAAGDQAR AFAHFAVVHD RVPVEAEYVA AVTSRWGHHV ANRDPEQANQ FRRQAASLSF
NRLEIDKMWS KLAQICVITG SRPDELSSDN YVAGRAAFWA AVVAKHGGTI PTTLATPLFG
LDAVMFHRGQ APRPTTRKSW AARSVPETSW DQITAAAPMM AATMHRYLDQ LRVSLRASSV
ASIEITLRQI AGHLISTSSV TTVADIGRPQ IEAYGTWLAG RGGYRKNSTI SKTTIGMRMC
HLGAFFRRII EWGYPDAPLR PPVFSSDRPA KDKPLPRFLD DAATAKFMAA ARELPDPFGR
LAIEILARTG MRKGELLGLT TDAVVQIGSA YWLRIPVGKL HNDRYVPLHP QLKTMIDNWL
EQRPDWQNSH LLFTDRGRPI PPTRVDHAVQ TAATAAGIGH VHPHQLRHTL ATQAINRGMS
LEAIAALLGH KTMSMTLVYA RIADRTVADQ YFTVTEKVQA LYQQQQPAML PAADEPAPMR
KLRAEHSKRM LGNGFCTRPA ELECHYETIC ESCTFFVTTI EFRPTLQAQR DDAARKGQSG
RQKVYDGLLQ RLADTAT