Gene Namu_3611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3611 
Symbol 
ID8449230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3966849 
End bp3968048 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID645042681 
Productintegrase domain protein SAM domain protein 
Protein accessionYP_003202917 
Protein GI258653761 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00442535 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0137307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATTCGT TGCCCGCGCC GCGCGATGTC GCGATGCTTC GTGTTCCAAC GATCGGCCGG 
GTCCACCGGA CTGGTCGGGG TCCGTGGGAG GTGCTGGGCG CAGACGGCGA GCCGGTGCTG
GCGTGGGTGA GTTTCCGGGG CGAGCTTGTC GCCGGCGGGT GTTCCGCCGC GACGTGCCGT
TCCTACGCCC ACGACATTCT GCGGTGGCTG CGTTTCCTTG CCGCGGTGGG CGTTTCGTGG
CAGCAGGCCG GCAGGGTCGA GGTGCGCGAC TACGTGCGGT GGCTGCGCAC GGCGGCCAAC
CCAGCCCGCG ACCGGCGCAC AGCGGCGGGT GGCCGGCCAC CGGCCGGGAC GGTGAACACC
GCGACGGGCA AGGCCTACCT AGCGGCGGGG TATGCGCCCA GGACCATCAA CCATGCGTTG
TCGGTGCTCA GCGAGTTCTA TCGGCACGCC GTCGACGCCG ATCTCGGCCC GTTGCGGAGC
CCGGTGCCGC TGCGGCGGAG CGTGACCCGG TTCCCCGGCC AGTCCACGGC ACGAGCGGCT
GTCGGCGGCC CGGCCTATCG GCAGCGGGAA CCGGTCGCTC AACCCCGCGC GCTGTCCGAG
CCGTTGCTGC AGCGTGTGTT CGCGGCTCTG CGGCATGACC GGGACCGTGC CCTGATCGCG
GTGGCGTTGA GTTCGGGGGC GCGGGCGAGC GAGTTGTTGT CGATGGTCCG CAACGGGATC
GACGTCGGCT TGGGTGTGGT GTCGGTGGTT CCGAAGGGTC GACCGGGTCG GGTGTGGATA
CCGCTGGCGC CGGAGGCGCT GGTGTTGATT GGCCGTTACC TCGCGGCGCA ACCACTGGGG
TTACCCGACG ACCCGGTGTG GATGACGATC CGTCGGCCGG CTCATCCGTT GACGTATTTC
GCGATGCGTC AGGTCCTGGA GCGGGTCAAC CAGGAGCTCG GCACGAACAT CACCTGGCAC
GACTTCCGTC ACACGTTCGC GCATCGGCTG TTGGCCGACG ACCGGTTGTC ACTGACGGAT
GTGCAGACGC TGATGCGGCA CCGCAGCCTG ACGACGTTGA CGGACTACTC CGCGGCCAGG
TTGGACGAGT TGGTGACCCG CTTGCACGAG CACCTGGCCC GGCCGGCGCC GGCGCCCACC
GTGGCCGTGG GCTACGACCA GGACGACATG CAGGTCCTGT TCCCGGGTTT GACACCGTGA
 
Protein sequence
MDSLPAPRDV AMLRVPTIGR VHRTGRGPWE VLGADGEPVL AWVSFRGELV AGGCSAATCR 
SYAHDILRWL RFLAAVGVSW QQAGRVEVRD YVRWLRTAAN PARDRRTAAG GRPPAGTVNT
ATGKAYLAAG YAPRTINHAL SVLSEFYRHA VDADLGPLRS PVPLRRSVTR FPGQSTARAA
VGGPAYRQRE PVAQPRALSE PLLQRVFAAL RHDRDRALIA VALSSGARAS ELLSMVRNGI
DVGLGVVSVV PKGRPGRVWI PLAPEALVLI GRYLAAQPLG LPDDPVWMTI RRPAHPLTYF
AMRQVLERVN QELGTNITWH DFRHTFAHRL LADDRLSLTD VQTLMRHRSL TTLTDYSAAR
LDELVTRLHE HLARPAPAPT VAVGYDQDDM QVLFPGLTP