Gene Namu_3706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3706 
Symbol 
ID8449325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4069238 
End bp4070242 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content69% 
IMG OID645042764 
Productintegrase family protein 
Protein accessionYP_003203000 
Protein GI258653844 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.000354013 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.710204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCG TGCCCTGTTT GCAGCGCCCG GCCGGGGAGG TGCCGCGGCT CGGGGAGGTG 
CTGTTGGACG AGTACCTGCG GTTCGTCGCG GCGCGGTGTC GGCCGAATAC GCTGCTGGCT
CAGATGTTCG ACCTGAAGGT GTTCTTCACG GTCGTGGGGC GGCCGCCGGT GGAGGTGACC
ACGGCCGACG TCCTTCGCTT CATCGAGCGG CAGCGGGCAG CCCGTAACGG CAACGTGGTC
CGGCTCGCCG ATGGCGAGTC GGGGTTGGCG TTGACGACGA TCAAGCGGCG TCTCGCGACG
GTCTCCGGCC TGTTCGAGTA TCTGGCCATT CGCGGGTTGG TGGCACGGAA CCCGGTGCCG
CGCAGTCTGT CTGCTCGGCC GGGCCGGGCA CCTGTGCGAG GTGCTCCGTT GATCCGGGCG
CCGCGCCGGC TGCCGCGGAT CCTGAGCCCG GCCGAGGTGA ACGCGCTGAT CAGCGCGCTG
CGGACGGCCC GGGACCGGGC CATGGTGTGG CTGATGCTGC TCGGCGGCCT TCGCCGCTGC
GAGGTCCTGG GCCTGCGGCA TCGTGACGTC CAACCGGGCG AGCGGCGAGT GTTCGTCACC
GGCAAGGGCG GCCACGAGCG GGTCGTGCCG GTCGGGAAGG TGTTCTTCGC CGAGCTGGCC
GGCTATTACG CCACGGAGCG ACCAGACACC GACACCGATC AGGTGTTCGT TGTGTTGAAG
GGACAACGCC GAGGCCAGCC GCTGTCCGCG GCCGGGGTGG ACGAGGTGCT CTCCGGCGCC
CGCCGGCGGG CCGGGCTCGC CCACGCCACC TGCCATGAGT TGCGCCATAC CTGCTTCACC
CGGCTCCGCG AATCCGGGAT GGCGTTGGAG GCGATCCAGG CCCAGGCTGG GCACGTCTCG
ATCGAGACCA CCAAGATCTA CCTGCATCTG GCCCCGGACT GGCTGGTCGA CGAGTACCGA
AAGGCGATGG ACATCCTCGA CGACATCGCG GGAGCTCAAG GATGA
 
Protein sequence
MEFVPCLQRP AGEVPRLGEV LLDEYLRFVA ARCRPNTLLA QMFDLKVFFT VVGRPPVEVT 
TADVLRFIER QRAARNGNVV RLADGESGLA LTTIKRRLAT VSGLFEYLAI RGLVARNPVP
RSLSARPGRA PVRGAPLIRA PRRLPRILSP AEVNALISAL RTARDRAMVW LMLLGGLRRC
EVLGLRHRDV QPGERRVFVT GKGGHERVVP VGKVFFAELA GYYATERPDT DTDQVFVVLK
GQRRGQPLSA AGVDEVLSGA RRRAGLAHAT CHELRHTCFT RLRESGMALE AIQAQAGHVS
IETTKIYLHL APDWLVDEYR KAMDILDDIA GAQG