Gene Namu_3579 details

Gene Information

Locus tag: Namu_3579
Symbol:
ID: 8449198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3930560
End bp: 3932224
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 71%
IMG OID: 645042653
Product: Integrase catalytic region
Protein accession: YP_003202889
Protein GI: 258653733
COG category:
COG ID:
TIGRFAM ID:
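
The coordinates and lengths in the table above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (a common convention that this page does not state explicitly). The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3930560
end_bp = 3932224

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. The final codon is the stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1665
print(remainder)                # 0
print(expected_protein_length)  # 554
```

Under that assumption the span matches the listed gene length of 1665 bp, and 555 codons minus one stop codon gives the listed protein length of 554 aa.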


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.318948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0294247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACG AGGCGGTGGT CCAGGAGCGG GCGCTGTCGG TTCCGGACGA GGTTTGGAAT 
CTGGCGGTTC GTCGGGCAGC TGTGATCGGC CCGCTGGCTG AGCGGGATGT CGTGGGTCGG
GCGTCGGTGG AGGCAGCCGC GGCGGAGCTG CGGGTGTCCG TTCGTCAGGT CTACGTGCTG
CTTCGCCGCT GGCGGCAGGG CGAGGGGGTG GTGTCGGATC TGATTCCGGG CTGGTCCAGC
GGCGGTCGTG GTCGCGACCA GCTCCCGGAG GAGGTCGAGG TGGTTATCCG GGAGGTGCTG
CGCCGGCAAT ACCTGACTCG GCAGCGGAAG ACGGTCGCGA CGGTGCACCG GGAGATCGTC
CGGCAATGCC GCACCCAGGG ACTGCGGCCG CCGTCGCGGG GTTCGGTGGT GCGGCGGATC
GCGAAGCTCG ACCCGGCGCA GGCCACGCGT CGGCGGGAGG GATCCGACGC GGCGCGGGCC
CGGGCGTCGG CCGGCGGGGT GCCTCCGTCG GTCACCGCGG TGCTGGAACA GATCCAGATC
GACCATACGG TGGTCGACCT GATCGTGGTC GACGAGCGGC ACCGGCTGCC GATCGGCCGC
CCGTATCTGA CGGCCGGGAT CGACGTGTTC AGCCGGGCGG TCCCCGGATT GGTGGTCACG
TTGGAGGCGC CGTCGGCGAC CACGGTCGGG TTGTGCCTGG CGCACATGGT CACCGACAAG
CGGCCCTGGC TGGAGCACCT CGGGGTGGAG GCGTCCTGGC CGATGAGCGG CAAACCGGTC
GAGCTGTACC TGGACAACGC CGCGGAATTC AGGAGCGAGG CGTTGCGTCG CGGATGTGAG
CAGCACGGGA TCGCGCTGCG GTACCGGCCA CCCGGGCGGC CGCACTACGG CGGCATCGTC
GAGCGGGTGA TCGGCACGAT GATGCGGCTG ATCCACGAGC TGCCCGGGAC GACGTTCTCC
AATCCCGGGC AGCGATCGGG GTACGACGCC GACGCGACGG CCTGCCTGAC GATGACGGAG
CTGCAGCGGT GGCTGGCCCT GGCGGTGGCC GCCTATCACG GACAGGTCCA CGGCACCCTG
CGCCAGCCAC CCGCGGCCCG ATGGGTCGAC GGGATCGCCA CCACCGGCGC GCCGGCGGTC
GTCACCCACG AGGCGGCGTT CCTGGTCGAC TTCCTGCCGG TGATCCGCCG GACCCTGACC
AGAACCGGGT TCGTGATCGA CCACGTGCAC TACTTCTGCG ACGCGCTCAA GCCCTGGATC
GCCCGCCGGG ATCGACTGGG CCGGTTCGTG ATCCGACGGG ACCCGAGAGA CATCAGCCGG
GTCTGGGTCC TGGACCCGGA CGACGGGACC TACCTCGAGG TGCCGTACCG GACCATGTCC
CATCCGGCGG TGAGCGTGTG GGAACACCGG GCGGCGGTGG AACGACTCCG CACCCTTGGC
CGTGACCAGG TCGACGAGGA CGCCCTGTTC CGGACGGTGG AGCAGATGCG GTCCATCACC
GAGTCAGCCG CGGCGACTAC CCGCAAGGCC CGCCGCGACA CCGCCCGTCG AGCAGCCAGC
GGCGGCACGA GCGCCCAGCG GCCCAGCAGC ACTACGCCGG AATGTCCACC CGACGACGCC
GTGGCGACGC CGGCGGCCCC ATTCGACGTG ATCGAGCAAT GGTGA
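
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence above. The following is a minimal Python sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown here. `gc_percent` is a hypothetical helper name, not part of any tool used by this page, and only the first line of the sequence is included as sample input.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks of the 10-base block layout)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Sample input: the first line of the gene sequence above.
# Paste all sequence lines here to recompute the value for the whole gene.
first_line = "ATGGTCGACG AGGCGGTGGT CCAGGAGCGG GCGCTGTCGG TTCCGGACGA GGTTTGGAAT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```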
 
Protein sequence
MVDEAVVQER ALSVPDEVWN LAVRRAAVIG PLAERDVVGR ASVEAAAAEL RVSVRQVYVL 
LRRWRQGEGV VSDLIPGWSS GGRGRDQLPE EVEVVIREVL RRQYLTRQRK TVATVHREIV
RQCRTQGLRP PSRGSVVRRI AKLDPAQATR RREGSDAARA RASAGGVPPS VTAVLEQIQI
DHTVVDLIVV DERHRLPIGR PYLTAGIDVF SRAVPGLVVT LEAPSATTVG LCLAHMVTDK
RPWLEHLGVE ASWPMSGKPV ELYLDNAAEF RSEALRRGCE QHGIALRYRP PGRPHYGGIV
ERVIGTMMRL IHELPGTTFS NPGQRSGYDA DATACLTMTE LQRWLALAVA AYHGQVHGTL
RQPPAARWVD GIATTGAPAV VTHEAAFLVD FLPVIRRTLT RTGFVIDHVH YFCDALKPWI
ARRDRLGRFV IRRDPRDISR VWVLDPDDGT YLEVPYRTMS HPAVSVWEHR AAVERLRTLG
RDQVDEDALF RTVEQMRSIT ESAAATTRKA RRDTARRAAS GGTSAQRPSS TTPECPPDDA
VATPAAPFDV IEQW
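
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that translation using only the standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. `translate` and `CODON_TABLE` are illustrative names, and only the first 30 and last 15 bases of the gene are used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 15 bases of the gene sequence above.
print(translate("ATGGTCGACG AGGCGGTGGT CCAGGAGCGG"))  # MVDEAVVQER
print(translate("ATCGAGCAAT GGTGA"))                  # IEQW*
```

The two fragments reproduce the start (`MVDEAVVQER`) and the end (`IEQW`) of the listed protein sequence, with the trailing `*` marking the TGA stop codon.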