Gene Namu_1602 details

Gene Information

Locus tag: Namu_1602
Symbol:
ID: 8447201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1766259
End bp: 1768082
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 71%
IMG OID: 645040729
Product: Integrase catalytic region
Protein accession: YP_003200985
Protein GI: 258651829
COG category: [L] Replication, recombination and repair
COG ID: [COG2801] Transposase and inactivated derivatives
TIGRFAM ID:
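
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based coordinates that are inclusive at both ends, and a single terminal stop codon that is not counted in the protein length. Neither convention is stated in the record itself, although both agree with the values shown.

```python
# Values copied from the Gene Information fields.
start_bp = 1766259
end_bp = 1768082

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1824 607
```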


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.090134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.142555
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTGG TTGTCTTGTC GAAGGTGGAG CAGCGGCTGG ACGCAGTTCG GGCGGTGCTG 
GCGGGTGCCA CGGTGACCGA GGTCGCGGCC GCGGTCGGGG TGTCCCGGGT GAGCGTGCAT
GCCTGGTTGC GGCGGTACCT GACCGAGGGC GTGACCGGGC TGGCGGACCG GTCGCACCGA
CCCCGGTCGT GCCCGCACCA GGCCGGCGAC GAGGTGTCGG TGCGGGTCGC CGAGCTGCGG
CGGACCCACC CACGGTGGGG CGCCAAGCGC ATCCGGATGG AGCTGCTGCG CAAGCCCGCC
GGCCTGACGG TTCCGTCGAC GGCCACGATC AACCGGATCC TGATCCGGCA CGGCCTGGTC
ACCCCACGCC GCCGGAAACG GCCGCGGAGC TCGTACCAGC GGTGGGAACG ACCGGGGCCG
ATGCAGCTGT GGCAGCTCGA CATCGTCGGC GACGTCTGGC TGGTCAACCC TGCCACCGGA
GTTTTGCGGG GGGTCAAGGT CGTGACCGGG GTGGACGACC ATTCCCGGTT CTGCGTGATC
GCCGCGGTCG TCGAGCGGGC CACCGGGCGG GCGGTGTGCC TGGCCCTGGC CGCGGCGTTG
GCCCGGTTCG GTGTTCCGGG CGAGATCCTG ACCGACAACG GTAAACAGTT CACCGCCCGG
TTCGGTCGCG GCGGGGAGGT GTTGTTCGAC AAGATCTGCC GGCACAACGG GATCACCCAC
CGGCTGACCC AGCCGGCGTC GCCGACCACG ACCGGCAAAA TCGAACGATT CCACCTGACG
CTGCGCCGGG AGCTGCTCGA CGATCACGAG CCGTTCGAAT CGCTGGCCGC CGCGCAGGCC
GCGGTCGACG AGTTCGTCCG GGTCTACAAC ACCGAACGGC CGCACCAGGC CTTGGACGGG
CAGCGGCCGG TCAGCCCGGC CGACCGGTTC ACCCCCATCA ACCCCGCCGA GGCGGAGTTG
GTGCCGCTGT GGCTGCCACC CACGGTCGCC CCCGCGCCGG CTCCGACGAC GTCCGCACCG
TCCGGTGACC AACCCGACAG TGCCGGCGCG GCAACACCTC CAGGCCCCGG CCGGGGCGGC
GCGGGCGGGG CGGTCGAGTT CGACCGGGTT GTTCCGCCGG CCGGGAACTT GCAACTGGCC
GGCCGGCAGC TCTGGCTCGG CCCGGCCCGC TCCGGGCAGG TCGTGCGGTT CTGGGCCGAC
ACCACCCTGA TCCATTTGTT CATCGGGGGA ACCCGGGTCA AGACGGTCCG CTCGCACCTG
ACGGTCACCG ACCTGTCCGC GCTGCTCGCC GCCGGGGCCG TGGCCGCCGG CCCGTCCCCG
CTGCCGCCGG TCCAACCCGG TGACGCCGTG GAGGTCGAAC GGGTCGTGAC CCGCGGCGGC
ACCGTCGTCC TGGGCGGGCA CGCCATGCTG GCCGCGGAGA TCCTGGCCGG CCGGCAGGTC
GGGATCCGCA TCGAACGCGA CACCCTGATG TTCTTCGACC TGGACACCCG TGAGTTGCTG
CGGGTGCGGC CGAACCCGTT GCCGCACGAG CAAATCGTTC GTCTCCGCGG GAACCGGCCG
GCCGGGCCGC CACCCCGGCC GTCTACCGAA CCGGTCCGGG TCCAGCGTCG CGCGTCGAGC
ACCGGCGTGA TCACCGTGTG CCGACAGAAA GTCGCGCTCG GCCGCACCCA TCAGCACCAG
ACCGTGACCG TTCACGTGTC CGACACCACC CTGGCCATCG AATTCGACGA CGGCGAAACC
CGAATCATTC GACGGACCAC GACAATTCCG GTGCGTAACA TCAAGGCCGA CCGACCACGA
TCGGCCACAA CCCAAGTTGT CTAG
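
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any calculation. As a minimal sketch, the helpers below (`clean_sequence` and `gc_percent`, both hypothetical names) strip the whitespace and compute a GC percentage. The record does not say how its GC content value was derived or rounded, so this is only one plausible way to reproduce it. Only the first printed line is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (with block spacing).
first_line = "ATGGCTCTGG TTGTCTTGTC GAAGGTGGAG CAGCGGCTGG ACGCAGTTCG GGCGGTGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` in place of `first_line` gives the figure to compare against the GC content field.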
 
Protein sequence
MALVVLSKVE QRLDAVRAVL AGATVTEVAA AVGVSRVSVH AWLRRYLTEG VTGLADRSHR 
PRSCPHQAGD EVSVRVAELR RTHPRWGAKR IRMELLRKPA GLTVPSTATI NRILIRHGLV
TPRRRKRPRS SYQRWERPGP MQLWQLDIVG DVWLVNPATG VLRGVKVVTG VDDHSRFCVI
AAVVERATGR AVCLALAAAL ARFGVPGEIL TDNGKQFTAR FGRGGEVLFD KICRHNGITH
RLTQPASPTT TGKIERFHLT LRRELLDDHE PFESLAAAQA AVDEFVRVYN TERPHQALDG
QRPVSPADRF TPINPAEAEL VPLWLPPTVA PAPAPTTSAP SGDQPDSAGA ATPPGPGRGG
AGGAVEFDRV VPPAGNLQLA GRQLWLGPAR SGQVVRFWAD TTLIHLFIGG TRVKTVRSHL
TVTDLSALLA AGAVAAGPSP LPPVQPGDAV EVERVVTRGG TVVLGGHAML AAEILAGRQV
GIRIERDTLM FFDLDTRELL RVRPNPLPHE QIVRLRGNRP AGPPPRPSTE PVRVQRRASS
TGVITVCRQK VALGRTHQHQ TVTVHVSDTT LAIEFDDGET RIIRRTTTIP VRNIKADRPR
SATTQVV
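
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. It uses the standard amino acid assignments, which translation table 11 shares; table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper, not part of the database.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGCTCTGG TTGTCTTGTC GAAGGTGGAG"))  # MALVVLSKVE
```

Translating the last line of the gene sequence the same way yields SATTQVV, which matches the end of the protein sequence.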