Gene Namu_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1966 
Symbol 
ID8447575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2172612 
End bp2173931 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content70% 
IMG OID645041096 
Producttransposase IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_003201342 
Protein GI258652186 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0415277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0178041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA GTAGTTCGCT GCTGTTGGAC ATCGACGGGC TGGTCGTCGA CCAGGTCGTC 
CGCGACGACG TCGGCCGACG GGTCGTGCAC TGCTCGACCG ACCCGCAGCT GGCCGGCTGG
TGCCCGGAGT GCGGTGAGCA GTCGAGGTCG CCGAAGGCGT GGGTGACGAC CCGCCCGCGG
GACGTCCGGC TGGGCGAGGA TCGGCCGATC CTGTTGTGGC GCAAACGCAA ATGGCGCTGC
CAGGTCGACG GCTGCGAGCG GAAGGTGTTC ACCGAGTGCC TGCCCGAGCA GATCCCCGCC
CGGGCCCGGA TCACCACCCG GGCCCGCCGA CTGGCGGCCG AGGCGATCGG CGACCACACC
CGGCCGGTGT CCGGTGTGGC GGCCGAATTC GGCATGGACT GGCGCGTTGC GCACGACGCG
TTCGTCGCGC ACGCCGCGCA GGTGTTGCCC GAGCAGCCGC CGCCGGTGAC CGTGCTGGGC
GTCGACGAGA CCCGCCGCGG CAAGGCCCAC TACGAGACCG ACCCCACCAC CGGGGCGAAG
ACCTGGGTGG ACCGCTTCGA CACCGGCCTG GTCGATCTGA GCGGCAACGG CGGGCTGTTC
GCGCAGGTCA ACGGCCGCAC CAGCAAGGTC CTCATCGAGT GGCTGCAGGC GCAGGACCCG
GACTGGCTCG CCAACATCAC CCACATCTCG ATGGACACCA GCGCGACCTA CGCCCGGGCC
GCCCGGCTGG CCCTGCCCGA TGCTGTCGTG GTCGTGGACC GGTTCCATCT GTCCGCCCTG
GCCAACAAGG CGGTCACCGA CTACCGGCGG GAGTTGGCCT GGGCGCTGCG TGGCCGGCGG
GGCCGCAAGA GCGACCCGGA GTGGGCGCAA CGGAACCGGC TGCTGCGGGC CGCGGAGTCC
CTGACCGACG ACGAGCTGGC CAAGGTGCAG GATGCGATGC GTCGGGCGGA CCCGTCCGGC
GGCCTTGAGA AGTGCTGGCA GGGCAAGGAA CTGCTCCGCA AGCTGCTCAA GCTGGCTGGC
ACGAACCCGG ACCGCGGACG GATCTTCAAC GCGCTGACCG CGTTCTATCT GCATTGCGCC
GACTCCGGGG TCGCGCAGCT GCGGCGGCTG GCGTGGACGG TGCATGCCTG GCAGAACTCG
ATCATCGCCG GGCTGCACAC CGGGATCAGC AACGGCCGCA CCGAGGGCTA CAACCGGATC
GTCAAACACA TCGGCCGGAT CGCCTTCGGC TTCCGCAACC AGGACAACCA GAAGCGCCGG
GTACGCTACG CCTGCACCCG GAAATCCCGG GCGTCAACCA GCCACGCGAA GCCCTGCTAA
 
Protein sequence
MDQSSSLLLD IDGLVVDQVV RDDVGRRVVH CSTDPQLAGW CPECGEQSRS PKAWVTTRPR 
DVRLGEDRPI LLWRKRKWRC QVDGCERKVF TECLPEQIPA RARITTRARR LAAEAIGDHT
RPVSGVAAEF GMDWRVAHDA FVAHAAQVLP EQPPPVTVLG VDETRRGKAH YETDPTTGAK
TWVDRFDTGL VDLSGNGGLF AQVNGRTSKV LIEWLQAQDP DWLANITHIS MDTSATYARA
ARLALPDAVV VVDRFHLSAL ANKAVTDYRR ELAWALRGRR GRKSDPEWAQ RNRLLRAAES
LTDDELAKVQ DAMRRADPSG GLEKCWQGKE LLRKLLKLAG TNPDRGRIFN ALTAFYLHCA
DSGVAQLRRL AWTVHAWQNS IIAGLHTGIS NGRTEGYNRI VKHIGRIAFG FRNQDNQKRR
VRYACTRKSR ASTSHAKPC