Gene Namu_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1765 
Symbol 
ID8447367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1931672 
End bp1932991 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content70% 
IMG OID645040891 
Producttransposase IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_003201144 
Protein GI258651988 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00126506 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.299923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA GTAGTTCGCT GCTGTTGGAC ATCGACGGGC TGGTCGTCGA CCAGGTCGTC 
CGCGACGACG TCGGCCGACG GGTCGTGCAC TGCTCGACCG ACCCGCAGCT GGCCGGCTGG
TGCCCGGAGT GCGGTGAGCA GTCGAGGTCG CCGAAGGCGT GGGTGACGAC CCGCCCGCGG
GACGTCCGGC TGGGCGAGGA TCGGCCGATC CTGTTGTGGC GCAAACGCAA ATGGCGCTGC
CAGGTCGACG GCTGCGAGCG GAAGGTGTTC ACCGAGTGCC TGCCCGAGCA GATCCCCGCC
CGGGCCCGGA TCACCACCCG GGCCCGCCGA CTGGCGGCCG AGGCGATCGG CGACCACACC
CGGCCGGTGT CCGGTGTGGC GGCCGAATTC GGCATGGACT GGCGCGTTGC GCACGACGCG
TTCGTCGCGC ACGCCGCGCA GGTGTTGCCC GAGCAGCCGC CGCCGGTGAC CGTGCTGGGC
GTCGACGAGA CCCGCCGCGG CAAGGCCCAC TACGAGACCG ACCCCACCAC CGGGGCGAAG
ACCTGGGTGG ACCGCTTCGA CACCGGCCTG GTCGATCTGA GCGGCAACGG CGGGCTGTTC
GCGCAGGTCA ACGGCCGCAC CAGCAAGGTC CTCATCGAGT GGCTGCAGGC GCAGGACCCG
GACTGGCTCG CCAACATCAC CCACATCTCG ATGGACACCA GCGCGACCTA CGCCCGGGCC
GCCCGGCTGG CCCTGCCCGA TGCTGTCGTG GTCGTGGACC GGTTCCATCT GTCCGCCCTG
GCCAACAAGG CGGTCACCGA CTACCGGCGG GAGTTGGCCT GGGCGCTGCG TGGCCGGCGG
GGCCGCAAGA GCGACCCGGA GTGGGCGCAA CGGAACCGGC TGCTGCGGGC CGCGGAGTCC
CTGACCGACG ACGAGCTGGC CAAGGTGCAG GATGCGATGC GTCGGGCGGA CCCGTCCGGC
GGCCTTGAGA AGTGCTGGCA GGGCAAGGAA CTGCTCCGCA AGCTGCTCAA GCTGGCTGGC
ACGAACCCGG ACCGCGGACG GATCTTCAAC GCGCTGACCG CGTTCTATCT GCATTGCGCC
GACTCCGGGG TCGCGCAGCT GCGGCGGCTG GCGTGGACGG TGCACGCCTG GCAGAACTCG
ATCATCGCCG GGCTGCACAC CGGGATCAGC AACGGCCGCA CCGAGGGCTA CAACCGGATC
GTCAAACACA TCGGCCGGAT CGCCTTCGGC TTCCGCAACC AGGACAACCA GAAGCGCCGG
GTACGCTACG CCTGCACCCG GAAATCCCGG GCGTCAACCA GCCACGCGAA GCCCTGCTAA
 
Protein sequence
MDQSSSLLLD IDGLVVDQVV RDDVGRRVVH CSTDPQLAGW CPECGEQSRS PKAWVTTRPR 
DVRLGEDRPI LLWRKRKWRC QVDGCERKVF TECLPEQIPA RARITTRARR LAAEAIGDHT
RPVSGVAAEF GMDWRVAHDA FVAHAAQVLP EQPPPVTVLG VDETRRGKAH YETDPTTGAK
TWVDRFDTGL VDLSGNGGLF AQVNGRTSKV LIEWLQAQDP DWLANITHIS MDTSATYARA
ARLALPDAVV VVDRFHLSAL ANKAVTDYRR ELAWALRGRR GRKSDPEWAQ RNRLLRAAES
LTDDELAKVQ DAMRRADPSG GLEKCWQGKE LLRKLLKLAG TNPDRGRIFN ALTAFYLHCA
DSGVAQLRRL AWTVHAWQNS IIAGLHTGIS NGRTEGYNRI VKHIGRIAFG FRNQDNQKRR
VRYACTRKSR ASTSHAKPC