Gene Namu_3047 details


Gene Information

Locus tag: Namu_3047
Symbol:
ID: 8448660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3355243
End bp: 3356547
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 70%
IMG OID: 645042130
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_003202372
Protein GI: 258653216
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
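
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3355243
end_bp = 3356547
reported_gene_length = 1305    # bp
reported_protein_length = 434  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder)                                  # 0
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the 1305 bp span corresponds to 435 codons, that is 434 residues plus the stop codon.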


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000291896
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00289702
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCGATG ACGTTACCGG TGTGTTCGTG CTGCCCGGAT TCCGGGTGGT GTCCAGCGAC 
GTGCTCGACG ACGAATGGCA CCTGCTTGTG GAAACGCGGC GGGAGCCGAC GGGCTGCCCG
ACCTGCGGCG CGGTCGCCCG AGTCAAGGAC CGGCGCACGG TGACCGTGCG GGACCTGCCG
GCCGGCGGTG TGCCGGTCGT TCTCCGTTGG TGCAAGCGGA TTTTCGAGTG TCGATACGGG
TTGTGCGAGA AGAAGACCTG GACCGAGCAA CACGACGCGA TCGCCCCGCG TGTGGTGCTC
ACCGACCGTG CGCAGCAGTG GGCGTTCGAG CAGGTCGGCC ACCACGACCG GGCCGTGTCC
CGGGTCGCGG CCCAACTCGG CGTGTCCTGG CACACGATCA TGACCCAGGT CGTTGACCGC
GGCACCCCGC TGGTCGAGGA CCCGGACCGG CTGGCCGGGG TGAGCGCGGT CGGGGTCGAC
GAGACCTCGT TCCTGCGGGC CACCGGCACC CGGCACACCC AGTACGCCAC CGGCGTCGCC
GACCTGACCC CGGGCCGCCC ACCGCGGCTG CTGGACGTCG TACCGGGCCG CTCCGGCCGA
GTCCTGGGCG ACTGGCTGAC CGACCGCGAC GAGGCCTGGC GGTCCGCCGT TCTGACCGCG
TCGCTGGACC CGTTCCGCGG TTATGCCACC GCGCTCTCGG CGCATCTACC TGCGACGACG
CGCGTGCTGG ACGCCTTCCA CATCGTCAAG ATCGTGCTTC TCGCGGTCGA CCAGGTCCGC
CGCCGCGTGC AGCAGGACAC CACCGGGCAC CGCGGCCGGG CCGGCGACCC GCTGTACCGG
GTACGACGGA TCCTGCGGCG CCGCTACGAC CGGCTCACCG ACCGGCAACT GGTCCGACTG
CGGGCCGCAC TGACCGACGC GGACACGCAC GAGGAGATCA CCGCCGCCTG GCTCGTCGCG
CAGAACGTCA TGCAGGCCTA CGCCAACCCC GACCGGGCCG CCGGACGCGC CGCGGCCGAG
CAGGTCATCA CCCTGGCCAA GACCTGCCCG GTTCCCGAGA TCGCCCGCTT CGGGCGCACT
CTGGTCGCGT GGCGGACCGA GTACCTGGCC CGGTTCGACA ACCCCGCCCT ATCCAACGGA
CCCACCGAGA ACCTCAACCT GAAGATCAAG AACACCAAAC GGATCGCCCG CGGCTACCGA
TCGTTCGCCA ACTACCGTCT GCGACTACTT CTCAACCACG GCCTCATCAG GCAAGATCAA
CAACCGGCAC GGATCCGACC ACACCGACCC AGGTTGATTG CGTAG
 
Protein sequence
MSDDVTGVFV LPGFRVVSSD VLDDEWHLLV ETRREPTGCP TCGAVARVKD RRTVTVRDLP 
AGGVPVVLRW CKRIFECRYG LCEKKTWTEQ HDAIAPRVVL TDRAQQWAFE QVGHHDRAVS
RVAAQLGVSW HTIMTQVVDR GTPLVEDPDR LAGVSAVGVD ETSFLRATGT RHTQYATGVA
DLTPGRPPRL LDVVPGRSGR VLGDWLTDRD EAWRSAVLTA SLDPFRGYAT ALSAHLPATT
RVLDAFHIVK IVLLAVDQVR RRVQQDTTGH RGRAGDPLYR VRRILRRRYD RLTDRQLVRL
RAALTDADTH EEITAAWLVA QNVMQAYANP DRAAGRAAAE QVITLAKTCP VPEIARFGRT
LVAWRTEYLA RFDNPALSNG PTENLNLKIK NTKRIARGYR SFANYRLRLL LNHGLIRQDQ
QPARIRPHRP RLIA
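
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons with translation table 11 (the bacterial, archaeal and plant plastid code). This is a rough sketch rather than the database's own pipeline: the function `translate_cds` and its `is_cds_start` parameter are hypothetical names introduced here. Table 11 assigns the same amino acids as the standard code, but it accepts alternative initiation codons such as GTG, which are read as methionine at the first position.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, is_cds_start=True):
    """Translate a DNA fragment codon by codon ('*' marks a stop codon).

    Whitespace is ignored, so the 10-base blocks of the record can be
    pasted in directly. If is_cds_start is True, a permitted initiation
    codon in the first position is translated as methionine (M).
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
head = "GTGAGCGATG ACGTTACCGG TGTGTTCGTG"
tail = "AGGTTGATTG CGTAG"

print(translate_cds(head))                      # MSDDVTGVFV
print(translate_cds(tail, is_cds_start=False))  # RLIA*
```

The two outputs match the first ten residues (MSDDVTGVFV) and the last four residues (RLIA) of the protein sequence, with the trailing `*` marking the TAG stop codon.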