Gene Namu_2622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2622 
Symbol 
ID8448234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2873317 
End bp2874357 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content72% 
IMG OID645041718 
Producttransposase IS4 family protein 
Protein accessionYP_003201961 
Protein GI258652805 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0000619925 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00356568 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTCCGCA CGAACCGGGT TCGCGCCGAT ACCACGGTCG TCCCGGCGAA TGTGGCGTAC 
CCGACCGACT CGGGGTTGCT GGCCAAAGCG ATCCGCCGGA TCGGCACCAG CGTGAAGCGG
ATCCACGCGG CCGGCGGCGC GGTCCGCACA AGGGTGCGGG ACCGGTCCCG GTCCGCCGGA
GCGAAGGCGC ACGGGGTCGC TGCCAAGCTG CGGTCCCGCG CCCAACTGGG TCGAGACGAG
GCCCGCGCCG GGGTGCAGAA GATCACCGGC GAGCTCGCTG ACCTGGCCGA ACAGGCGATC
AAGGACACCC GCAAGCTGCT GGTCAACGCC CACCGCGCCG CGGACAGGGC CCTGGCCAGG
GCCAAGGCGC TGGCCAAGAC CGGGATCCGC GACGCGGCCG TCGGGCGACG CCGCGGCCGG
TTGGTCCGCG CGGTCAACGA CCTACAGAAC CTGGTCGAGG CGACCGAACG GATCATCGAG
CAGACCCGGA CCCGGCTGAC CGGCCGCACC CCGGACGGCG CCACCCGGGT GGTCAGCCTG
CACGACACCC AGGCCCGGCC GATCGCCAAG GGCCGCCTCG GTAAGCCGGT CGAGTTCGGC
TACAAGGGCC AGGTTGTCGA CAACCAGGAT GGCATCGTGC TGGACCACAA CGTCGAACTG
GGCAACCCGC CCGACGCACC ACAACTGGCG CCGGCGATCG ACCGGATCAC CGCGCGAACC
GGTCGGACGC CGCGCACGGT GACCGCGGAC CGCGGCTACG GCGAGGCCAG TGTCGACCAG
CAGCTGACCG ACCGCGGCGT GCGGAACGTC GTCATCCCCC GCAAGGGCAG ACCCGGCGCG
GCCCGCCGAG CCGTCGAGCA TCGGCCGGCG TTCCGGCGGA CCGTGAAGTG GCGAACCGGC
TGCGAAGGCC GGATCAGCAC CCTCAAACGC GGCTACGGAT GGGACCGCAC GCGCCTGGAC
TCGCTCGAAG GAGCCCGGAC CTGGACCGGA CAAGGGATCC TGACCCACAA CCTGGTCAAG
ATCGCCGCCC TGACCGCCTG A
 
Protein sequence
MVRTNRVRAD TTVVPANVAY PTDSGLLAKA IRRIGTSVKR IHAAGGAVRT RVRDRSRSAG 
AKAHGVAAKL RSRAQLGRDE ARAGVQKITG ELADLAEQAI KDTRKLLVNA HRAADRALAR
AKALAKTGIR DAAVGRRRGR LVRAVNDLQN LVEATERIIE QTRTRLTGRT PDGATRVVSL
HDTQARPIAK GRLGKPVEFG YKGQVVDNQD GIVLDHNVEL GNPPDAPQLA PAIDRITART
GRTPRTVTAD RGYGEASVDQ QLTDRGVRNV VIPRKGRPGA ARRAVEHRPA FRRTVKWRTG
CEGRISTLKR GYGWDRTRLD SLEGARTWTG QGILTHNLVK IAALTA