Gene Namu_2536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2536 
Symbol 
ID8448147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2788637 
End bp2789737 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID645041644 
Productputative transposase 
Protein accessionYP_003201888 
Protein GI258652732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000586897 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000186966 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGCATC CCGGTCGCCC GAAGGCTGCC CTCGAGCTCG CTGATGATGA GCGAGAAGCG 
TTGACGAGGT GGGCCCGTCG ACCCACCTCG TCGCAGCAGC TGGCGTTGCG TTCTCGCATC
GTGTTGGTCT GCGCGGACGG GCACACGAAC ACCGCGGTCG CCGAGCAGCT GGGCATCAAC
AAGGTGACCG TGGGCAAGTG GCGGGCCCGG TTCGTGCGGC ACCGGCTGGA GGGGCTGACG
GACGAGCCGC GTCCGGGCGC GCCGCGCACG GTGATCGACG ACGCGGTGGA GCGGGTGATC
GTTAAAACCC TGGAGTCCAA GCCGGTCGAC GCGACCCACT GGTCGACGCG GTCGATGGCC
GCGGCCACCG GGATGAGCCA GACGGCGATC TCGCGGATCT GGCGGGCGTT CGGGCTCAAA
CCGCACCGGC AGGAGAGCTT CAAGCTGTCC ACGGACCCTC AATTCGTGGA GAAGGTCCGC
GACGTCGTCG GCCTGTACCT GAACCCGCCG GAGCAGGCGC TGGTGTTCTG CGTGGATGAG
AAGACCCAAG TGCAGGCCCT GGACCGATCC CAGCCGGTCA TCCCGATGAT GCCCGGCACC
CCGGAACGCC TCACCCACGA CTACATCCGG GCCGGCACGC TGGACCTGTT CGCCGCCCTC
GAGGTTGCCG GCCCCACCGC CGGGCGGGTG ATCACTCAAC TGCACCCGCA GCATCGCGCG
ATCGAGTTCC GTAAGTTCCT GGTCGCGATC GATAAGGCGG TCCCGGCCGG CTACGACGTC
CACCTGGTCG TGGACAACCT GTCCACCCAC AAGACCGCGG CGATCCACAA GTGGCTCCTG
GCGCACCCCC GATTCCACCT GCACTTCACC CCGACCGGGT CGTCCTGGCT CAACCTGGTC
GAACGCTGGT TCGCCGAAAT CACCATGAAA CTGATCCGCC GCGGAGTCCA CTGCAGCGTC
AAAGACCTCG CCAAGGACAT CACCAACTGG GCCGAGAACT GGAACTCCGA CCCCAAGCCC
TACGTCTGGA CCAAGACCGC AGACGACATC CTCGACACCC TCGCCGCATA TTGTCAGCGA
ATCAATGACT CAGCACACTA G
 
Protein sequence
MPHPGRPKAA LELADDEREA LTRWARRPTS SQQLALRSRI VLVCADGHTN TAVAEQLGIN 
KVTVGKWRAR FVRHRLEGLT DEPRPGAPRT VIDDAVERVI VKTLESKPVD ATHWSTRSMA
AATGMSQTAI SRIWRAFGLK PHRQESFKLS TDPQFVEKVR DVVGLYLNPP EQALVFCVDE
KTQVQALDRS QPVIPMMPGT PERLTHDYIR AGTLDLFAAL EVAGPTAGRV ITQLHPQHRA
IEFRKFLVAI DKAVPAGYDV HLVVDNLSTH KTAAIHKWLL AHPRFHLHFT PTGSSWLNLV
ERWFAEITMK LIRRGVHCSV KDLAKDITNW AENWNSDPKP YVWTKTADDI LDTLAAYCQR
INDSAH