Gene Namu_3544 details


Gene Information

Locus tag: Namu_3544
Symbol:
ID: 8449163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3892370
End bp: 3893944
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 77%
IMG OID: 645042622
Product: TAP domain protein
Protein accession: YP_003202858
Protein GI: 258653702
COG category:
COG ID:
TIGRFAM ID:
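
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3892370
end_bp = 3893944

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the listed Gene Length (1575 bp) and Protein Length (524 aa).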


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000370838
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00997148
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGCG CGCGCCGGAT CGGCCCGCTC GCCGCCGGGT TGGCCGCGCT GACCATGGTG 
CTGCTGGCCG GCTGCACCCT GGGCCCGTCC CAACGGCCGG CGTTGGCCAC CTACGGCCCC
GGGCCGGCCG CCACCGTGAC CACCACCGCC CCGGCCAGCG GCACCGTCGG ACCCGGTGGT
CCCGGGCAGC GGGCCGACCC GATCCGCTGG CAGGGCTGCG CGGACGTCGC CGACACCGAC
CCGACGACCG GCCAGCAGTT CGACGTCGAC TGCGCCACCG TGGTCACCGA GGGGTCGGCG
GTCGGGTCGG CCGGGCGGCG GAGCATCGAG GTGGCCCGTG CCCGCGCCGC CGGCGTCGCC
GACGACGCCC CGGTCCTGGT GGTGCTGGAC GGTTCCCCTG GCCAGCACGG TCGCAGCGAC
GTGGCCGCCG TCGCCGCCGG CCTGTCCCCG GCCGTGCGCC AGCACTTCGC GGTGGTCACC
GTCGATCTGG CCGGCAGCGG TGACGCCGAC CCGATCGACT GCATCTCCCG TACCGACCTG
GGCGCGCTGG CCACCCTGGG GGCCGATCCG ACCCAACCCG AGGGGGCCAA GGCCCTGGCC
GAGGTGACCC GCTCGATCAC CTTCGAGTGC ACCGACCAGG GCGGTCCTGA CCTGCCGCTG
GTGAACACCA CCGAAGCCGC CGACGACCTG GACCATCTAC GGGCCGCGCT GGGCACCGAC
CGGCTGACCG TGCTCGGCCG CGGCGCCGGG GCGACCCTGG GCACCGTCTA CGCCGACCGG
TATCCGGGCC GGGTGCAGGC CGCGGTCCTG GATGCGCCGG CCAACCCGAT GGATGCCGCG
GACGCCAGGG CCGCGGCCGT CGGGGTGGCC GCGGAGAAGG CATTGGACGC GTTCGCGGCC
GCCTGCCCGA CCTTCAGCGG CGGCTGCCCG CTGGGCGCCG ATCCGCGCGG CACCGTGGAG
AAGACGGTTG CCCAGCTCGA CGCGGCCGCG ACCCCGGGCA CCGGCCGGGT GACCGGCGGG
TCGGTGCTGC TGAGCCTGCT GCTGGGACTG GGCGACCCGG CCGTCTGGCC CGGGCTGGCC
GGCGACATCG CCAAGGCCGG GCAGGGCGAC ACCACCGCCC TGGCCAGCCG CCTGAGCACC
GCGCTCGGCC TGTCCGACAG CCGGTCGTGG GTGACCCCGG CCCTGATCTA CGCCTGCAAC
GACACCGCGG TCCGGCTCGG CCCGGACCAG CTGGCCACCG CGGTCCAGGA CGTCCGGGCG
CAGGCCCCGC TGTTCGGGCC CTACGCCCTG GGCCTGCTCG GGGTGTGCGG GTCGTGGCCG
GCACCGGAGA ACGCGCTCGG CGCGGTCAAG GCCAACGGCG CGCCCCCGAT CCTGGTGCTC
GGCGCGGTCG ACGACCCCGT CGCCCCGTAC GAATCGGTGC GGGCCCTGGC CGGTCAGCTG
GCCTCGGCCG TCCTGGTCAG CTGGCAGTCC GGCACCCACG GCAGCTATCC GGCGAGCGCG
TGCGCCGCCG GGGCCGTCGA CGGCTACCTG CTGCAGGGTC AGTTGCCCGC CGTCGGCACC
CTCTGCCCGC CCTGA
 
Protein sequence
MSGARRIGPL AAGLAALTMV LLAGCTLGPS QRPALATYGP GPAATVTTTA PASGTVGPGG 
PGQRADPIRW QGCADVADTD PTTGQQFDVD CATVVTEGSA VGSAGRRSIE VARARAAGVA
DDAPVLVVLD GSPGQHGRSD VAAVAAGLSP AVRQHFAVVT VDLAGSGDAD PIDCISRTDL
GALATLGADP TQPEGAKALA EVTRSITFEC TDQGGPDLPL VNTTEAADDL DHLRAALGTD
RLTVLGRGAG ATLGTVYADR YPGRVQAAVL DAPANPMDAA DARAAAVGVA AEKALDAFAA
ACPTFSGGCP LGADPRGTVE KTVAQLDAAA TPGTGRVTGG SVLLSLLLGL GDPAVWPGLA
GDIAKAGQGD TTALASRLST ALGLSDSRSW VTPALIYACN DTAVRLGPDQ LATAVQDVRA
QAPLFGPYAL GLLGVCGSWP APENALGAVK ANGAPPILVL GAVDDPVAPY ESVRALAGQL
ASAVLVSWQS GTHGSYPASA CAAGAVDGYL LQGQLPAVGT LCPP
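
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as start codons), and the names `CODON_TABLE`, `translate`, `first_block`, and `last_block` are hypothetical helpers, not part of the record. Only the first and last lines of the gene sequence are used.

```python
# Amino acid assignments of the standard genetic code, laid out in TCAG order
# (first base, then second, then third). Assumption: translation table 11
# uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence as listed in this record.
first_block = "ATGAGCGGCG CGCGCCGGAT CGGCCCGCTC GCCGCCGGGT TGGCCGCGCT GACCATGGTG"
last_block = "CTCTGCCCGC CCTGA"

print(translate(first_block))
print(translate(last_block))
```

The first line of the gene sequence translates to the first 20 residues of the protein sequence, and the last line ends in a stop codon after the final four residues.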