Gene Namu_1473 details

Gene Information

Locus tag: Namu_1473
Symbol:
ID: 8447069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1626891
End bp: 1628594
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 71%
IMG OID: 645040602
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_003200861
Protein GI: 258651705
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
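
The gene and protein lengths in this table follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1626891
end_bp = 1628594

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Both values agree with the Gene Length and Protein Length fields, which suggests the coordinates are indeed inclusive.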


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0401989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.232959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGT CGACCACCTC CGCCGTCTCG CCGGTTGCCC CGCCGCGAGT TGCGACCCGT 
CGCAGCTCGC CGCTGGGCGA TGCCTTCCAC GTCTTCGACA CCACGCTGCG CGACGGTGCC
CAGCGCGAGG GCATCACCTA CTCGGTGGCC GACAAGATCG CGGTCGCGAC GCTGCTCGAC
GAGCTGGGGG TCGGCTTCAT CGAGGGTGGC TGGCCGGGGG CGATGCCCAA GGACACCGAG
TTCTTCGCCC GGGCCGCGGC CGGGGAGCTG GCGCTGACCA CGGCCAAGCT GGTCGCCTTC
GGGGCCACCC GTAAGGCCGG CACCCGGGCC GATCAGGACC CGCAGGTCCG GGCCCTGCTC
GACTCCGGGG CCGAGGTGAT CACGCTGGTC GCCAAGTCCG ACGTGCGGCA TGTGGAACGC
GCCCTGCGCA CCACGCTGAG CGAGAACCTG GCCATGGTCG GGGACACCGT GCGGCTGCTG
GTCGACCACG GCCGGCGGGT GTTCCTGGAC TGCGAGCACT TCTTCGACGG GTACGCCGCC
GACCGGGACT ACGGGCTGCG GGTGGCCGAG GCGGCGGTGA GTGCCGGCGC CGACGTGGTC
GTGCTGTGTG ACACCAACGG CGGCATGCTC CCGATGGGCA TCGAGCGGGT GGTCCGTGAG
GTCCGCGAGC GCGGCGGCTT CCGCGTCGGC ATCCATTGCC AGGACGACAC CGGCTGCGCG
GTGGCCAACA CGGTCGCCGC GGTGCAGGCC GGGGCCACCC ATGTGCAGTG CACGGCCAAC
GGTTACGGCG AGCGGGCCGG CAACGCCGAC CTGTTCGCGG TGGTGGGCAA CCTGACCACC
AAGATGAACC TGCCTGTGCT GCCGGAGGGC GCGCTGCCGG AGATGATGCG GGTTTCGCAT
GCGCTGGCCG AGCTGGCCAA CATCGCCCCC AACACCCACC AGGCCTACGT CGGCTCGTCC
GCGTTCTCGC ACAAGGCCGG CCTGCACGCC TCGGCGATCA AGGTCGACCC GGACCTGTAC
AACCACCTGG ACCCGACCGT CGTCGGCAAC GACATGCACA TCCTGATCAC CGAGATGGCC
GGCCGGGCCT CGGTCGAGCT CAAGGCCAAG GAACTCGGCG TCGATCTGGC CGGGCGCACC
GACGCGGTGG GCCGGATCGT CGACGCGGTC AAGGACCGTG AGGCCGCCGG CTGGTCGTAC
GAGGCGGCGG ACGCCTCCTT CGAGCTGCTG ATCCGCGACG AGCTGGCGGC CACCGACGCC
GCCGCGGCCG GCGTCGAGGT GCCGCCGCGG CCGTTCACCC TCGAGTCCTA CCGGGTGATC
GTGGAGAACA CCGGCGGCAC GGTCAGCAGT GAGGCCACGG TGAAGGTGCA TGTCGGCGAC
CGGCGGATCA TCACCACCGC CGAGGGCAAC GGACCGGTCA ACGCCCTGGA CTCGGCGTTG
CGCAGCGCCG TCGCCGAGCG CTACCCGGAG CTCAACGAGG TCGAACTGGT CGACTACAAG
GTCCGCATCC TGGCCGGCCA TGCCGGCACC GACTCGATCA CCCGGGTGTT GGTGTCCAGC
TCGATCCAGG GCCTGCGCGG CCCCCAGGAA TGGACCACCG TGGGCGTGCA CGCCAACGTC
GTCGAGGCCT CCTGGCAGGC GCTGGTCGAC GCACTCGTCT ACGTCCTGCC CTCGTCGACG
GGAAACCCCG TCCTGCCCAC CTGA
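
A GC content figure such as the one in the Gene Information table is the fraction of G and C bases in a nucleotide sequence. The following is a minimal sketch of that calculation; `gc_fraction` is a hypothetical helper name, and only the first line of the gene sequence is used here, so the printed value will not necessarily match the figure for the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 of the 1704 bases).
first_line = (
    "ATGACCGTGT CGACCACCTC CGCCGTCTCG "
    "CCGGTTGCCC CGCCGCGAGT TGCGACCCGT"
)

print(f"GC content of the first 60 bases: {gc_fraction(first_line):.0%}")
```

To check the whole gene, pass the full sequence block to `gc_fraction`; the 10-base spacing and line breaks are stripped before counting.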
 
Protein sequence
MTVSTTSAVS PVAPPRVATR RSSPLGDAFH VFDTTLRDGA QREGITYSVA DKIAVATLLD 
ELGVGFIEGG WPGAMPKDTE FFARAAAGEL ALTTAKLVAF GATRKAGTRA DQDPQVRALL
DSGAEVITLV AKSDVRHVER ALRTTLSENL AMVGDTVRLL VDHGRRVFLD CEHFFDGYAA
DRDYGLRVAE AAVSAGADVV VLCDTNGGML PMGIERVVRE VRERGGFRVG IHCQDDTGCA
VANTVAAVQA GATHVQCTAN GYGERAGNAD LFAVVGNLTT KMNLPVLPEG ALPEMMRVSH
ALAELANIAP NTHQAYVGSS AFSHKAGLHA SAIKVDPDLY NHLDPTVVGN DMHILITEMA
GRASVELKAK ELGVDLAGRT DAVGRIVDAV KDREAAGWSY EAADASFELL IRDELAATDA
AAAGVEVPPR PFTLESYRVI VENTGGTVSS EATVKVHVGD RRIITTAEGN GPVNALDSAL
RSAVAERYPE LNEVELVDYK VRILAGHAGT DSITRVLVSS SIQGLRGPQE WTTVGVHANV
VEASWQALVD ALVYVLPSST GNPVLPT
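
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is one way to do that with the Python standard library only. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons); `translate` and `CODON_TABLE` are hypothetical names, and only the first and last lines of the gene sequence are translated.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with their
# amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above. The 28 preceding lines of
# 60 bases each total 1680 bases, so the last line starts on a codon boundary.
first_line = "ATGACCGTGT CGACCACCTC CGCCGTCTCG CCGGTTGCCC CGCCGCGAGT TGCGACCCGT"
last_line = "GGAAACCCCG TCCTGCCCAC CTGA"

print(translate(first_line))
print(translate(last_line))
```

The two results correspond to the first 20 and the last 7 residues of the protein sequence listed above.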