Gene Namu_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4081 
Symbol 
ID8449704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4504894 
End bp4505952 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content68% 
IMG OID645043128 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_003203360 
Protein GI258654204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.389016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000053285 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCACGA TCGAACAGAC AATTGCAAAT TCCACCACTT CGGACCGGCC GTTCGTCCGG 
CTGACCGACA GCACGCTGCG CGACGGGAGC CACGCCGTCC GGCACCAGTT CACGACCGAC
AACGTCACCG ACGTGGTCAC CGCACTGGAT GCGGCCGGGG TCTCGGTCAT CGAGGTGACC
CACGGGGACG GGCTGGCCGG CTCGTCGTTC AACTACGGCT TCGGCAAGCA CACCGACGCC
GAGCTGGTGG CCGCCGCGGT CCGCGCCGCC ACCCGGGCCA AGATCGCCGT CCTGGTGCTG
CCGGGGCTGG GCACCGTGCA CGACCTGAAG CAGGTGCACA GCGCCGGCGC GCAGATCGCC
CGGGTCGCGA CCCACTGCAC CGAGGCCGAC GTCTCCGTCG AGCACTTCAC CGCGGCCCGC
GAGCTGGGCA TGGAAACCGT TGGCTTCCTG ATGCTTTCGC ATCGGATCGG GCCGGAGCAG
CTGGCCAAGC AGGCCCGGAT CATGGCCGAC GCCGGCTGCC AGTGCGTGTA CGTCGTCGAT
TCGGCCGGCG CGTTGCTGCC GGACATGGTC CGCGACCGGG TGCAGGCGCT GGTGGCCGAG
CTCGGCGACG ACGCGCAGGT CGGCTTCCAC GGTCACCAGA ACCTGTCGCT GGGCGTGGCC
AACTCGATCG TCGCCTACGA GAACGGGGCC CGGCAGATCG ACGGCACGCT GTGCGCGCTG
GGGGCCGGTG CGGGCAACTC GCCGACCGAG ATCCTGGCGA CGGTCTTCGA CGTCATGGGC
GTGCCGACCG GGGTCGACGC GGCCAAGGTC CTGGACGCCG CCGAGGACAT TGTCAAGCCG
ATGATCACCC GCATGCCGGT CGCCGACCGA GCCTCAATCG TGCAGGGGCG CTACGGTGTT
TACAATTCAT TCCTTTTGCA CGCCGAACGT GCAGCGGACC GATACGGTGT GTCTTCTCAC
GAAATACTCC GAAAGGTCGG CGAGGCTGGT TACGTCGGCG GACAGGAGGA CATGATCATC
GATGTCGCCA TCGGGCTCGC GGCGCAGCGA ACCGGTTGA
 
Protein sequence
MTTIEQTIAN STTSDRPFVR LTDSTLRDGS HAVRHQFTTD NVTDVVTALD AAGVSVIEVT 
HGDGLAGSSF NYGFGKHTDA ELVAAAVRAA TRAKIAVLVL PGLGTVHDLK QVHSAGAQIA
RVATHCTEAD VSVEHFTAAR ELGMETVGFL MLSHRIGPEQ LAKQARIMAD AGCQCVYVVD
SAGALLPDMV RDRVQALVAE LGDDAQVGFH GHQNLSLGVA NSIVAYENGA RQIDGTLCAL
GAGAGNSPTE ILATVFDVMG VPTGVDAAKV LDAAEDIVKP MITRMPVADR ASIVQGRYGV
YNSFLLHAER AADRYGVSSH EILRKVGEAG YVGGQEDMII DVAIGLAAQR TG