Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4081 |
Symbol | |
ID | 8449704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4504894 |
End bp | 4505952 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645043128 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_003203360 |
Protein GI | 258654204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.389016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000053285 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCACGA TCGAACAGAC AATTGCAAAT TCCACCACTT CGGACCGGCC GTTCGTCCGG CTGACCGACA GCACGCTGCG CGACGGGAGC CACGCCGTCC GGCACCAGTT CACGACCGAC AACGTCACCG ACGTGGTCAC CGCACTGGAT GCGGCCGGGG TCTCGGTCAT CGAGGTGACC CACGGGGACG GGCTGGCCGG CTCGTCGTTC AACTACGGCT TCGGCAAGCA CACCGACGCC GAGCTGGTGG CCGCCGCGGT CCGCGCCGCC ACCCGGGCCA AGATCGCCGT CCTGGTGCTG CCGGGGCTGG GCACCGTGCA CGACCTGAAG CAGGTGCACA GCGCCGGCGC GCAGATCGCC CGGGTCGCGA CCCACTGCAC CGAGGCCGAC GTCTCCGTCG AGCACTTCAC CGCGGCCCGC GAGCTGGGCA TGGAAACCGT TGGCTTCCTG ATGCTTTCGC ATCGGATCGG GCCGGAGCAG CTGGCCAAGC AGGCCCGGAT CATGGCCGAC GCCGGCTGCC AGTGCGTGTA CGTCGTCGAT TCGGCCGGCG CGTTGCTGCC GGACATGGTC CGCGACCGGG TGCAGGCGCT GGTGGCCGAG CTCGGCGACG ACGCGCAGGT CGGCTTCCAC GGTCACCAGA ACCTGTCGCT GGGCGTGGCC AACTCGATCG TCGCCTACGA GAACGGGGCC CGGCAGATCG ACGGCACGCT GTGCGCGCTG GGGGCCGGTG CGGGCAACTC GCCGACCGAG ATCCTGGCGA CGGTCTTCGA CGTCATGGGC GTGCCGACCG GGGTCGACGC GGCCAAGGTC CTGGACGCCG CCGAGGACAT TGTCAAGCCG ATGATCACCC GCATGCCGGT CGCCGACCGA GCCTCAATCG TGCAGGGGCG CTACGGTGTT TACAATTCAT TCCTTTTGCA CGCCGAACGT GCAGCGGACC GATACGGTGT GTCTTCTCAC GAAATACTCC GAAAGGTCGG CGAGGCTGGT TACGTCGGCG GACAGGAGGA CATGATCATC GATGTCGCCA TCGGGCTCGC GGCGCAGCGA ACCGGTTGA
|
Protein sequence | MTTIEQTIAN STTSDRPFVR LTDSTLRDGS HAVRHQFTTD NVTDVVTALD AAGVSVIEVT HGDGLAGSSF NYGFGKHTDA ELVAAAVRAA TRAKIAVLVL PGLGTVHDLK QVHSAGAQIA RVATHCTEAD VSVEHFTAAR ELGMETVGFL MLSHRIGPEQ LAKQARIMAD AGCQCVYVVD SAGALLPDMV RDRVQALVAE LGDDAQVGFH GHQNLSLGVA NSIVAYENGA RQIDGTLCAL GAGAGNSPTE ILATVFDVMG VPTGVDAAKV LDAAEDIVKP MITRMPVADR ASIVQGRYGV YNSFLLHAER AADRYGVSSH EILRKVGEAG YVGGQEDMII DVAIGLAAQR TG
|
| |