Gene Noca_1649 details

Gene Information

Locus tag: Noca_1649
Symbol:
ID: 4600028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1751756
End bp: 1752775
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 72%
IMG OID: 639776248
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_922849
Protein GI: 119715884
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
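
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1751756
end_bp = 1752775
reported_gene_length = 1020      # bp
reported_protein_length = 339    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


assert gene_length(start_bp, end_bp) == reported_gene_length
assert protein_length(reported_gene_length) == reported_protein_length
```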


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.264755
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGCAC ACCAGATCTT CGTCCAGGAC GTCACGCTGC GCGACGGGAT GCACGCCGTG 
CGGCACCGGA TCGGGCTCGA CGACGTGCGC CGCATCGTCG CCGCGCTCGA CGCCGCCGGC
GTCGACGCCA TCGAGGTCGC CCACGGCGAC GGCCTCGCCG GCTCCTCGGT GAACTACGGA
CCCGGATCCC ACACCGACTG GGAGTGGATC GAGGCCGCGG CCGACGTGCT CGAGCGGGCC
CGCCTGACCA CGCTGCTGCT ACCCGGGGTC GGCACCATCC ACGAGCTCAA GACCGCCTAC
GACCTCGGGG TCCGCTCGGT CCGGGTCGCC ACGCACTGCA CGGAGGCCGA CATCTCCGCC
CAGCACATCA CCGCGGCCCG GGAGATCGGC ATGGACGTCT CCGGCTTCCT GATGCTCTCC
CACATGGCGC CGCCGGCGGA GCTCGCCAAG CAGGCCCTGC TCATGGAGTC CTACGGTGCG
CACTGCGTCT ACGTCACCGA CTCCGGCGGC CGGCTCACGA TGAACGACGT CCGCGACCGG
GTCGCGGCGT ACCGAGACGT CCTCGACCCT GCCACCGAGA TCGGCATCCA CGCCCACGAG
AACCTCTCGC TCTCGGTCGC CAACTCCGTC GTGGCCGTCG AGACCGGTGC GGTGCGGGTC
GACGCCTCCC TCGCCGGGCA CGGTGCCGGC GCCGGCAACT GTCCGATCGA GGCGTTCGTC
GCGGTGGCGA ACCTCTCCGG CTTCGAGCAC GGCTGCGACC TGTTCGCGCT GCAGGACGCC
GCCGACGACC TGGTCCGCCC GTTGCAGGAC CGCCCGGTCC GGGTGGACCG CGAGACCCTC
ACCCTCGGCT ACGCCGGGGT CTACTCCTCG TTCCTACGGC ACGCCGAGCG GGCCGCCGAT
CAGTACGGCG TCGACGTGCG CGAGCTGCTG ATGGAGTGCG GCCGGCGCGG CCTGGTCGGT
GGCCAGGAGG ACATGATCAT CGACATCGCG CTCGATCAGG TCGGCGCCGT CGCCAGCTGA
 
Protein sequence
MNAHQIFVQD VTLRDGMHAV RHRIGLDDVR RIVAALDAAG VDAIEVAHGD GLAGSSVNYG 
PGSHTDWEWI EAAADVLERA RLTTLLLPGV GTIHELKTAY DLGVRSVRVA THCTEADISA
QHITAAREIG MDVSGFLMLS HMAPPAELAK QALLMESYGA HCVYVTDSGG RLTMNDVRDR
VAAYRDVLDP ATEIGIHAHE NLSLSVANSV VAVETGAVRV DASLAGHGAG AGNCPIEAFV
AVANLSGFEH GCDLFALQDA ADDLVRPLQD RPVRVDRETL TLGYAGVYSS FLRHAERAAD
QYGVDVRELL MECGRRGLVG GQEDMIIDIA LDQVGAVAS
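
The protein sequence follows from the gene sequence under translation table 11, where the amino acid assignments match the standard code and an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline; `translate_cds` and `gc_fraction` are hypothetical helper names, and the start-codon set is a simplification that covers only the most common bacterial starts.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplification: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a start codon at position 1 as M
    and stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = (
    "GTGAACGCAC ACCAGATCTT CGTCCAGGAC GTCACGCTGC GCGACGGGAT GCACGCCGTG"
)
print(translate_cds(first_line))  # MNAHQIFVQDVTLRDGMHAV
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above, and `gc_fraction` can be used in the same way to compare against the reported GC content.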