Gene Noca_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2149 
Symbol 
ID4599209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2298450 
End bp2299478 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content64% 
IMG OID639776752 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_923345 
Protein GI119716380 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGAAA CGCGTACGAC TGTCTTTGTC CAGGACGTCA CTCTCCGCGA CGGCATGCAC 
GCCATCCGAC ACCAGCTTGC GCCCGGCGAG GTAGCCCAGA TTGCCGCAGC CCTTGATGTC
GCAGGGGTCG ACGCCATCGA AATCTCGCAT GGCGACGGTC TCGCCGGAAG CAGCCTCAAC
TACGGTCCCG GAAGCCATAC CGACTGGGAG TGGATCGAGG CTGCAGCTGC CAACATCCAA
CGCGCTCGGT TGACCACGCT CCTACTTCCG GGCATCGGGA CGGTGGATGA ATTGCGCAAG
GCGCACGACC TCGGTGTTCG CTCCGTTCGT GTCGCGACGC ATTGCACGGA AGCCGACGTT
TCGGCCCAGC ACATCGAGAC GGCGCGCGAC CTCGGCATGG ATGTCGCAGG ATTTCTGATG
ATGAGCCACA TGGCTGCGGC CAGCGAGCTG GCAGCCCAAG CGGCTCTCAT GGAGTCCTAC
GGCGCCCATT GCGTCTACGT GACGGACTCG GGTGGCCGGC TGACCATGGA CGCCGTGCGT
GACCGCGTCC GGGCATACCG TGACGTCTTG GATGCCACGA CCGAGATCGG TATCCATGCC
CATGAGAACT TGTCTCTGTC GGTCGCTAAC AGCGTGGTTG CGGTTGAAGC GGGCGTCACT
CGGGTCGACG CCTCGCTCGC GGGACAAGGT GCAGGTGCGG GAAACTGCCC CATCGAGGCC
TTTGTCGCCG TGGCCAATAT CCTCGGCTGG CAACATGGCT GCGACCTCTA CCAACTGCAA
GACGCTGCCG AGGACCTCGT TCGCCCGCTC CAAGACCGGC CTGTACGCGT GGACCGGGAA
ACCTTGACGC TCGGCTACGC CGGCGTGTAC TCCAGCTTCT TGCGGCACGC CGAGAAGGCG
GCTCAGACCT ACGACCTCGA CGTTCGAACC ATCCTGACCG AGGTGGGGAA TCGCCGGCTC
GTCGGAGGCC AAGAAGACAT GATCGTCGAC ATCGCCATGG AACTGTCCGA GGTAGCGGCA
GACCGTTGA
 
Protein sequence
MNETRTTVFV QDVTLRDGMH AIRHQLAPGE VAQIAAALDV AGVDAIEISH GDGLAGSSLN 
YGPGSHTDWE WIEAAAANIQ RARLTTLLLP GIGTVDELRK AHDLGVRSVR VATHCTEADV
SAQHIETARD LGMDVAGFLM MSHMAAASEL AAQAALMESY GAHCVYVTDS GGRLTMDAVR
DRVRAYRDVL DATTEIGIHA HENLSLSVAN SVVAVEAGVT RVDASLAGQG AGAGNCPIEA
FVAVANILGW QHGCDLYQLQ DAAEDLVRPL QDRPVRVDRE TLTLGYAGVY SSFLRHAEKA
AQTYDLDVRT ILTEVGNRRL VGGQEDMIVD IAMELSEVAA DR