Gene Noca_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0036 
Symbol 
ID4598390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp39830 
End bp41461 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content73% 
IMG OID639774651 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_921273 
Protein GI119714308 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCATG TAGGCGACGC GATGATCGAC GCGATCGCGA GCGCAGGCGT CCGACGGCTC 
TACACCGTGC CCGGCGAGAG CTTCTTGGAG GTGCTCGACG CCGCCGACCG GCACCCCGCG
CTCCGCCAGT TCTCCACCCG GCACGAGTCG GGCGCCTCGT TCATGGCCGA CGCGGACGCC
AAGACGTCCG GCGTGCCCGC CGTCGCGATG GCCACCCGGG GTCCGGGCGC GGCGAACCTC
AGCATCGGCG TGCACACCGC TCACCAGGAC GGCACGCCCA TGCTGGTTCT GATCGGGGAC
GTGGAGACGC CGCGGATCTT CCGCGGCGCC TTCCAGGAGG TCGATCTGCC GGCGTTCTAC
CGGCCGATCA CCAAGGCGGC CATGACCGCC CGCCGCGGGG ACCGGCTGCC CGAGATGGTG
ACCGACGCGC TCCGGATCGC AGTCTCCGGA CGCCCCGGCC CGGTCATGAT CTCGCTGCCC
GCCGACCTGC TCGCGGAAGA GTTCACCGGG CCTGCGCCGG CGCCGCTCGC GCCCGTCGCG
CCCGCCGGAT GTCCCCCCGA GACCGTCCGC CGGCTGCGGA GCACCCTGGA ACGGGCGCAG
CGACCGGTGG TCATCGCCGG CGCGGGCGTT CGCCACGCCA CCGACCGCCT CGTCGCGCTC
TGCGAGGCCT ACGGTCTCGG CGTGTACGCC GCGATGCGAC GGCAGGACGT CTTCCCCAAC
GACCACCCGC TCTACCTCGG CCACCTCGGT GTCTCGCCCG CGCCCGGCAC CGTGGACGCG
CTCCGCGAGG CCGACGTGGT GCTGATCCTC GGCGCTGCGC TGGACCAGAT GACGACCCAG
CAGTTCACGC TGCCCGCCGC CTCGGCGCAC CGGATCCACG TCGACGCCGA TCCGCTCACC
CTCGGCGCCG GCCTGCCCGT CGACGAGGCT GTCGTCGCCG AGCCGCGCGA GCTGATCGAG
GCGCTGCTAG CCGAACCGCC CACCGCGCCG CACCGGGCCT GGGACGCCGC GCACCGACGG
TTCCTGGAGT CGACATGCGC CCCCCCGCGC ACCCCGGCCC GGGGCTGCGA CCCCGGTGCT
GTGATCGAGG CCATGAAGCG CGCGTGGCCC GCCGACACGA TCGTCACCAG CGACGCGGGC
AACTTCTCGG GATTCCTCCA CCAGTACTGG CGGTTCACGA CCCCCCGCAG CCAGGCCGCG
CCCATCAACG GCGCGATGGG CTATGCCGTG CCAGGCGCCG TCGGCGCCAA GGCTGCCTCC
CCCGACCGGC ACGTGCTCGG CGTGGTCGGC GACGGTGGCT TCCTGATGAC CGGCAGCGAG
GTCGAGACCG CCGTGCGCTA CGGCCTGCCG CTGACGATCG TCGTCCTGCG CAACGGGCTC
TACGGCACGA TCGCGATGCA CCAGGCCCAG GAGATGGGCC GCCTGTCGGC CGTCGACATC
GGCGACGTGG ACCTCGCGGC ATACGGCCGC AGCCTCGGCG CGGAGGGCAT CACGGTCGAG
GAGCCGGGTC AGCTCGACGA GGCCATGCGC ACTGCCGCGA CCAGCGATGC GGTCACCGTC
GTCGACGTCG TCACCGACCC GGACCTCATC ACCGCGTCCG GCCGGCTCTC GGAGATGTTC
CAGACCTCCT GA
 
Protein sequence
MPHVGDAMID AIASAGVRRL YTVPGESFLE VLDAADRHPA LRQFSTRHES GASFMADADA 
KTSGVPAVAM ATRGPGAANL SIGVHTAHQD GTPMLVLIGD VETPRIFRGA FQEVDLPAFY
RPITKAAMTA RRGDRLPEMV TDALRIAVSG RPGPVMISLP ADLLAEEFTG PAPAPLAPVA
PAGCPPETVR RLRSTLERAQ RPVVIAGAGV RHATDRLVAL CEAYGLGVYA AMRRQDVFPN
DHPLYLGHLG VSPAPGTVDA LREADVVLIL GAALDQMTTQ QFTLPAASAH RIHVDADPLT
LGAGLPVDEA VVAEPRELIE ALLAEPPTAP HRAWDAAHRR FLESTCAPPR TPARGCDPGA
VIEAMKRAWP ADTIVTSDAG NFSGFLHQYW RFTTPRSQAA PINGAMGYAV PGAVGAKAAS
PDRHVLGVVG DGGFLMTGSE VETAVRYGLP LTIVVLRNGL YGTIAMHQAQ EMGRLSAVDI
GDVDLAAYGR SLGAEGITVE EPGQLDEAMR TAATSDAVTV VDVVTDPDLI TASGRLSEMF
QTS