Gene Noca_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3306 
Symbol 
ID4600198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3512158 
End bp3513924 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content73% 
IMG OID639777912 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_924495 
Protein GI119717530 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.364565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGG GCCTGACCGG TGGTCAGCTG ATCGCCAAGA TGCTGGCGGC CGAGGCGGTC 
GACCAGGTGT TCGGGATCAT CGACGGCACG TACTTCGGGC TCTACCGGGC GCTGGGCGAC
GAGGGCATCA CGCTGCACTC CCCGCGCCAC GAGGCGGCCG CGGCGCACAT GGCGGGCGCC
TACGCCCGGG TCAGCGGACG GCTCGGGGTC TGCATGGCCT CCAACGGCCC CGGCGTCGCG
AACGTGCTGC CCGGCGTCGC CGTCGAGCAG GGCGAGGGCA ACCGGATCCT GCTGATCACC
AGCGGCCGGC GGGTCGGCAT CACCCACCCC GACCGCGGCG GCACCTACCA GAACTTCGAC
CAGACCGCCG TGACGGCGCC GATGGCCAAG TGGTCCTGCC ACGTGCCGAG CGCCGACCGA
CTGCCCGAGA TCCTGCGGCG GGCGCTGCGG GTGAGCTTCA GCGGCCGGCC GGGCGTGGTG
CACGTCGACG TCCCCGAGAA CATCGTCAAC CAGCCGACCG GCCTCGACGC CGGGGCGGTG
CGGGCGCCGG AGTCCTACCG GCGTACGACG CCGCTGCAGC CCGACCCGGT CCTCGTCGAG
AAGGCCGCGC AGCTGCTCGT CGAGGCCGAG TGGCCGATGA TCCACGCCGG CTCCGGGGTC
TACCACGCCG GCGCGGAAGC CGAGCTCGCG CGGCTGGCCG GCCTGCTCGC CGCACCGGTG
ACCACCAGCT GGGCGGCCCG CGGCGCACTG TCGGAGGCCC GGCCCGAGGC CATCCCGATG
ACGGCACTGG GCGTCAACGA CGAGGTCCGC AACGCCGCCG ACGTGGCACT GGTCGTCGGC
AGCCGGCTGG GGGAGACCGA CTGGTGGGGC AAGGCGCCGA ACTGGGCGCC GCCGGCGCTG
CAGAGGACGA TCCAGGTCGA CGTCGACGAC GAGCGGCTGG GTGTCAACAA GCCGGTCAGC
CTGGCGATCC TGGCCGACGC CCGCGAGGCG CTCCGGGCCC TGGCCGACGC GGTGGAGCGG
CGGACGGTCC CGGGGCTGGC CGCGCGCCGG GACCGGCTCG AGGTGTGGCG CGGCAAGGTC
GCCGACGAGC GGGCGAAGCT CGCCAAGGCG GTCAAGCCCG GGAGCCCGGT CCACCCCGGT
CACGTGCCGT CGGTCGCGCA GCAGGTGATG CCCGACGACA CCGTGTGGGT CTTCGACGGC
GGCAACACGG TCGTGTGGTC GAACTTCCAC CACTCGGCCG ACGTGCCGCG GACCATCCTG
TCGACGTACA AGTTCGGCAT GCTCGGCGCC GGCATGGGCC AGGCCCTCGG CGCCGCGGTC
GCGGCGCCCG ACCAGCGGGT CTGCGTGCTC ATCGGCGACG GTGCGTTCGG CATGCACCCG
ACCGAGGTCG AGAGCGCGGT GCGCCTCGGG CTGCCCATCG TCTACGTGGT CCTGGTCGAC
GGCGCGTGGG GCATGGTCAA GATGAGCCAG CAGATCCAGG CCCGCCCGAT CGCGACGGTG
GCGCACAAGA TGCTGACCAA CACCACGCTG CCGGCGGACC AGATCGTGTA CGCCGACTTC
GCGCCCTGCC GCTACGACCT GATGGGCGAG GCGATGGGAG CCCACGGCGA GCTGGTGACC
GACTCGGCCG ACCTGCCGGC CGCGCTGGAG CGGGCCCGCG ACTGCGGCCG CGCCGCCGTG
GTGCAGGTGC ACGTCGACCA GGTCGAGCAC CTCTGGGCGC CGGGCCTGCA AGCGTTCAAG
AAGATGCACC AGGAGCCGAA GGGCTGA
 
Protein sequence
MAKGLTGGQL IAKMLAAEAV DQVFGIIDGT YFGLYRALGD EGITLHSPRH EAAAAHMAGA 
YARVSGRLGV CMASNGPGVA NVLPGVAVEQ GEGNRILLIT SGRRVGITHP DRGGTYQNFD
QTAVTAPMAK WSCHVPSADR LPEILRRALR VSFSGRPGVV HVDVPENIVN QPTGLDAGAV
RAPESYRRTT PLQPDPVLVE KAAQLLVEAE WPMIHAGSGV YHAGAEAELA RLAGLLAAPV
TTSWAARGAL SEARPEAIPM TALGVNDEVR NAADVALVVG SRLGETDWWG KAPNWAPPAL
QRTIQVDVDD ERLGVNKPVS LAILADAREA LRALADAVER RTVPGLAARR DRLEVWRGKV
ADERAKLAKA VKPGSPVHPG HVPSVAQQVM PDDTVWVFDG GNTVVWSNFH HSADVPRTIL
STYKFGMLGA GMGQALGAAV AAPDQRVCVL IGDGAFGMHP TEVESAVRLG LPIVYVVLVD
GAWGMVKMSQ QIQARPIATV AHKMLTNTTL PADQIVYADF APCRYDLMGE AMGAHGELVT
DSADLPAALE RARDCGRAAV VQVHVDQVEH LWAPGLQAFK KMHQEPKG