Gene Noca_4175 details

Gene Information

Locus tag: Noca_4175
Symbol:
ID: 4596689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4410986
End bp: 4412767
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 71%
IMG OID: 639778781
Product: thiamine pyrophosphate protein
Protein accession: YP_925359
Protein GI: 119718394
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
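
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4410986, 4412767)
print(length_bp)                  # 1782
print(protein_length(length_bp))  # 593
```

Both results match the Gene Length and Protein Length fields of the record.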


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.296619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACGA CCGTGGCCGA CCAGCTGCTC GCCCGCCTGC GGGAGTGGGG GGTGGCGCAG 
GTCTTCGGCT ATCCCGGCGA CGGGATCAAC GGGATCCTCG GCGCGTTCTC CCGCGCCGAC
GACCAGCCGC GCTTCATCCA GTCCCGCCAC GAGGAGATGA GCGCGTTCCA GGCGGTGGGC
TACGCGAAGT TCTCCGGCCG CCCCGGCGTC TGCATGGCGA CCTCCGGGCC CGGCGCGATC
CACCTGCTCA ACGGCCTCTA CGACGCGAAG CTAGACCACG TCCCCGTGGT GGCGATCGTC
GGGCAGACCA ACCGCACCGC GATGGGCGGC AGCTACCAGC AGGAGGTCGA CCTGATCAGC
CTGTTCAAGG ACGTCGCCGG CGACTACGTG CAGATGGTGA CCGTCCCCGA GCAGCTGCCG
AACGTGCTGG ACCGGGCGAT CCGGGTCGCG ACCGCCCGCC GCGCGCCGAC CGCGATCATC
GTGCCCAACG ACGTCCAGGA GCTGGAGTAC GCCGCCCCGC AGCACGCGTT CAAGATGGTG
CCCTCCAGCC TCACCCCGAC CCAGCGGCCC ACGGTCACCC CGGACCCCGC GGCGCTGCGC
CAGGCCGCCG ACGTCTTGAA CTCCGGCCGC AAGGTCGCGC TGCTGGTCGG CCAGGGCGCC
CGCGGCGCGG ACGCCGAGAT CGCCGAGGTG GTGGACCTGC TGGGCGCCGG GGTCGCGAAG
GCGCTGCTCG GCAAGGACGT GCTCAGCGAC GAGCTGCCCT GGGTGACCGG CTCGATCGGC
CTGCTCGGCA CCCGCCCGAG CTACGAGCTG ATGATGGGCT GCGACACGCT GCTGACCGTC
GGCTCGAGCT TCCCCTACAC GCAGTTCATG CCGGAGCTGG ACCAGGCCCG TGCCGTGCAG
ATCGACCTCG ACGGCACGAT GATCGGGATG CGCTACCCCT ACGAGGTCAA CCTCGTCGGC
GACGCGCAGG CCACGCTGCG CGCGCTGATC CCGCTGCTGG AGAAGCAGCA GGACCGCTCC
TGGCACGACG AGATCTGCGC GAACGTCACC GACTGGTGGG AGGTGATGGA CGCCGAGGCC
CACGTCGCGG CCGACCCGGT CAACCCGATG CGGATCTTCA ACGAGTTCTC GAAGGTCGCG
CCGACCGACG CGATCATCTC CTCCGACAGC GGCTCGGCCG CGAACTGGTA CGCCCGGCAC
GTCAAGATGC GCGGCCGGAT GCGCGGCTCG CTGTCGGGCA CGCTCGCGAC GATGGGCCCG
GCGGTGCCGT ACGCGATCGG CGCCAAGTTC GCCCACCCCG ACCGGCCCGC GATCGCCTTC
GAGGGCGACG GCGCGATGCA GATGAACGGG CTGGCCGAGC TGCTCACGAT CGCCCGCTAC
TGGCCGGAGT GGGCCGACCC GCGGCTGGTG GTCGCCGTAC TCCACAACAA CGACCTCAAC
CAGGTCACCT GGGAGCTGCG CGCGATGGGC GGGACGCCCA CCTTCGTGGA GTCCCAGGCG
CTGCCGGACG TCTCGTACGC CGACTTCGCG CGCTCGTGCG GCCTGGGCGC GACGACCGTG
ACCGACCCCG ACCAGCTCGC CGACGCGTGG CAGGTCGCGC TGTCCTCGGA CCGCCCGCAC
CTGCTCGACG TGCACTGCGA CCCCGACGTG CCGCCGATCC CGCCGCACGC CACCCTCGAG
CAGATGACCG CGATGGCCAA GGCGCTGATC AAGGGCGACA CCAGCCGCTG GGGCGTGATG
AAGGAGGGCA TCAGGACCAA GGCCCAGGAG CTGCTGCATT GA
 
Protein sequence
MTTTVADQLL ARLREWGVAQ VFGYPGDGIN GILGAFSRAD DQPRFIQSRH EEMSAFQAVG 
YAKFSGRPGV CMATSGPGAI HLLNGLYDAK LDHVPVVAIV GQTNRTAMGG SYQQEVDLIS
LFKDVAGDYV QMVTVPEQLP NVLDRAIRVA TARRAPTAII VPNDVQELEY AAPQHAFKMV
PSSLTPTQRP TVTPDPAALR QAADVLNSGR KVALLVGQGA RGADAEIAEV VDLLGAGVAK
ALLGKDVLSD ELPWVTGSIG LLGTRPSYEL MMGCDTLLTV GSSFPYTQFM PELDQARAVQ
IDLDGTMIGM RYPYEVNLVG DAQATLRALI PLLEKQQDRS WHDEICANVT DWWEVMDAEA
HVAADPVNPM RIFNEFSKVA PTDAIISSDS GSAANWYARH VKMRGRMRGS LSGTLATMGP
AVPYAIGAKF AHPDRPAIAF EGDGAMQMNG LAELLTIARY WPEWADPRLV VAVLHNNDLN
QVTWELRAMG GTPTFVESQA LPDVSYADFA RSCGLGATTV TDPDQLADAW QVALSSDRPH
LLDVHCDPDV PPIPPHATLE QMTAMAKALI KGDTSRWGVM KEGIRTKAQE LLH
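
The sequences above are laid out in blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the function names are hypothetical, and alternative start codons (which table 11 reads as M at the first position) are not handled because this gene starts with ATG.

```python
# Codon table built in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 bases and last 12 bases of the gene sequence above.
print(translate(clean_sequence("ATGACGACGA CCGTG")))  # MTTTV
print(translate(clean_sequence("CTGCTGCATT GA")))     # LLH
```

The two fragments translate to the first five and last three residues of the protein sequence. Passing the full gene sequence through clean_sequence and gc_fraction gives a value that can be compared with the GC content field.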