Gene Noca_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3987 
Symbol 
ID4598122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4206245 
End bp4207519 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID639778592 
Productthreonine synthase 
Protein accessionYP_925171 
Protein GI119718206 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0874528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCG TCGTCATCGA GAAGACCAGC ACGCCGGGCC TGCGCGAGGG CGCGTTCGGC 
AACGCCACCG CCCTCTCCTG CCGCGAGTGC GGCCACCAGG TCGAGCTCGG ACCGCACTAC
GCGTGTCCGG AGTGCTTCGG CCCCCTGGAG ATCGCCTACG ACTTCCCCCG GGTCACCCGC
GAGGAGATCG AGGCCGGCCC GCGCAACATC TGGCGCTACA AGGCGCTGCT GCCGGTGCCC
GCCGACATCG AGGACAGCCC CAACACCGAG CCCGGCTTCA CCCGCCTGCT GCGTGCGGGC
AACCTGGCCG CCGAGCTCGG GATCGCGAAC CTGTGGGTCA AGGACGACTC CACCAACCCC
ACCAACTCCT TCAAGGACCG CGTCGTCGCC TGCGCGCTGA GCGCGGCCCG CGAGTTCGGC
AGCAAGGTCT TCGCCTGTCC GAGCACCGGC AACCTCGCCA ACGCGGTCGC CGCCGCGGGC
GCCCGCGCCG GCATCAAGAC CGTGGTGTTC ATCCCGAGCA ACCTCGAGCA GCCCAAGCAG
GTCAACTCCG CCGTCTTCAC CGACTCCCTG GTGGCCGTCA ACGGCAACTA CGACGACGTC
AACCGGCTCG CTTCCGAGAT CGCCGGCGAG GAGGAGGGCT GGGCGTTCGT GAACGTCAAC
GTCCGCCCCT ACTACGCCGA GGGCTCCAAG ACCCTCGGCT ACGAGATCGC CGAGCAGCTC
GGCTGGCGGC TGCCGGACCA GATCGTGATC CCGGTCGCGA GCGGCTCGCA GCTGACCAAG
GTGCACAAGG CCTTCCAGGA GCTGATCCGG CTCGGCCTCG TGGAGGACAA GCCCTACCGC
GTGTACGGCG CGCAGGCCGC GGGCTGCTCC CCGGTCTCGG TCGCCTACAA GGCCGGCGTC
GACGCCATCC GCCCGGTCAA GCCGGACACG ATCGCCAAGA GCCTGGCGAT CGGCAACCCC
GCCGACGGCA TCTACGTCCT CGACATCTGC CGCGAGACCG GCGGCGCGGT CGAGGACGTC
ACCGACGACG AGATCCGCGC CGGCATCGTG CTGCTGGCCC GCACCGAGGG GATCTTCACC
GAGACCGCCG GCGGCACCAC CGTCGCCGTG TTGAAGAAGC TCGTCGAGAC CGGCCAGCTC
GACACCTCGC TCGAGACCGT GGTGATCAAC ACCGGCCACG GCCTGAAGAC CCTGGACGCG
GTCTCCGGCA CCGTGGCGCC CGCCGCGACC ATCGACCCGT CGTACCCTGC CTTCGCCGCC
ACCGGCCTGG CCTGA
 
Protein sequence
MSAVVIEKTS TPGLREGAFG NATALSCREC GHQVELGPHY ACPECFGPLE IAYDFPRVTR 
EEIEAGPRNI WRYKALLPVP ADIEDSPNTE PGFTRLLRAG NLAAELGIAN LWVKDDSTNP
TNSFKDRVVA CALSAAREFG SKVFACPSTG NLANAVAAAG ARAGIKTVVF IPSNLEQPKQ
VNSAVFTDSL VAVNGNYDDV NRLASEIAGE EEGWAFVNVN VRPYYAEGSK TLGYEIAEQL
GWRLPDQIVI PVASGSQLTK VHKAFQELIR LGLVEDKPYR VYGAQAAGCS PVSVAYKAGV
DAIRPVKPDT IAKSLAIGNP ADGIYVLDIC RETGGAVEDV TDDEIRAGIV LLARTEGIFT
ETAGGTTVAV LKKLVETGQL DTSLETVVIN TGHGLKTLDA VSGTVAPAAT IDPSYPAFAA
TGLA