Gene Noca_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0979 
Symbol 
ID4599755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1028088 
End bp1029302 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content74% 
IMG OID639775581 
Productthreonine dehydratase 
Protein accessionYP_922188 
Protein GI119715223 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGG TCCCGACGGT CGGGCTGGCC GACATCGAGG AGGCCCGCCG GGTCCTGGCC 
GGCGTCGCGA TCCAGACCCC GATGGAGGAG TCCCGCTGGC TCTCGGCGAT CGCCGGCGGG
CCGGTGTGGC TCAAGTGCGA GAACCTCCAG CGCACCGGGT CCTTCAAGCC CCGCGGCGCC
TACGTGCGCA TCTCCCGGCT CACCCCCGAG GAGCGGGCCC GCGGGGTCGT GGCGGCCTCG
GCGGGCAACC ACGCGCAGGG CGTGGCGCTG GCCGCGCAGC TGCTCGGCAT CAAGGCCACC
GTCTTCATGC CCGAGGGGGC GCCGATCCCC AAGGAGAAGG CGACCCGCGG GTACGGCGCG
GAGGTGCTCT TCCACGGCCG GTACCTCGAG GACGCGCTGG CCGAGGCGAC CGTGTTCGCC
GAGCGCACCG GCGCGGTGCT GATCCACCCC TTCGACCACG CCGACGTCGT CGCCGGCCAG
GGCACGGCCG GCCTCGAGAT CCTCGAGCAG GCGCCCGACC TGCAGACGGT GCTGGTCCCC
ACCGGCGGTG GTGGGCTGCT GGCGGGGGTC GCGATCGCGG TGAAGGCGCG GCGCCCCGAC
GTCCGGGTGA TCGGGGTGCA GGCGGCCGGC GCCGCGGCGT ACCCCGGCTC GCTCGCCGAG
GGGCACCCCG TCGCCCTGAC CTCGATGAAG ACGATGGCCG ACGGCATCGC CGTGGGCCTC
CCGGGGCAGG TCACCTTCGC GGCGGTGCGC GACCACGTCG ACGAGATCGT CACGGTCTCC
GAGAACTCGC TGTCCCGCTC GGTGCTGGCC GTGCTGGAGC GCGCGAAGAT GCTGGTCGAG
CCCGCCGGAG CGGCCGCGGT CGCCGCCGTG CTGGACCGGC CGGACATCTT CGCGACCCCC
GCGGTGGTCG TGCTCTCGGG CGGCAACATC GACCCCCTGC TGCTCGGCAA GGTGATCCGG
CACGGCATGG CGGCCGCCGG CCGCTACCTG AACCTACGGG TCTGCATCCC CGACCTGCCG
GGCGGGCTCG CGCAGCTGCT CACCGACATC TCCGCGGTCG GAGCGAACGT GCTCGAGGTC
GCGCACGAGC GGATCTCACC CACGCTGAAC CTCGACGAGG TCGAGGTGCA CGTCCAGCTC
GAGACCCGCG GGGAGCCGCA CACCGCGCAG GTGCTGGCGC GCCTGCGCGA GCGCGGCTAC
CGCGTGTACG AGTAG
 
Protein sequence
MTEVPTVGLA DIEEARRVLA GVAIQTPMEE SRWLSAIAGG PVWLKCENLQ RTGSFKPRGA 
YVRISRLTPE ERARGVVAAS AGNHAQGVAL AAQLLGIKAT VFMPEGAPIP KEKATRGYGA
EVLFHGRYLE DALAEATVFA ERTGAVLIHP FDHADVVAGQ GTAGLEILEQ APDLQTVLVP
TGGGGLLAGV AIAVKARRPD VRVIGVQAAG AAAYPGSLAE GHPVALTSMK TMADGIAVGL
PGQVTFAAVR DHVDEIVTVS ENSLSRSVLA VLERAKMLVE PAGAAAVAAV LDRPDIFATP
AVVVLSGGNI DPLLLGKVIR HGMAAAGRYL NLRVCIPDLP GGLAQLLTDI SAVGANVLEV
AHERISPTLN LDEVEVHVQL ETRGEPHTAQ VLARLRERGY RVYE