Gene Namu_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2096 
Symbol 
ID8447707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2313952 
End bp2315064 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content73% 
IMG OID645041219 
Productthreonine synthase 
Protein accessionYP_003201463 
Protein GI258652307 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000119841 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0436227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCAC CCGTGTCGGA CGCGAGCGTG ACCGGGATCG CCGGCGGGGC CGGGGCCGGG 
TGGCCGGGGC TGATCGCCGC CTACGCCGAC CGGGTCGCCG TGCCGCCCGG GGCCCGGGTG
GTCACGTTGC TGGAGGGCGG CACGCCGCTG CTGCCGGCGC ACACCCTGTC CGACCGGCTG
GGCGTGCAGG TCTACCTCAA GGTCGAGGGG GCCAACCCGA CCGGCTCGTT CAAGGACCGG
GGGATGACGG TCGCGGTCAC CCACGCGCTG GCCCGCGGTG CCCGCGCGGT GATCTGCGCC
TCCACCGGCA ACACCTCGGC CTCGGCGGCC GCCTATGCGG CCCGGGCCGG ACTGACCAGC
GCGGTGCTGA TCCCGCAGGG CAAGATCGCC AGCGGCAAGC TGGCCCAGGC CGTGGCCTAC
GGGGCCCGGA TCCTGCAGGT CGAGGGCAAC TTCGACGACT GCCTGGAGCT GGCCCGCAAG
ACCGCGGCCA CCACCGACGA GATCGAGCTG GTGAACTCGG TCAACCCGGT GCGGATCGAA
GGGCAGAAGA CCGCCGCGTT CGAGATCTGC GACGTGCTGG GCCGGGCCCC GGACGTGCAC
TTCCTGCCGG TCGGCAACGC GGGCAACATC ACCGCGTACT GGAAGGGGTA CCGCGAGTAC
CACGCGGACG GCGTCATCGA CGCTCTGCCC CGGATGTTCG GCTTCCAGGC CGCCGGCGCC
GCGCCGCTGG TGCTGGGCCA TCCGGTGCGC GACCCGGACA CCATCGCGAC CGCGATCCGG
ATCGGCGCCC CGGCGTCTTG GAGCGGGGCG ATCGGCGCCC GGGACGAGTC CGGCGGTCTG
ATCGACATGG TCACCGACGA CCAGATCCTG GACGCCTACC GGCTGCTCGC CTCGACCGAG
GGCGTCTTCG TCGAGCCCGC GTCGGCCGCG TCGGTGGCCG GGCTGACCGC CACCGTCGCC
GACGGCCGGT TGCCGGCCGG GTCGCTGGTG GTCTGCACCG TCACCGGCAA CGGGCTCAAG
GACCCGGACA CCGCGATGTC GTTCATGACC GAACCGGTCG TCCTGCCGGT CGCGGCCGAG
GCGGTCACCG ACGCCCTGGG GCTCACCGGA TGA
 
Protein sequence
MDAPVSDASV TGIAGGAGAG WPGLIAAYAD RVAVPPGARV VTLLEGGTPL LPAHTLSDRL 
GVQVYLKVEG ANPTGSFKDR GMTVAVTHAL ARGARAVICA STGNTSASAA AYAARAGLTS
AVLIPQGKIA SGKLAQAVAY GARILQVEGN FDDCLELARK TAATTDEIEL VNSVNPVRIE
GQKTAAFEIC DVLGRAPDVH FLPVGNAGNI TAYWKGYREY HADGVIDALP RMFGFQAAGA
APLVLGHPVR DPDTIATAIR IGAPASWSGA IGARDESGGL IDMVTDDQIL DAYRLLASTE
GVFVEPASAA SVAGLTATVA DGRLPAGSLV VCTVTGNGLK DPDTAMSFMT EPVVLPVAAE
AVTDALGLTG