Gene Sare_4032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4032 
Symbol 
ID5705012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4587996 
End bp4589045 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID641273457 
Productthreonine synthase 
Protein accessionYP_001538813 
Protein GI159039560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.969378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000421032 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGGCGGG GCCTGATCGA CACGTACCGG GATCGGCTGC CGGTCACCGC GGCCACCCCG 
GTCGTCACCC TGCACGAGGG GAACACCCCG CTGCTGCCGG CACCGTTGCT GTCGGCGCGG
ACGGGGTGCG ACGTCTACCT GAAGGTTGAG GGTGCCAATC CGACCGGTTC CTTCAAGGAC
CGGGGGATGA CCGTCGCCGT CTCCAAAGCG GTCGAGGACG GCAACAAGGT GATCATCTGT
GCCTCGACCG GTAACACCAG TGCCTCGGCC GCCGCGTACG CGGCGCGAGC CGGTCTGGTC
TGTGCGGTAC TGGTGCCGCA GGGCAAGATC GCCTTGGGCA AGCTCGCTCA GGCGTTGGTG
CACGGTGCCC GGCTGCTTCA GGTCAGCGGC AACTTCGACG ACTGCCTGTC GTTGGCCGCC
AAGCTCGCCC AGGACTACCC GGTCGCCCTG GTGAACTCGG TGAACACCGA CCGCCTGCAC
GGCCAGAAGA CCGCCGCGTT CGAGATCGTC GAGGCGCTCG GCGACGCGCC CGACATCCAC
TGCATGCCGG TAGGAAACGC GGGCAACATT TCCGCCTACT GGCTCGGCTA CTCGGAGGAA
CGGGCGGCGG GCAACGTCTC CCGGGTCCCG AAGCTCTTCG GGTTCCAGGC CGCTGGCGCC
GCGCCGATCG TCACCGGTCA GGCGGTTCGG GAACCCGCCA CGATCGCCAC CGCGATCCGG
ATCGGCAATC CGGCGAGCTG GACGAGAGCG CTGGACGCCC GGGACTCCTC GGGCGGCCTG
ATCGCCGCGG TCACCGACCG GGAGATTCTG ACCGCGTACC GGTTGCTCGC TCGGGAGGTC
GGGGTGTTCG TCGAGCTGGG CAGTGCGGCG AGTGTCGCTG GGCTGCTCCA GCAGGCCGCC
GTGGGCAAGG TGCCGGCTGG GTCGACGATT GTCTGTACGG TCACCGGACA TGGCCTGAAG
GATCCGGAGT GGGCCATCTC GACCGCCCCC GCGCCGGTGA CCATCGCCAA CGACCCCCTG
GCCGCGGCCC GCTCTCTCGA TCTGGTCTGA
 
Protein sequence
MWRGLIDTYR DRLPVTAATP VVTLHEGNTP LLPAPLLSAR TGCDVYLKVE GANPTGSFKD 
RGMTVAVSKA VEDGNKVIIC ASTGNTSASA AAYAARAGLV CAVLVPQGKI ALGKLAQALV
HGARLLQVSG NFDDCLSLAA KLAQDYPVAL VNSVNTDRLH GQKTAAFEIV EALGDAPDIH
CMPVGNAGNI SAYWLGYSEE RAAGNVSRVP KLFGFQAAGA APIVTGQAVR EPATIATAIR
IGNPASWTRA LDARDSSGGL IAAVTDREIL TAYRLLAREV GVFVELGSAA SVAGLLQQAA
VGKVPAGSTI VCTVTGHGLK DPEWAISTAP APVTIANDPL AAARSLDLV