Gene Caci_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1200 
Symbol 
ID8332535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1354785 
End bp1355840 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content74% 
IMG OID644954347 
Productthreonine synthase 
Protein accessionYP_003111966 
Protein GI256390402 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.887008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTGGC GAGGCGTCAT CGAGGAGTAC CGACAGTGGC TGCCGGTCGA CGCCGGCACG 
CCCGTGGTCA CGCTGGGCGA GGGCGGCACG CCGCTGCTGC CCGCGCCGCG CCTGTCCGCC
CTCACCGGGT GCGAGGTGTT CGTCAAGGTC GAGGGCATGA ACCCGACCGG CTCGTTCAAG
GACCGCGGCA TGACCACCGC GATCTCGCTG GCCAAGCAGG CCGGCGCCGA GGCCGTGGTG
TGCGCCTCCA CCGGCAACAC CAGCTCCTCG GCCGCCGCCT ACGCGGTGCG CGGCGGGCTC
AAGCCGGTGG TGCTGGTGCC GGCCGGCAAG ATCGCGCTGG GCAAGCTGGC CCAGGCGCTG
GCGCACGGCG CGACCCAGCT GCCGGTCGAG GGCAACTTCG ACGACTGTCT CCGGCTGGCC
CGCGAGCTGG CCGCGAAGTA CCCGGTCGCG CTGGTGAACT CGGTGAACCC GGTCCGGCTG
CACGGTCAGA AGACCGCCGC GTTCGAGGTC GTCGACGTGC TCGGCGACGC CCCGGACATC
CACGCCCTGC CGGTCGGCAA CGCCGGCAAC ATCTCCGCGT ACTGGCTCGG GTATCAGGAG
TACGCGAAGG AGGGTCAGGC CTCGCGCACG CCCCGCATGT TCGGCTTCCA GGCCGCCGGC
GCCGCCCCGC TCGTGCACGG CGCGCCGGTC CCGGACCCGG ACACCATCGC CACCGCCATC
CGCATCGGCA ACCCCGCGTC CTGGGACCTG GCGATCGCCG CGCGCGAGGA CTCCTCCGGC
GTCATCGAGG CGGTGACCGA CGAGGAGATC CTCGCCGCGC ACCGTGTGCT CTCGGCCGAG
GAGGGCGTGT TCGTCGAGCC GGCCTCCGCC GCCGGCGTCG CGGGCATCCT CAAGCTCGCC
CGAGCCGGCC GCCTCGAGTC CGGCAAGCGC ATCGTGGTCA CGGTCACCGG CCACGGTCTG
AAGGACCCCG AGTGGGCGGT CAAGGCCGCG CCGCCGCTGC CGGACGCCGT CCCGGCGGAG
GTCGCCGCGG TCGCCGAGGC TCTCAGCCTC AGCTAA
 
Protein sequence
MAWRGVIEEY RQWLPVDAGT PVVTLGEGGT PLLPAPRLSA LTGCEVFVKV EGMNPTGSFK 
DRGMTTAISL AKQAGAEAVV CASTGNTSSS AAAYAVRGGL KPVVLVPAGK IALGKLAQAL
AHGATQLPVE GNFDDCLRLA RELAAKYPVA LVNSVNPVRL HGQKTAAFEV VDVLGDAPDI
HALPVGNAGN ISAYWLGYQE YAKEGQASRT PRMFGFQAAG AAPLVHGAPV PDPDTIATAI
RIGNPASWDL AIAAREDSSG VIEAVTDEEI LAAHRVLSAE EGVFVEPASA AGVAGILKLA
RAGRLESGKR IVVTVTGHGL KDPEWAVKAA PPLPDAVPAE VAAVAEALSL S