Gene Sde_3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3644 
SymbolmetX 
ID3966601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4620725 
End bp4621870 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID637922741 
Producthomoserine O-acetyltransferase 
Protein accessionYP_529111 
Protein GI90023284 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCTC CCTTTCCAGA AGATTCCGTA GGGTTAGTTA CACCGCAGCT CTTTCATTTT 
GATCAGCCGT TGGCACTTGC CAACGGCCGC TCGTTAGATT CGTACGAGTT AATGGTAGAA
ACCTATGGCG AATTAAATGC AGAAAAAAGC AACGGTATAT TAATCTGCCA CGCGCTTTCT
GGCAGCCATC ACGTGGCTGG CTATCACAGC GAAGACGACA AAAAACCCGG CTGGTGGGAA
CACTACGTAG GCCCGGGCAA GCCAATAGAC ACCAACCGCT TCTATGTGGT GTGCATGAAT
AACATTGGCG GCTGCCACGG CTCCACGGGC CCGCAATCTA TCAACCCCGC CACAGGTAAA
CCCTGGGGCA GCAGCTTTCC GTTTTTGCGC GTGCGCGACT GGGTAGAAAC TCAAGTGCGT
CTCGCCGACC GCCTCGGCAT TAACCAGTGG GCCGCAGTTG TGGGCGGCAG CCTAGGCGGC
ATGCAAGCCA TGCGCTGGGC GCTCGAGCAC CCCAATCGCC TGCGCCACTG CGTTGTTATC
GCCTCGGCAA TGAATTTAAC CGCACAAAAT ATTGCCTTTA ATGAAACCGC ACGGCAGGCC
ATTCAAAGCG ACCCCAACTT TTTTAATGGC GATTACCTCG CACAAAGCAC CCTGCCAAAG
CGCGGCTTAA GTGTTGCGCG CATGATCGGC CATATTACCT ATTTGTCTGA CGATGGCATG
GGCAAAAAGT TTGGCCGCGA ATTGCGCAGC GGCAGTTTCG ATCGCGGCAA TGACCAACTG
GTAGAATTCC AAATAGAAAG TTACTTGCGC TACCAAGGCA GCACTTTTTC TGAAGTATTC
GACGCAAACA CCTATATATT AATGACCCGC GCATTGGATT ACTTTGACCT TGCGCGCGAG
TACAACGGCG ATGCAGCCGA AGCGTTCAAA CAAGCTAGCT GCAAATTTTT AGTGGTTTCT
TTCACCTCAG ACTGGCGCTT TGCCCCAGAG CGTTCGCGCG AGATTGTAAC CGCGCTTATG
CGCGCAAACC GCGACGTAGT GTACGGCGAA ATTGACTCTG TGCACGGCCA CGATGCGTTT
CTTGTGCCCA ATCAATTGCG TTACTGGGAG CTATTCTCTG CTTATATGAA CCGGGTGGAG
GTGTAA
 
Protein sequence
MPAPFPEDSV GLVTPQLFHF DQPLALANGR SLDSYELMVE TYGELNAEKS NGILICHALS 
GSHHVAGYHS EDDKKPGWWE HYVGPGKPID TNRFYVVCMN NIGGCHGSTG PQSINPATGK
PWGSSFPFLR VRDWVETQVR LADRLGINQW AAVVGGSLGG MQAMRWALEH PNRLRHCVVI
ASAMNLTAQN IAFNETARQA IQSDPNFFNG DYLAQSTLPK RGLSVARMIG HITYLSDDGM
GKKFGRELRS GSFDRGNDQL VEFQIESYLR YQGSTFSEVF DANTYILMTR ALDYFDLARE
YNGDAAEAFK QASCKFLVVS FTSDWRFAPE RSREIVTALM RANRDVVYGE IDSVHGHDAF
LVPNQLRYWE LFSAYMNRVE V