Gene EcSMS35_2516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2516 
SymboldsdA 
ID6145252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2570037 
End bp2571353 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content51% 
IMG OID641617388 
ProductD-serine dehydratase 
Protein accessionYP_001744559 
Protein GI170681826 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3048] D-serine dehydratase 
TIGRFAM ID[TIGR02035] D-serine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.232923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGC TCATCGCCCA GTATCCGTTG GTAGAGGATC TGGTTGCTCT TCAAGAAACC 
ACCTGGTTTA ATCCTGGCAC GACCTCATTG GCTGAAGGTT TACCTTATGT TGGCCTGACC
GAACAGGATG TTCAGGACGC CCATGCGCGC TTATCCCGTT TTGCACCCTA TCTGGCAAAA
GCATTTCCTG AAACTGCTGC TACTGGGGGG ATTATTGAAT CAGAACTGGT TGCCATTCCG
GCTATGCAAA AACGGCTGGA AAAGGAATAT CAACAACCGA TCAGCGGGCA ACTGCTACTG
AAAAAAGATA GCCATTTGCC CATTTCCGGC TCCATAAAAG CACGCGGCGG GATTTATGAA
GTCCTGGCAC ATGCAGAAAA ACTGGCTCTG GAAGCGGGGT TGCTGACGCT TGAAGATGAC
TACAGCAAAC TGCTTTCCCC GGAGTTTAAA CAGTTCTTTA GTCAATACAG CATTGCTGTG
GGCTCAACCG GAAATCTGGG GTTATCAATC GGCATAATGA GCGCCCGCAT TGGCTTTAAG
GTGACAGTGC ATATGTCTGC TGATGCCCGG ACGTGGAAAA AAGCGAAACT GCGCAGCCAT
GGCGTTACAG TCGTGGAATA TGAGCAAGAT TATGGTGTTG CTGTCGAGGA AGGACGTAAA
GCAGCGCAGT CTGACCCGAA CTGTTTCTTT ATTGATGACG AGAATTCCCG CACGTTGTTC
CTTGGGTATT CCGTCGCTGG CCAGCGTCTT AAAGCGCAAT TTGCCCAGCA AGGTCGTATC
GTCGATGCTG ATAACCCTCT GTTTGTCTAT CTGCCGTGTG GTGTTGGTGG TGGTCCTGGT
GGCGTCGCAT TCGGACTTAA ACTGGCGTTT GGCGATCATG TTCACTGCTT TTTTGCCGAA
CCAACGCACT CCCCTTGTAT GTTGTTAGGC GTCCATACAG GATTACACGA TCAGATTTCT
GTTCAGGATA TTGGTATCGA CAACCTTACC GCAGCGGATG GCCTTGCAGT TGGTCGCGCA
TCAGGCTTTG TCGGGCGGGC GATGGAGCGT CTGCTGGATG GCTTCTATAC CCTTAGCGAT
CAAACCATGT ATGACATGCT TGGCTGGCTG GCGCAGGAAG AAGGTATTCG TCTTGAACCT
TCGGCACTGG CGGGTATGGC TGGGCCTCAG CGCGTGTGCG CATCAGTAAG TTACCAACAG
ATGCACGGTT TCAGCGCAGA ACAACTGCAT AATGCCACTC ATCTGGTGTG GGCGACGGGA
GGTGGAATGG TGCCGGAAGA AGAGATGGAG CAATATCTGG CAAAAGGCCG TTCATAA
 
Protein sequence
MNSLIAQYPL VEDLVALQET TWFNPGTTSL AEGLPYVGLT EQDVQDAHAR LSRFAPYLAK 
AFPETAATGG IIESELVAIP AMQKRLEKEY QQPISGQLLL KKDSHLPISG SIKARGGIYE
VLAHAEKLAL EAGLLTLEDD YSKLLSPEFK QFFSQYSIAV GSTGNLGLSI GIMSARIGFK
VTVHMSADAR TWKKAKLRSH GVTVVEYEQD YGVAVEEGRK AAQSDPNCFF IDDENSRTLF
LGYSVAGQRL KAQFAQQGRI VDADNPLFVY LPCGVGGGPG GVAFGLKLAF GDHVHCFFAE
PTHSPCMLLG VHTGLHDQIS VQDIGIDNLT AADGLAVGRA SGFVGRAMER LLDGFYTLSD
QTMYDMLGWL AQEEGIRLEP SALAGMAGPQ RVCASVSYQQ MHGFSAEQLH NATHLVWATG
GGMVPEEEME QYLAKGRS