Gene EcSMS35_2214 details

Gene Information

Locus tag: EcSMS35_2214
Symbol: serC
ID: 6143213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2229953
End bp: 2231041
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 51%
IMG OID: 641617090
Product: phosphoserine aminotransferase
Protein accession: YP_001744264
Protein GI: 170683386
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1932] Phosphoserine aminotransferase
TIGRFAM ID: [TIGR01364] phosphoserine aminotransferase
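
The start, end, gene length, and protein length fields in this record agree with each other, and the check is simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2229953, 2231041)
print(length)                  # 1089
print(protein_length(length))  # 362
```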


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.183325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.123491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAA TCTTCAATTT TAGTTCTGGT CCGGCAATGC TACCGGCAGA GGTGCTTAAA 
CAGGCTCAAC AGGAACTGCG CGACTGGAAC GGTCTTGGTA CGTCGGTGAT GGAAGTGAGT
CACCGGGGCA AAGAGTTCAT TCAGGTTGCA GAGGAAGCCG AGAAGGATTT TCGCGATCTT
CTTAATGTCC CCTCCAACTA CAAGGTATTG TTCTGCCATG GCGGCGGTCG CGGTCAGTTT
GCTGCTGTTC CGTTGAATAT TCTCGGTGAT AAAACCACTG CAGATTATGT TGATGCCGGT
TACTGGGCGG CAAGTGCCAT TAAAGAAGCG AAAAAATACT GCACGCCTAA TGTCTTTGAC
GCCAAAGTGA CTGTTGATGG TCTGCGCGCG GTTAAGCCAA TGCGCGAATG GCAACTCTCT
GATAATGCTG CTTATATGCA TTATTGCCCG AATGAAACCA TCGACGGTAT CGCCATCGAC
GAAACGCCAG ACTTCGGCAA AGATGTGGTG GTCGCCGCCG ACTTCTCTTC AACCATTCTT
TCCCGTCCGA TTGACGTCAG CCGTTATGGC GTGATTTACG CTGGCGCGCA GAAAAATATC
GGCCCGGCAG GCCTGACAAT CGTCATCGTT CGTGAAGATT TGCTGGGCAA AGCGAATATC
GCGTGTCCGT CGATTCTGGA TTATTCCATC CTCAACGATA ACGGCTCCAT GTTTAACACG
CCGCCGACAT TTGCCTGGTA TCTGTCTGGT CTGGTCTTTA AATGGCTGAA AGCGAACGGC
GGTGTAGCTG AAATGGATAA AATCAATCAG CAAAAAGCAG AACTGCTGTA TGGGGTGATT
GATAACAGCG ATTTCTACCG CAATGACGTG GCGAAAGCTA ACCGTTCGCG GATGAACGTG
CCGTTCCAGT TGGCGGACAG TGCGCTTGAC AAATTGTTCC TTGAAGAGTC TTTTGCTGCT
GGCCTTCATG CGCTGAAAGG TCACCGTGTG GTCGGCGGAA TGCGTGCGTC TATTTATAAC
GCCATGCCAC TGGAAGGCGT TAAAGCGCTG ACAGACTTCA TGGTTGAGTT CGAACGCCGT
CACGGCTAA
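
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage. It assumes GC content means the share of G and C among all bases, which may differ from how the database derives its rounded figure. The helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence above: ATGGCTCAAA has 4 G/C out of 10 bases.
print(gc_content("ATGGCTCAAA"))  # 40.0
```

Passing the full gene sequence as one multi-line string gives the value for the whole gene.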
 
Protein sequence
MAQIFNFSSG PAMLPAEVLK QAQQELRDWN GLGTSVMEVS HRGKEFIQVA EEAEKDFRDL 
LNVPSNYKVL FCHGGGRGQF AAVPLNILGD KTTADYVDAG YWAASAIKEA KKYCTPNVFD
AKVTVDGLRA VKPMREWQLS DNAAYMHYCP NETIDGIAID ETPDFGKDVV VAADFSSTIL
SRPIDVSRYG VIYAGAQKNI GPAGLTIVIV REDLLGKANI ACPSILDYSI LNDNGSMFNT
PPTFAWYLSG LVFKWLKANG GVAEMDKINQ QKAELLYGVI DNSDFYRNDV AKANRSRMNV
PFQLADSALD KLFLEESFAA GLHALKGHRV VGGMRASIYN AMPLEGVKAL TDFMVEFERR
HG
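
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCTCAAA TCTTCAATTT TAGTTCTGGT"))  # MAQIFNFSSG
print(translate("CACGGCTAA"))                         # HG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.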