Gene EcE24377A_1182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1182 
SymbolsolA 
ID5588901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1196932 
End bp1198050 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content53% 
IMG OID640924881 
ProductN-methyltryptophan oxidase 
Protein accessionYP_001462293 
Protein GI157155462 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT 
GCAACCCGCG CCGGTTTAAA CGTGCTAATG ACCGACGCCC ATATGCCACC GCATCAACAC
GGCAGCCACC ACGGCGATAC GCGATTAATT CGCCATGCTT ATGGTGAAGG CGAAAAGTAT
GTCCCGCTGG TCCTCCGCGC GCAAACGCTG TGGGATGAAC TTTCCCGCCA CAACGAAGAT
GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTCGGCC CGGCTGACTC CGCATTTCTC
GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TCGAAAAGCT CGACGCGCAA
GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG
ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG
GAAGCGGGCT GTGCGCAGCT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT
GGCGTAACTA TTGAAACGGC TGACGGAGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG
GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCCGTACG CAAAGTATTT
GCCTGGTATC AGGCCGATGG TCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT
GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT
GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA
GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT
TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC
GATACCCTAC CCGACCACGA TAATACACTG TTCATTACCG GCCTGAGTGG GCACGGTTTT
AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT
TTTGATTTAA CGCCATTCAG GCTTTCCCGC TTCCAATAA
 
Protein sequence
MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY 
VPLVLRAQTL WDELSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ
GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD
GVTIETADGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG
ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG
CCLYGAACTY DNSPDEDFII DTLPDHDNTL FITGLSGHGF KFASVLGEIA ADFAQDKKSD
FDLTPFRLSR FQ