Gene EcSMS35_3005 details


Gene Information

Locus tag: EcSMS35_3005
Symbol:
ID: 6145245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3087745
End bp: 3088956
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 52%
IMG OID: 641617874
Product: peptidase
Protein accession: YP_001745025
Protein GI: 170684131
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR03320] M20/DapE family protein YgeY
            [TIGR03526] putative selenium metabolism hydrolase
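
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only and are not part of the database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3087745
end_bp = 3088956

# An inclusive range spans end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final codon is the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1212
print(protein_length)  # 403
```

Both values agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.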


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGA ATATTCCATT CAAACTGATT CTTGAAAAAG CAAAAGATTA CCAGGCGGAC 
ATGACTCGCT TCCTGCGCGA CATGGTTGCT ATTCCCAGTG AAAGCTGCGA CGAAAAACGC
GTAGTACATC GTATTAAAGA AGAGATGGAA AAAGTCGGCT TCGATAAAGT TGAAATCGAC
CCGATGGGCA ACGTTCTCGG TTATATCGGC CACGGCCCGC GTCTGGTGGC AATGGACGCT
CATATTGATA CCGTCGGCAT TGGCAACATC AAAAACTGGG ACTTCGATCC GTACGAAGGC
ATGGAAACTG ATGAGCTGAT CGGTGGTCGC GGTACTTCCG ACCAGGAAGG CGGCATGGCA
TCTATGGTTT ATGCCGGTAA AATCATTAAA GACCTCGGTC TGGAAGATGA ATATACCCTG
CTGGTTACCG GTACTGTGCA GGAAGAAGAC TGCGACGGTC TGTGCTGGCA GTACATTATT
GAACAATCCG GCATTCGCCC GGAATTTGTG GTCAGTACCG AACCAACCGA CTGCCAGGTA
TACCGTGGTC AGCGCGGTCG TATGGAAATT CGTATTGATG TTCAGGGTGT TAGCTGCCAC
GGTTCTGCGC CAGAACGCGG TGACAACGCC ATTTTCAAAA TGGGTCCGAT TCTTGGCGAA
TTACAAGAAC TCTCCCAACG TCTGGGTTAT GACGAATTCC TCGGCAAAGG CACCCTCACC
GTTTCTGAAA TCTTCTTCAC ATCCCCAAGC CGTTGCGCTG TAGCAGATAG CTGCGCCGTC
TCTATTGACC GCCGTCTGAC CTGGGGCGAA ACCTGGGAAG GCGCGCTGGA CGAAATCCGC
GCCCTGCCTG CAGTACAGAA AGCTAACGCG GTTGTTTCTA TGTACAACTA CGACCGTCCG
TCCTGGACTG GCCTGGTTTA CCCAACCGAA TGCTACTTCC CGACCTGGAA AGTGGAAGAA
GATCACTTCA CCGTTAAAGC ACTGGTGAAT GCCTACGAAG GTCTGTTTGG CAAAGCGCCG
GTTGTTGATA AGTGGACCTT CTCAACTAAC GGCGTATCTA TCATGGGCCG TCACGGCATT
CCGGTGATCG GCTTTGGCCC AGGTAAAGAA CCTGAAGCGC ATGCACCTAA CGAAAAAACC
TGGAAATCTC ACCTGGTGAC CTGTGCCGCG ATGTACGCGG CAATCCCGTT AAGCTGGCTG
GCAACCGAAT AA
 
Protein sequence
MAKNIPFKLI LEKAKDYQAD MTRFLRDMVA IPSESCDEKR VVHRIKEEME KVGFDKVEID 
PMGNVLGYIG HGPRLVAMDA HIDTVGIGNI KNWDFDPYEG METDELIGGR GTSDQEGGMA
SMVYAGKIIK DLGLEDEYTL LVTGTVQEED CDGLCWQYII EQSGIRPEFV VSTEPTDCQV
YRGQRGRMEI RIDVQGVSCH GSAPERGDNA IFKMGPILGE LQELSQRLGY DEFLGKGTLT
VSEIFFTSPS RCAVADSCAV SIDRRLTWGE TWEGALDEIR ALPAVQKANA VVSMYNYDRP
SWTGLVYPTE CYFPTWKVEE DHFTVKALVN AYEGLFGKAP VVDKWTFSTN GVSIMGRHGI
PVIGFGPGKE PEAHAPNEKT WKSHLVTCAA MYAAIPLSWL ATE
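
The gene and protein sequences above are related through translation table 11, which assigns amino acids to codons exactly as the standard genetic code does (it differs only in the start codons it allows). The following is a minimal sketch of how the two blocks, and the GC content field, could be cross-checked. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the sequence is assumed to be pasted in the 10-base, space-separated layout shown above.

```python
from itertools import product

# Codon table in NCBI "TCAG" ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 15 bases of the gene sequence, as formatted in the record.
print(translate("ATGGCTAAGA ATATT"))  # MAKNI
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared against the GC content field.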