Gene EcolC_0836 details

Gene Information

Locus tag: EcolC_0836
Symbol:
ID: 6066008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 902768
End bp: 903979
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 52%
IMG OID: 641600241
Product: peptidase
Protein accession: YP_001723835
Protein GI: 170018881
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR03320] M20/DapE family protein YgeY
            [TIGR03526] putative selenium metabolism hydrolase
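
The coordinate and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both readings are assumptions, since the record does not state its conventions. A minimal check in Python, using only the values listed above:

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 902768
end_bp = 903979

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1212 403"
```

Under these assumptions the result matches the listed Gene Length (1212 bp) and Protein Length (403 aa).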


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGA ATATTCCATT CAAACTGATT CTTGAAAAAG CAAAAGATTA CCAGGCGGAC 
ATGACTCGCT TCCTGCGCGA CATGGTTGCT ATTCCCAGTG AAAGCTGCGA CGAAAAACGC
GTAGTACATC GTATTAAAGA AGAGATGGAA AAAGTCGGCT TCGATAAAGT TGAAATCGAC
CCGATGGGCA ACGTTCTCGG TTATATCGGC CACGGCCCGC GTCTGGTGGC AATGGACGCT
CATATCGATA CCGTCGGCAT TGGCAACATC AAAAACTGGG ACTTCGATCC GTACGAAGGC
ATGGAAACTG ATGAGCTAAT CGGTGGTCGC GGTACTTCCG ACCAGGAAGG CGGCATGGCA
TCTATGGTTT ATGCCGGTAA AATCATTAAA GACCTCGGTC TGGAAGATGA ATATACCCTG
CTGGTTACCG GTACTGTGCA GGAAGAAGAC TGCGACGGTC TGTGCTGGCA GTACATTATT
GAACAATCCG GCATTCGCCC GGAATTTGTG GTCAGTACCG AACCAACCGA CTGCCAGGTA
TACCGTGGTC AGCGCGGTCG TATGGAAATT CGTATTGATG TTCAGGGTGT TAGCTGCCAC
GGTTCTGCGC CAGAACGCGG TGACAACGCC ATTTTCAAAA TGGGTCCGAT TCTTGGCGAA
TTACAAGAAC TCTCCCAACG TCTGGGTTAT GACGAATTCC TCGGCAAAGG CACCCTCACC
GTTTCTGAAA TCTTCTTCAC ATCCCCAAGC CGTTGCGCTG TAGCAGATAG CTGCGCCGTC
TCTATTGACC GCCGTCTGAC CTGGGGCGAA ACCTGGGAAG GCGCGCTGGA CGAAATCCGC
GCCCTGCCTG CAGTACAGAA AGCTAACGCG GTTGTTTCTA TGTACAACTA CGACCGTCCG
TCCTGGACTG GCCTGGTTTA CCCAACCGAA TGCTACTTCC CGACCTGGAA AGTGGAAGAA
GATCACTTCA CCGTTAAAGC ACTGGTGAAT GCCTACGAAG GGCTGTTTGG CAAAGCGCCG
GTTGTTGATA AGTGGACCTT CTCAACTAAC GGCGTATCTA TCATGGGCCG TCACGGCATT
CCGGTGATCG GCTTTGGCCC GGGTAAAGAA CCTGAAGCGC ATGCACCTAA CGAAAAAACC
TGGAAATCTC ACCTGGTGAC CTGTGCCGCG ATGTACGCTG CAATCCCGTT AAGCTGGCTG
GCAACCGAAT AA
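
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence is used elsewhere. The sketch below is one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "ATGGCTAAGA ATATTCCATT CAAACTGATT CTTGAAAAAG CAAAAGATTA CCAGGCGGAC"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field of the record.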
 
Protein sequence
MAKNIPFKLI LEKAKDYQAD MTRFLRDMVA IPSESCDEKR VVHRIKEEME KVGFDKVEID 
PMGNVLGYIG HGPRLVAMDA HIDTVGIGNI KNWDFDPYEG METDELIGGR GTSDQEGGMA
SMVYAGKIIK DLGLEDEYTL LVTGTVQEED CDGLCWQYII EQSGIRPEFV VSTEPTDCQV
YRGQRGRMEI RIDVQGVSCH GSAPERGDNA IFKMGPILGE LQELSQRLGY DEFLGKGTLT
VSEIFFTSPS RCAVADSCAV SIDRRLTWGE TWEGALDEIR ALPAVQKANA VVSMYNYDRP
SWTGLVYPTE CYFPTWKVEE DHFTVKALVN AYEGLFGKAP VVDKWTFSTN GVSIMGRHGI
PVIGFGPGKE PEAHAPNEKT WKSHLVTCAA MYAAIPLSWL ATE
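
The protein sequence corresponds to the gene sequence read codon by codon. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons; because this gene begins with ATG, the sketch below ignores alternative starts. The function name `translate` is hypothetical, and the code is an illustration rather than the method used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCTAAGA ATATTCCATT CAAACTGATT"))  # prints "MAKNIPFKLI"
```

The output matches the first block of the protein sequence, and translating the last line of the gene sequence (GCAACCGAAT AA) gives ATE, the final three residues.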