Gene EcolC_0170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0170 
Symbol 
ID6068246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp184179 
End bp185870 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content50% 
IMG OID641599570 
Productphosphoethanolamine transferase 
Protein accessionYP_001723179 
Protein GI170018225 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACA TCAAATCGAT TACACAGCAG AAGCTGAGCT TTTTGCTTGC AATCTATATT 
GGCCTTTTTA TGAATGGCGC GGTTTTTTAC CGCCGCTTCG GCAGCTATGC GCACGATTTT
ACCGTCTGGA AAGGCATTTC TGCTGTTGTT GAACTGGCCG CCACCGTACT GGTGACCTTC
TTTTTACTAC GTCTTCTTTC GCTGTTTGGC CGCCGCAGCT GGCGTATTCT GGCATCGCTG
GTGGTGCTCT TTTCCGCAGG TGCCAGCTAT TACATGACCT TCCTTAATGT GGTCATTGGT
TATGGCATCA TCGCTTCCGT CATGACCACC GATATCGACC TGTCAAAAGA AGTTGTTGGT
CTGAACTTTA TTCTCTGGTT AATCGCCGTT AGTGCATTGC CTCTTATCCT TATCTGGAAT
AACCGCTGTC GCTACACCTT GCTCCGACAA CTGCGAACCC CAGGGCAGCG TATTCGCAGC
CTGGCGGTCG TCGTACTGGC GGGTATTATG GTTTGGGCAC CGATTCGTTT GCTGGATATC
CAGCAGAAGA AAGTGGAGAG GGCGACCGGC GTTGATTTGC CGAGTTATGG CGGTGTCGTA
GCGAACTCTT ATCTGCCATC AAACTGGCTT TCTGCGTTGG GGCTGTATGC CTGGGCGCGG
GTCGATGAAT CTTCCGATAA TAATTCATTG CTTAATCCGG CGAAGAAATT CACTTATCAG
GCACCGCAAA ACGTTGATGA CACTTATGTC GTGTTTATCA TCGGTGAAAC CACGCGTTGG
GACCATATGG GTATTTTCGG CTATGAGCGT AATACCACGC CGAAACTGGC CCAGGAGAAA
AATCTGGCGG CGTTCCGTGG TTACTCCTGT GATACCGCAA CCAAACTCTC ACTGCGTTGC
ATGTTTGTAC GTCAGGGGGG CGCGGAAGAT AATCCGCAGC GCACATTAAA AGAACAGAAC
ATTTTCGCGG TTCTGAAGCA GTTAGGATTC AGTTCTGACC TCTACGCTAT GCAGAGCGAA
ATGTGGTTCT ACAGCAACAC GATGGCGGAC AACATTGCTT ATCGTGAGCA GATTGGTGCG
GAGCCACGTA ATCGTGGCAA GCCGGTAGAT GATATGTTGC TGGTAGACGA AATGCAGCAA
TCGCTAGGGC GCAACCCGGA TGGTAAGCAT CTGATCATTC TGCATACCAA AGGTTCGCAT
TTTAACTACA CCCAGCGTTA TCCGCGTAGC TTCGCGCAGT GGAAGCCGGA ATGTATTGGT
GTTGATAGCG GCTGTACCAA AGCGCAGATG ATCAACTCCT ATGACAACTC GGTGACCTAT
GTGGATCACT TTATCTCCAG CGTGATTGAT CAGGTTCGCG ATAAGAAAGC GATTGTGTTC
TACGCAGCTG ACCACGGTGA GTCAATTAAT GAACGCGAGC ACCTGCACGG CACGCCGCGT
GAACTGGCAC CGCCGGAGCA GTTCCGCGTA CCGATGATGG TCTGGATGTC AGATAAATAT
CTGGAAAATC CGGCCAATGC GCAGGCGTTT GCGCAGCTGA AAAAAGAAGC CGACATGAAA
GTGCCACGCC GTCACGTAGA GCTGTACGAT ACCATCATGG GTTGTCTTGG CTATACTTCA
CCGGATGGTG GAATTAACGA AAACAACAAC TGGTGTCACA TCCCGCAGGC AAAAGAGGCA
GCGGCTAACT AA
 
Protein sequence
MRYIKSITQQ KLSFLLAIYI GLFMNGAVFY RRFGSYAHDF TVWKGISAVV ELAATVLVTF 
FLLRLLSLFG RRSWRILASL VVLFSAGASY YMTFLNVVIG YGIIASVMTT DIDLSKEVVG
LNFILWLIAV SALPLILIWN NRCRYTLLRQ LRTPGQRIRS LAVVVLAGIM VWAPIRLLDI
QQKKVERATG VDLPSYGGVV ANSYLPSNWL SALGLYAWAR VDESSDNNSL LNPAKKFTYQ
APQNVDDTYV VFIIGETTRW DHMGIFGYER NTTPKLAQEK NLAAFRGYSC DTATKLSLRC
MFVRQGGAED NPQRTLKEQN IFAVLKQLGF SSDLYAMQSE MWFYSNTMAD NIAYREQIGA
EPRNRGKPVD DMLLVDEMQQ SLGRNPDGKH LIILHTKGSH FNYTQRYPRS FAQWKPECIG
VDSGCTKAQM INSYDNSVTY VDHFISSVID QVRDKKAIVF YAADHGESIN EREHLHGTPR
ELAPPEQFRV PMMVWMSDKY LENPANAQAF AQLKKEADMK VPRRHVELYD TIMGCLGYTS
PDGGINENNN WCHIPQAKEA AAN