Gene EcolC_2247 details


Gene Information

Locus tag: EcolC_2247
Symbol:
ID: 6066948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2469142
End bp: 2470458
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 50%
IMG OID: 641601652
Product: putative dual specificity phosphatase
Protein accession: YP_001725211
Protein GI: 170020257
COG category: [T] Signal transduction mechanisms
COG ID: [COG2453] Predicted protein-tyrosine phosphatase
TIGRFAM ID:
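
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2469142
END_BP = 2470458
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))           # 1317
print(expected_protein_length(GENE_LENGTH_BP))  # 438
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.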


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATAACTG AACGTCGGAA CGTATTGCTA CAAGGCGCTG GCTGGTTATT GTTGCTGGCC 
CCGTTTTTCT TCTTCACCTA TGGATCTCTT AATCAGTTCA CCGCGGTTCA GGACCTTAAC
AGCCATGATA TTCCCAGTCA GGTATTCGGT TGGGAAACGG CGATCCCTTT TCTTCCCTGG
ACTATTGTTC CTTACTGGAG TCTGGATCTT TTATATGGAT TTTCGCTGTT CGTTTGTAGC
ACGACATTCG AACAGCGCCG ACTTGTCCAC CGGCTTATTC TGGCAACGGT AATGGCCTGC
TGCGGTTTTT TTCTCTACCC GCTGAAGTTT AGTTTTATCC GTCCTGAAGT GAGTGGGGTG
ACAGGATGGC TATTTTCGCA ACTTGAACTG TTTGATCTGC CTTATAACCA GTCTCCTTCG
CTGCATATTA TTCTCTGCTG GCTACTTTGG CGTCACTTTC GTCAGCATCT GGCTGTGAGG
TGGCGTAAAG TCTGCGGCGG ATGGTTTTTA CTCATCGCCA TTTCGACGCT GACGACCTGG
CAGCATCATT TTATTGATGT CATCACGGGG CTGGCGGTAG GTATGTTAAT TGACTGGATG
GTGCCCGTCG ACCGTCGTTG GAATTATCAG AAACCTGATC AACGTCGAAT CAAAATAGCA
CTGCCATATG TCGTAGGCGC GGGCTCGTGC ATTGTGTTGA TGGAGCTAAT GATAATGCTT
CAGTTATGGT GGTCAGTCTG GTTATGTTGG CCAGTATTAT CGCTATTCAT CATTGGCCGT
GGGTACGGTG GGCTTGGCGC GATAACAACA GGGAAAGATA GTCAGGGGAA ACTCCCGCCC
GCCGTTTACT GGCTGACATT GCCCTGGCGT ATCGGGATGT GGCTGTCTAT GCGTTGGTCT
TGTCTTCGCC TGGAGCCGGT GAGCAAAATT ACTGCTGGTG TTTATTTAGG GGCGTTTCCA
CGACATATTC CGGCACAGAA TGCGGTTCTG GACGTCACCT TTGAATTCCC TCGCGGACGA
GCCACAAAAG ATCGACTCTA TTTTTGTGTA CCGATGCTGG ATCTGGTGGT TCCGGAAGAG
GGGGAGCTCC GACAGGCCGT GGCGATGCTG GAAACATTAC GCGAAGAGCA AGGCAGCGTT
CTGGTCCATT GTGCATTGGG ATTATCGCGC AGTGCGCTGG TGGTGGCGGC ATGGTTGTTA
TGTTACGGAC ACTGTAAAAC CGTTAATGAA GCGATTAGCT ATATTCGAGC CAGACGCCCG
CAGATTGTGC TGACAGACGA GCACAAAGCG ATGCTGAGAT TATGGGAAAA CAGGTAA
 
Protein sequence
MITERRNVLL QGAGWLLLLA PFFFFTYGSL NQFTAVQDLN SHDIPSQVFG WETAIPFLPW 
TIVPYWSLDL LYGFSLFVCS TTFEQRRLVH RLILATVMAC CGFFLYPLKF SFIRPEVSGV
TGWLFSQLEL FDLPYNQSPS LHIILCWLLW RHFRQHLAVR WRKVCGGWFL LIAISTLTTW
QHHFIDVITG LAVGMLIDWM VPVDRRWNYQ KPDQRRIKIA LPYVVGAGSC IVLMELMIML
QLWWSVWLCW PVLSLFIIGR GYGGLGAITT GKDSQGKLPP AVYWLTLPWR IGMWLSMRWS
CLRLEPVSKI TAGVYLGAFP RHIPAQNAVL DVTFEFPRGR ATKDRLYFCV PMLDLVVPEE
GELRQAVAML ETLREEQGSV LVHCALGLSR SALVVAAWLL CYGHCKTVNE AISYIRARRP
QIVLTDEHKA MLRLWENR
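
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It relies on two points: translation table 11 uses the same amino acid assignments as the standard genetic code, and an alternative start codon such as GTG is read as methionine when it initiates the CDS. The name `translate_cds` and the reduced `START_CODONS` set are assumptions made for this example; table 11 permits more initiation codons than the three listed here.

```python
# Standard amino acid assignments (shared by translation table 11),
# laid out in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Simplification for this sketch: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate_cds("GTGATAACTG AACGTCGGAA"))  # MITERR
```

The output matches the first six residues of the protein sequence listed in the record. The whitespace handling lets the 10-base blocks be pasted in unchanged.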