Gene EcolC_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3821 
Symbol 
ID6065981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4177451 
End bp4178515 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content52% 
IMG OID641603233 
Productputative L-ascorbate 6-phosphate lactonase 
Protein accessionYP_001726752 
Protein GI170021798 
COG category[R] General function prediction only 
COG ID[COG2220] Predicted Zn-dependent hydrolases of the beta-lactamase fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00194518 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAG TGAAAAGTAT CACCCGTGAA TCCTGGATCC TGAGCACTTT CCCGGAGTGG 
GGTAGCTGGT TGAATGAAGA AATTGAACAA GAACAGGTCG CTCCTGGCAC ATTTGCGATG
TGGTGGCTTG GCTGCACCGG GATCTGGTTG AAATCGGAAG GTGGCACCAA CGTTTGCGTT
GATTTCTGGT GCGGCACTGG CAAACAAAGT CACGGTAACC CGTTAATGAA ACAGGGTCAC
CAGATGCAGC GCATGGCTGG CGTGAAAAAA CTGCAGCCAA ACCTGCGTAC CACCCCGTTT
GTTCTTGATC CGTTTGCGAT TCGCCAGATC GACGCGGTAC TGGCGACTCA CGATCACAAC
GATCATATCG ACGTTAACGT CGCTGCTGCC GTGATGCAGA ACTGTGCTGA TGACGTACCG
TTTATCGGAC CGAAAACCTG TGTGGATTTG TGGATTGGCT GGGGCGTACC GAAAGAGCGT
TGCATCGTGG TCAAACCGGG CGATGTAGTA AAAGTGAAAG ACATTGAAAT TCATGCGCTT
GATGCTTTCG ACCGTACTGC ACTGATCACC CTGCCTGCCG ATCAAAAAGC GGCTGGCGTA
CTGCCAGATG GCATGGACGA TCGCGCGGTG AACTACCTGT TCAAAACGCC TGGCGGCTCC
CTGTATCACA GCGGCGACTC CCACTACTCT AACTACTATG CAAAACATGG TAATGAGCAT
CAGATCGACG TGGCTTTAGG TTCATACGGC GAAAATCCGC GTGGTATCAC CGACAAAATG
ACCAGCGCCG ATATGCTGCG TATGGGTGAA GCGCTGAATG CGAAAGTAGT GATCCCGTTC
CACCACGATA TCTGGTCAAA CTTCCAGGCC GATCCGCAAG AGATCCGCGT GCTGTGGGAG
ATGAAAAAAG ATCGCCTGAA GTATGGCTTC AAGCCGTTTA TCTGGCAGGT TGGCGGCAAA
TTTACCTGGC CGTTGGATAA AGACAACTTC GAGTACCACT ATCCGCGTGG TTTCGATGAT
TGCTTCACTA TTGAACCGGA TCTGCCGTTC AAGTCATTCC TGTAA
 
Protein sequence
MSKVKSITRE SWILSTFPEW GSWLNEEIEQ EQVAPGTFAM WWLGCTGIWL KSEGGTNVCV 
DFWCGTGKQS HGNPLMKQGH QMQRMAGVKK LQPNLRTTPF VLDPFAIRQI DAVLATHDHN
DHIDVNVAAA VMQNCADDVP FIGPKTCVDL WIGWGVPKER CIVVKPGDVV KVKDIEIHAL
DAFDRTALIT LPADQKAAGV LPDGMDDRAV NYLFKTPGGS LYHSGDSHYS NYYAKHGNEH
QIDVALGSYG ENPRGITDKM TSADMLRMGE ALNAKVVIPF HHDIWSNFQA DPQEIRVLWE
MKKDRLKYGF KPFIWQVGGK FTWPLDKDNF EYHYPRGFDD CFTIEPDLPF KSFL