Gene EcolC_2583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2583 
Symbol 
ID6065458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2835778 
End bp2836869 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content55% 
IMG OID641601990 
Productluciferase family protein 
Protein accessionYP_001725541 
Protein GI170020587 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03612] pyrimidine utilization protein A 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.126668 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG GCGTATTCGT ACCTATTGGC AACAACGGCT GGCTCATTTC GACCCACGCG 
CCGCAGTACA TGCCGACCTT TGAACTGAAT AAAGCCATCG TGCAAAAAGC GGAGCACTAC
CATTTCGATT TCGCCCTGTC GATGATCAAA CTGCGTGGCT TTGGCGGCAA AACTGAGTTC
TGGGATCACA ACCTTGAGTC GTTCACCTTG ATGGCGGGGC TGGCGGCCGT GACCTCGCGC
ATTCAGATTT ACGCCACCGC CGCCACCTTA ACGTTACCTC CAGCAATCGT CGCCCGTATG
GCCGCAACCA TCGACTCAAT CTCTGGCGGG CGTTTTGGCG TCAACCTCGT GACTGGCTGG
CAAAAGCCCG AGTATGAGCA GATGGGTATC TGGCCTGGCG ATGACTATTT CTCCCGTCGT
TACGACTATC TCACCGAGTA TGTTCAGGTG CTGCGCGACC TGTGGGGCAC GGGAAAAAGC
GATTTTAAAG GCGATTTTTT CACCATGAAT GATTGTCGCG TCAGTCCGCA ACCGAGTGTC
CCTATGAAAG TGATCTGCGC CGGGCAAAGC GACGCTGGCA TGGCGTTCTC CGCCCAGTAT
GCCGATTTCA ACTTCTGTTT CGGCAAAGGC GTAAATACAC CCACGGCTTT CGCCCCGACC
GCTGCGCGGA TGAAACAGGC CGCAGAGCAA ACCGGGCGCG ACGTTGGCTC TTATGTATTG
TTTATGGTGA TTGCCGATGA AACCGACGAT GCCGCTCGCG CCAAATGGGA ACACTACAAA
GCGGGCGCGG ATGAAGAGGC GTTAAGCTGG CTAACCGAAC AAAGTCAGAA AGATACCCGC
TCAGGTACTG ACACCAACGT CCGTCAGATG GCCGATCCCA CTTCGGCGGT AAACATCAAT
ATGGGGACGT TAGTCGGTTC TTACGCCAGT GTCGCGCGCA TGTTAGATGA AGTCGCAAGC
GTGCCTGGTG CCGAAGGCGT GCTGTTAACC TTCGACGATT TTCTGTCGGG AATCGAAACC
TTCGGCGAGC GCATTCAACC ACTGATGCAG TGCCGCGCCC ATCTCCCTGT GCTGACTCAG
GAGGTGGCAT GA
 
Protein sequence
MKIGVFVPIG NNGWLISTHA PQYMPTFELN KAIVQKAEHY HFDFALSMIK LRGFGGKTEF 
WDHNLESFTL MAGLAAVTSR IQIYATAATL TLPPAIVARM AATIDSISGG RFGVNLVTGW
QKPEYEQMGI WPGDDYFSRR YDYLTEYVQV LRDLWGTGKS DFKGDFFTMN DCRVSPQPSV
PMKVICAGQS DAGMAFSAQY ADFNFCFGKG VNTPTAFAPT AARMKQAAEQ TGRDVGSYVL
FMVIADETDD AARAKWEHYK AGADEEALSW LTEQSQKDTR SGTDTNVRQM ADPTSAVNIN
MGTLVGSYAS VARMLDEVAS VPGAEGVLLT FDDFLSGIET FGERIQPLMQ CRAHLPVLTQ
EVA