Gene EcolC_3890 details

Gene Information

Locus tag: EcolC_3890
Symbol:
ID: 6064358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4268277
End bp: 4269380
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 39%
IMG OID: 641603304
Product: hypothetical protein
Protein accession: YP_001726819
Protein GI: 170021865
COG category:
COG ID:
TIGRFAM ID:
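
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 4268277
END_BP = 4269380


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1104 367
```

Under these assumptions both results match the Gene Length and Protein Length fields listed above.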


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT CTATCGACAT TTCAGAACTT ATTCAATTAG GGAAGAAAAT GTTACCAGAA 
GGAGTCGATT TTTTTCTGGA TGAATCCCCT ATTGACTTTG ATCCTATAGA TATTGAGTTA
TCCACGGGTA AAGAAGTTAG TATCGAAGAT CTTGACCCTG GTAGCGGGCT TATCTCTTAT
CATGGCCGCC AGGTTCTTTT ATATATTCGG GACCATTCAG GGCGTTATGA TGCGGCTATC
GTAGATGGCG AAAAAGGAAA ACGTTTTCAT ATTGCCTGGT GCAGAACTCT TGATGAAATG
CGCCATAAAA ATCGATTTGA AAGGTATCAT GCAACTAACC GCATAGATGG TTTATTCGAA
ATTGATGATG GTTCAGGTCG GAGCCAGGAT GTTGATTTAC GGGTATGTAT GAATTGCCTC
GAACGACTTA ATTATAAAGG AAGTATTGAT AAACAACGAA AAAGAGAGAT TTTTAAATCA
TTCTCATTAA ATGAGTTTTT TTCAGATTAT AGTACCTGTT TTCGTCATAT GCCTAAGGGT
ATCTATGACA AAACAAATAG TGGGTATGTC GAAAACTGGA AGGAAATATC TAAAGAAATA
CGAGAAAAGG CAAATTATGT TTGTAATGAT TGTGGCGTGA ATTTATCAAC CGCCAAAAAC
TTGTGCCATG TCCATCATAA AAATGGCATC AAATATGATA ATCACCATGA AAACCTTCTT
GTTCTGTGCA AGGATTGCCA TCGAAAACAG CCCCTCCACG AAGGTATATT CGTTACCCAA
GCAGAGATGG CTATCATTCA ACGTTTACGT TCCCAACAAG GGTTATTAAA AGCAGAATCC
TGGAATGAAA TATATGACCT GACTGATCCA TCAGTGCATG GTGATATTAA TATGATGCAA
CATAAAGGCT TTCAACCTCC TGTTCCTGGG TTAGATCTTC AAAACTCAGA ACATGAAATT
ATTGCAACCG TAGAAGCTGC ATGGCCAGGC CTTAAAATTG CAGTTAACCT TACTCCCGCC
GAAGTCGAAG GATGGAGAAT ATATACCGTG GGTGAGCTGG TTAAAGAAAT ACAAACCGGA
GCCTTTACGC CAGCAAAATT GTAA
 
Protein sequence
MKLSIDISEL IQLGKKMLPE GVDFFLDESP IDFDPIDIEL STGKEVSIED LDPGSGLISY 
HGRQVLLYIR DHSGRYDAAI VDGEKGKRFH IAWCRTLDEM RHKNRFERYH ATNRIDGLFE
IDDGSGRSQD VDLRVCMNCL ERLNYKGSID KQRKREIFKS FSLNEFFSDY STCFRHMPKG
IYDKTNSGYV ENWKEISKEI REKANYVCND CGVNLSTAKN LCHVHHKNGI KYDNHHENLL
VLCKDCHRKQ PLHEGIFVTQ AEMAIIQRLR SQQGLLKAES WNEIYDLTDP SVHGDINMMQ
HKGFQPPVPG LDLQNSEHEI IATVEAAWPG LKIAVNLTPA EVEGWRIYTV GELVKEIQTG
AFTPAKL