Gene EcolC_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3888 
Symbol 
ID6064356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4265907 
End bp4267559 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content38% 
IMG OID641603302 
Producthypothetical protein 
Protein accessionYP_001726817 
Protein GI170021863 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGCGC AGCTTTTTGA GCAGTTGTTT CAATCGATAG ACTCTACACT GATCACCAAT 
ATTTTCATCT GGGCTGTTAT ATTCGTATTT TTATCAGCGT GGTGGTGTGA CAAAAAAAAT
ATACATAGTA AGTTTAGAGA ATATGCTCCA ACCTTAATGG GGGCATTAGG TATTCTGGGT
ACTTTCATTG GTATTATTAT TGGTTTACTC AATTTTAATA CCGAAAGTAT TGATACCAGC
ATCCCCGTAT TATTAGGTGG CCTAAAAACA GCATTCATTA CAAGCATTGT AGGTATGTTT
TTTGCCATTT TATTTAATGG AATGGATGCT TTCTTTTTTG CCAATAAACG AAGTGCGTTA
GCTGAAAATA ACCCTGAATC TGTTACACCT GAACATATCT ATCATGAATT AAAAGAGCAG
AACCAGACTC TGACTAAATT AGTCTCGGGT ATTAACGGTG ATAGTGAAGG TTCTCTTATT
GCTCAAATAA AATTACTACG TACTGAGATT AGCGATTCCT CGCAGGCACA ATTAGCTAAT
CACACTCATT TCAGTAATAA GCTTTGGGAA CAACTTGAAC AATTTGCAGA TCTAATGGCA
AAAGGTGCTA CAGAACAAAT TATTGATGCT TTGCGACAAG TCATTATTGA TTTTAATGAA
AATTTAACTG AACAGTTTGG TGAAAACTTT AAAGCTCTTG ATGCCTCTGT AAAAAAACTT
GTTGAGTGGC AGGGAAATTA TAAAACGCAA ATTGAGCAGA TGTCAGAACA ATATCAACAA
AGTGTCGAGT CCCTGGTTGA AACAAAAACT GCGGTTGCAG GGATTTGGGA AGAATGTAAA
GAAATTCCTC TGGCTATGTC TGAACTGCGT GAAGTGCTTC AGGTGAACCA ACATCAAATC
AGCGAACTCT CCCGCCATTT AGAAACCTTT GTCGCCATCC GCGATAAAGC TACAACCGTA
TTACCTGAAA TACAGAACAA AATGGCTGAA GTGGGTGAAC TGCTGAAATC CGGAGCTGCA
AATGTTAGTG CATCTCTTGA GCAAACCAGC CAGCAAATAC TTCTTAATGC AGATTCAATG
CGCGTTGCCC TGGATGAAGG TACCGAAGGA TTCAGACAAT CGGTTACCCA AACACAACAA
GCATTTGCCT CGATGGCGCA TGATGTCAGC AATTCCTCCG AAACCCTAAC CAGCACGTTA
GGTGAAACAA TTACTGAAAT GAAACAAAGT GGTGAAGAAT TCCTGAAATC ACTAGAGTCG
CACTCGAAAG AATTGCATAG AAATATGGAA CAAAATACGA CGAATGTGAT TGATATGTTC
AGTAAGACTG GTGAAAAGAT TAACCATCAA CTATCCAGTA ATGCCGATAA TATGTTTGAT
TCAATCCAGA CATCATTTGA TAAGGCTGGT GCAGGGCTGA CTTCTCAAGT CAGAGAATCA
ATTGAAAAAT TTGCTCTATC CATCAACGAG CAGTTACATG CTTTTGAGCA AGCAACTGAA
CGTGAAATGA ACCGTGAAAT GCAATCATTA GGTAATGCTC TGCTTTCAAT CAGCAAAGGT
TTTGTCGGTA ACTATGAAAA ACTTATTAAA GATTACCAAA TAGTTATGGG GCAGTTACAA
GCATTAATTT CTGCTAATAA ACATCGAGGG TAA
 
Protein sequence
MLAQLFEQLF QSIDSTLITN IFIWAVIFVF LSAWWCDKKN IHSKFREYAP TLMGALGILG 
TFIGIIIGLL NFNTESIDTS IPVLLGGLKT AFITSIVGMF FAILFNGMDA FFFANKRSAL
AENNPESVTP EHIYHELKEQ NQTLTKLVSG INGDSEGSLI AQIKLLRTEI SDSSQAQLAN
HTHFSNKLWE QLEQFADLMA KGATEQIIDA LRQVIIDFNE NLTEQFGENF KALDASVKKL
VEWQGNYKTQ IEQMSEQYQQ SVESLVETKT AVAGIWEECK EIPLAMSELR EVLQVNQHQI
SELSRHLETF VAIRDKATTV LPEIQNKMAE VGELLKSGAA NVSASLEQTS QQILLNADSM
RVALDEGTEG FRQSVTQTQQ AFASMAHDVS NSSETLTSTL GETITEMKQS GEEFLKSLES
HSKELHRNME QNTTNVIDMF SKTGEKINHQ LSSNADNMFD SIQTSFDKAG AGLTSQVRES
IEKFALSINE QLHAFEQATE REMNREMQSL GNALLSISKG FVGNYEKLIK DYQIVMGQLQ
ALISANKHRG