Gene EcolC_0052 details

Gene Information

Locus tag: EcolC_0052
Symbol:
ID: 6068439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 52323
End bp: 54089
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 31%
IMG OID: 641599455
Product: hypothetical protein
Protein accession: YP_001723065
Protein GI: 170018111
COG category:
COG ID:
TIGRFAM ID:
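
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which is not translated.
    return length_bp // 3 - 1


length = gene_length_bp(52323, 54089)
print(length, protein_length_aa(length))
```

With the values from this record, the computed gene length is 1767 bp and the protein length is 588 aa, matching the listed fields.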


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAAG GTACTATAGT ATCTATATTG ATTCGTATAT TGAAAGAGAA ACGTGATGGT 
CTGATATTAA TAAATGGGGA ATGGGGTGTA GGTAAGACAT ATTTCCTTCG AACTGAATTT
AGAACACTTT ATTCAGATAC GAGTCATTTT TATTTGTCTG TTCTTGGGTT AAACAGTTTA
CAAGATTTTA AAGATAGAAT GCTAAGCATA ACGTATCTAA ACACCCCTTC AGAGATAAAA
AAACTTGGGG ATTTAACCTC AAGCGCTGCA TCAGCATTAA CCCAAGACGA AAGCACTGGA
AAGTTGACAG AACAAATTAT ATCTACCATT TCAGGTGCAA TGAGAGATTA TGTACTTAAG
GATCTTTCAG GGGTTTTTGT CATTGATGAT CTAGAAAGAA TCCCTCAATC TTTGAGAGAT
GAAATAGCAA CCTTTTGCCT ACAAAGTTAT CAAAATGATA ATCGGTTAGA CTTTATTTTA
GTGGGTAACT TTTCAAAGCA GAGTAGTGAG GTATTAAGTC ATAAAGAAAA AGTTGTAAGC
GACGAAATAT ATTTCTCTAT TAATAACCTT ACCGATATAT TAGAGCAAAA ACTGGCTCCA
TTAGAAGAGA GACATAAATA TTTAATCACT CAGGTTATTA TTGGGTTCGA AGAAACAAAC
CTACGAATTA TTAATAGAGT AATTTCAAAA TTGACACCTC TTTTTGAAAA ACAAGAACCT
GAGCAAAAAA TCTCTGATAT AGACATCAAA AACCTAGTCA CTTCACTTTG TGCTCATATA
ATACTAAAAG AGAAATTTTC ATATCAAGAG AATGATTTTC ATCATAATTA TATCACATCT
TCTTTCAAAA CACTTACGAC TTCATCTGAG AATGATCCAG ATAAAATAAG CGAAGAAGAG
AGTAACCTTT TAAATATCAC TGCTCATATG ACTTATAACA ACTTAATGGT TCCATATTGT
TTTAATGAGA TATCTCAAAA GGATATAATT CCATACATAT TCAATTCACA AGAACCTTTA
AAAAAAAGTG ATTATGCCAC ATTAAAACAA CCGGAATGGT ATAATATACC TGAAAATGAT
TATTTGGATG AAATTAAAAA AGTAATACTA AAAACTTCAT CACCGACACT ATCTACTTGG
CTAATCGCTA CAAACAACTA TATTAGACTC TCAAAATCAA AATACATCCC TCGCATAAGA
GGGTTAACCA ACAAAACCAT TGAAAAAAAC AAACGTAGCT TTAGTAATAA AGAAATAAAA
GAATATTTCC TAGAATCAAA TCCCAATATT GATAATATTC CACCACATAT ATTAAGGAGA
GAAGGAAATG AACTTCATAA TTACTTCCTT GATAAATATT GCGATATAAT AAAGGAGGAG
AAAATAAAAG AATTGAAAGA AAAAATGAAT GTTAACGGTT GGAGTGCTAT TGATATGGAT
ATTTATCAAT CAAAATTCAA ATTTAATCCA CTTGAAACAT TAGATGTAAA CCTAATCATA
CATGGAATAA AAAACACTTG GTCCATTCGC GATATTCAGT TGTTTTCAAA CCATCTATCA
TCACTCTATA ACTTCTCAAA TCTTGCGGAC TACCTTTCTG CTGAACTACC ATACCTTAAG
AAGCTACATT CAGCCATAAA CGCTCATCAT AAAAAAATTA ACAGTTCTTT TCGACGCGGA
GCCATAATTG AACTAACAGA ATGCGTTAAA CGCATAAAAG AAGCTTTAGA ACAAAGCATC
GCTTTAAAAG AAGACGCATC GCAATAA
 
Protein sequence
MTKGTIVSIL IRILKEKRDG LILINGEWGV GKTYFLRTEF RTLYSDTSHF YLSVLGLNSL 
QDFKDRMLSI TYLNTPSEIK KLGDLTSSAA SALTQDESTG KLTEQIISTI SGAMRDYVLK
DLSGVFVIDD LERIPQSLRD EIATFCLQSY QNDNRLDFIL VGNFSKQSSE VLSHKEKVVS
DEIYFSINNL TDILEQKLAP LEERHKYLIT QVIIGFEETN LRIINRVISK LTPLFEKQEP
EQKISDIDIK NLVTSLCAHI ILKEKFSYQE NDFHHNYITS SFKTLTTSSE NDPDKISEEE
SNLLNITAHM TYNNLMVPYC FNEISQKDII PYIFNSQEPL KKSDYATLKQ PEWYNIPEND
YLDEIKKVIL KTSSPTLSTW LIATNNYIRL SKSKYIPRIR GLTNKTIEKN KRSFSNKEIK
EYFLESNPNI DNIPPHILRR EGNELHNYFL DKYCDIIKEE KIKELKEKMN VNGWSAIDMD
IYQSKFKFNP LETLDVNLII HGIKNTWSIR DIQLFSNHLS SLYNFSNLAD YLSAELPYLK
KLHSAINAHH KKINSSFRRG AIIELTECVK RIKEALEQSI ALKEDASQ
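
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The following is a minimal sketch of how the two listings relate, assuming the codon assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ only in the permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGACCAAAG GTACTATAGT ATCTATATTG"))
```

Applied to the first three blocks of the gene sequence, this yields MTKGTIVSIL, the first block of the protein sequence. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.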