Gene EcolC_3721 details


Gene Information

Locus tag: EcolC_3721
Symbol:
ID: 6065928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4072713
End bp: 4074806
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 44%
IMG OID: 641603138
Product: hypothetical protein
Protein accession: YP_001726658
Protein GI: 170021704
COG category: [S] Function unknown
COG ID: [COG1479] Uncharacterized conserved protein
TIGRFAM ID:
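
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4072713
END_BP = 4074806
GENE_LENGTH_BP = 2094
PROTEIN_LENGTH_AA = 697


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4074806 - 4072713 + 1 gives 2094 bp, and 2094 / 3 - 1 gives 697 aa, matching the reported values.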


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTA CAGAAGCCCG CCTACTCGAT TTTTTAAAAC GTTCGCAGCA GTTTGTTATT 
CCCATTTATC AACGGACTTA TTCATGGACA GAACAACAAT GTCGGCAACT TTGGGACGAC
ATCATTCGTG CCGGAAAGCG TGACGATATA TCAGCGCATT TTATCGGTTC GGTTGTTTAT
ATTGAGCAGG GGTTGTATCA GGTTTCTGGT ATTTCTCCGT TACTGGTCAT TGATGGTCAA
CAACGGCTGA CGACCGCAAT GTTGCTGATT GAGGCTTTAT CGCGCCATCT TGGCGAAGAC
GAAGTTTTTG ATGGCTTTTC AGCAATGAAA TTGCGTAATT ATTATTTGCT CAATCCTTAT
GAGTCCGGCG AGAAAGGTTT TAAATTACTA CTGACCGAGA CTGATAAAGA CAGTTTACTG
GCGTTAATAA AACAAAGACC AATGCCAGAA AACTATTCCC ATCGAATAAT GGAAAACTTT
ACTTTCTTTG ATGAACAAAT TGCCAAACTC GGTGATGACT TGATCCCCTT ATGTCGTGGG
TTAGCAAAGT TATTAATTGT CGATGTGGCG CTTAATCGTG GTCAGGATAA TCCGCAACTG
ATTTTTGAAA GTATGAACTC CACCGGTAAG GCGTTAAGTC AGGCCGATCT GGTGCGCAAT
TTTATTCTGA TGGGCCTCGA ACCAGAGCAT CAAACCCGGT TGTATGAAGA TCACTGGCGT
CCAATGGAAG TCGCCTTTGG TCAGCAAGGT TACAGCGAAT ATTTTGACAG TTTTATGCGT
CATTATCTGA CGGTAAAAAC GGGGGAGATC CCTCGGACAG ATGAAGTCTA TGAGGCATTT
AAACTCCATG CCCGCAGCCA GAGTGTTGCT GAAAAAGGCG TAGATCGGCT GGTTGAAGAT
ATTCATATCT ACGCGGAGTA TTACTGTGCA ATGGCATTGG GAAAAGAAAG TGACAAATCG
CTTGCTACGG CTTTTCAGGA TTTGCGCGAG TTAAAGGTTG ATGTGGCGTA TCCTTTCTTA
CTGGCGTTTT ATCATGACTA TAAAAATGGC GATTTGTCTC ACGAAGATTT TCTGAGCATA
ATTCGTTTAA TTGAATCTTA TGTTTTCCGC CGTGCAGTAT GTGCAATTCC GACGAATTCT
TTGAACAAGA CATTTGCCAC TTTTTATAAA GTTATTAATA AAGAAAATTA TCTGGAAAGT
ATTCAGGTAC ATTTTTTGAA TCTACCTTCA TATCGTCGTT TCCCCAACGA TGATGAATTT
AAACGGGAAT TAAAAGTTCG CGATCTCTAT AACTTCCGTA GTCGCAGCTA CTGGTTACGA
CGACTGGAAA ACGATAAACG CAGAGAGCGC GTGGAAGAGT TTACGATTGA ACATATTATG
CCGCAGAACG AAAATTTGTC GGCTAAATGG CGCGAAGAGC TGGGAAGCGA CTGGCAGCGT
ATTCATAAAG AATTGTTGCA TACGTTGGGG AATCTCACTT TAACGCGCTA TAACTCCCGC
TACAGTGACA GACCTTTCGC GGAAAAACGC GATATTGAAG ACGGCTTTAA GCATAGCCCG
CTTTATTTGA ATATCGGTCT TGGACAGTGC GAAAAATGGG ATGAAGCCGC CATTCACGCC
CGAGCCGATC GTCTGGCCGA TCTCGCGGTT CAGGTCTGGC AAGCGCCTTC TCTTCCTGAA
GAGGTTTTAG CTGTTTATCG GGGACAGCCT GAGAACAAAA CCAGTTACAG CCTGAGTGAT
TATCCTTTTC TTGCTGATGG TTCGCATAGC CGGGTGTTAT TCGATCATCT TCGCGATGAA
GTTATGCGCC TGGACGCAGG GATCACGCAG GAAGTTTTAA AGCTGTATAT TGCGTTTAAA
GCTGAAACGA ATTTTGTTGA TGTTGTGCCG CAAAAAAGCC GACTGCGATT GTCGCTTAAT
ATGCAGTTTC ATGAACTGGT CGATCCGAAA GGTATTGCCA AAGATGTGAC AAATGTTGGG
CGCTGGGGCA ATGGCGATGT GGAAATTGGT TTCAGCGACC TCGCACAACT TCCTTACATT
ATGGGATTAA TTCGTCAGGC ATTTGAAAAA CAGATGGAGA GCGCGTTGGT ATAA
 
Protein sequence
MKATEARLLD FLKRSQQFVI PIYQRTYSWT EQQCRQLWDD IIRAGKRDDI SAHFIGSVVY 
IEQGLYQVSG ISPLLVIDGQ QRLTTAMLLI EALSRHLGED EVFDGFSAMK LRNYYLLNPY
ESGEKGFKLL LTETDKDSLL ALIKQRPMPE NYSHRIMENF TFFDEQIAKL GDDLIPLCRG
LAKLLIVDVA LNRGQDNPQL IFESMNSTGK ALSQADLVRN FILMGLEPEH QTRLYEDHWR
PMEVAFGQQG YSEYFDSFMR HYLTVKTGEI PRTDEVYEAF KLHARSQSVA EKGVDRLVED
IHIYAEYYCA MALGKESDKS LATAFQDLRE LKVDVAYPFL LAFYHDYKNG DLSHEDFLSI
IRLIESYVFR RAVCAIPTNS LNKTFATFYK VINKENYLES IQVHFLNLPS YRRFPNDDEF
KRELKVRDLY NFRSRSYWLR RLENDKRRER VEEFTIEHIM PQNENLSAKW REELGSDWQR
IHKELLHTLG NLTLTRYNSR YSDRPFAEKR DIEDGFKHSP LYLNIGLGQC EKWDEAAIHA
RADRLADLAV QVWQAPSLPE EVLAVYRGQP ENKTSYSLSD YPFLADGSHS RVLFDHLRDE
VMRLDAGITQ EVLKLYIAFK AETNFVDVVP QKSRLRLSLN MQFHELVDPK GIAKDVTNVG
RWGNGDVEIG FSDLAQLPYI MGLIRQAFEK QMESALV
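
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates codons using the amino-acid assignments of translation table 11 (which are the same as the standard code; the tables differ in permitted start codons). The helper names are hypothetical, and only the first line of the gene sequence is used, so the GC fraction of this prefix need not match the 44% reported for the whole gene.

```python
# First line of the gene sequence above, in its blocked layout.
GENE_PREFIX = "ATGAAAGCTA CAGAAGCCCG CCTACTCGAT TTTTTAAAAC GTTCGCAGCA GTTTGTTATT"

# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    dna = clean(GENE_PREFIX)
    print(len(dna), gc_fraction(dna))
    print(translate(dna))  # MKATEARLLDFLKRSQQFVI
```

The translated prefix corresponds to the first two blocks of the protein sequence (MKATEARLLD FLKRSQQFVI). Applying the same functions to the full gene sequence would be expected to yield the 697-residue protein followed by a stop symbol.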