Gene EcolC_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2303 
Symbol 
ID6067006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2540058 
End bp2541119 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content56% 
IMG OID641601706 
Producthypothetical protein 
Protein accessionYP_001725265 
Protein GI170020311 
COG category[S] Function unknown 
COG ID[COG3768] Predicted membrane protein 
TIGRFAM ID[TIGR01620] conserved hypothetical protein, TIGR01620 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000471514 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CTCTGGAGGT CGATCAGAAT 
CCTAAATTCA GGGCGCAGCA GACCTTTGAC GAAAATCAGG CGCAAAATTT TGCCCCGGCC
ACGCTCGACG AAGCGCAGGA AGAAGAGGGG CAAGTCGAAG CGGTAATGGA CGCAGCGTTA
CGTCCGAAAC GCAGCCTGTG GCGCAAAATG GTGATGGGCG GGCTGGCTCT GTTTGGCGCA
AGCGTTGTCG GGCAGGGTGT ACAGTGGACA ATGAATGCCT GGCAAACCCA GGACTGGGTG
GCGCTGGGTG GATGTGCCGC TGGGGCATTG ATTATCGGCG CTGGCGTAGG TTCTGTGGTA
ACAGAGTGGC GGCGCTTATG GCGCTTGCGA CAGCGCGCCC ATGAACGCGA CGAAGCGCGT
GATTTATTGC ATAGCCACGG CACGGGCAAA GGCCGCGCAT TTTGCGAAAA ACTGGCGCAG
CAGGCGGGTA TTGATCAGTC GCATCCGGCG CTGCAACGCT GGTATGCCTC AATCCATGAA
ACGCAAAACG ACCGTGAAGT GGTCAGCCTG TATGCGCATT TGGTCCAGCC AGTTTTAGAT
GCCCAGGCGC GGCGCGAAAT CAGCCGTTCG GCGGCGGAAT CAACGTTGAT GATTGCGGTC
AGCCCGCTGG CGTTGGTCGA TATGGCGTTT ATCGCCTGGC GCAATCTGCG TTTAATTAAT
CGCATCGCCA CGCTGTATGG CATTGAACTG GGGTATTACA GCCGTTTGCG TCTGTTTAAG
CTGGTATTGC TGAATATCGC TTTTGCCGGA GCCAGCGAAC TGGTGCGCGA AGTGGGGATG
GACTGGATGT CGCAAGATCT CGCTGCTCGT TTGTCTACCC GCGCAGCTCA GGGGATTGGT
GCAGGACTTC TGACGGCACG ACTCGGGATT AAAGCTATGG AGCTTTGCCG CCCGCTGCCG
TGGATTGACG ATGACAAACC TCGCCTCGGG GATTTCCGTC GTCAGCTTAT CGGTCAGGTG
AAAGAAACGC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA
 
Protein sequence
MTEPLKPRID FDGPLEVDQN PKFRAQQTFD ENQAQNFAPA TLDEAQEEEG QVEAVMDAAL 
RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV
TEWRRLWRLR QRAHERDEAR DLLHSHGTGK GRAFCEKLAQ QAGIDQSHPA LQRWYASIHE
TQNDREVVSL YAHLVQPVLD AQARREISRS AAESTLMIAV SPLALVDMAF IAWRNLRLIN
RIATLYGIEL GYYSRLRLFK LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG
AGLLTARLGI KAMELCRPLP WIDDDKPRLG DFRRQLIGQV KETLQKGKTP SEK