Gene EcolC_4121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4121 
Symbol 
ID6066025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4547243 
End bp4548298 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content46% 
IMG OID641603543 
Producthypothetical protein 
Protein accessionYP_001727046 
Protein GI170022092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGA ATTTATTATC TTCCGCTATT ATAGTCGCCA TCATGTCCCT CGGTCTGACG 
GGTTGTGATG ATAAAAAAGC CGAAACAGAA ACGCTCCCGC CTGCCAATAG TCAACCTGCC
GCACCAGCTC CTGAAGCGAA ACCTACTGAA GCTCCCGTTG CAAAAGCAGA AGCTAAACCT
GAAACACCTG CGCAACCGGT GGTCGATGAA CAAGCGGTTT TCGACGAAAA AATGGATGTC
TATATCAAGT GCTACAACAA GTTACAGATC CCGGTACAGC GCAGTCTGGC GCGTTATGCT
GACTGGCTGA AAGATTTTAA ACAGGGGCCT ACCGGTGAAG AGCGTACTGT TTATGGCATC
TACGGCATTA GTGAATCCAA CCTCGCTGAG TGTGAAAAAG GCGTAAAAAG TGCTGTGGCG
TTAACGCCTG CGCTGCAACC AATTGATGGC GTGGCGGTGA GTTATATCGA TGCTGCCGTG
GCGTTGGGTA ATACCATTAA CGAAATGGAT AAATATTACA CCCAGGAAAA TTATAAAGAC
GATGCGTTTG CGAAAGGGAA AACGCTGCAC CAGACATTCT TAAAAAATCT GGAAGCCTTT
GAACCTGTAG CCGAATCTTA TCATGCGGCG ATTCAGGAAA TTAATGATAA GCGTCAGCTT
GCGGAACTGA AAAATATTGA AGAAAGAGAA GGGAAAACAT TCCACTACTA CTCTCTGGCA
GTCATGATTT CAGCGAAACA AATTAATAAC CTGATATCGC AAGATAAGTT TGATGCAGAA
GCGGCAATGA AGAAAGTGTC TGAACTGGAA ACGCTGGTGG CGCAGGCCAA AGAAGCGGAT
AAAGGCGGCA TGAATTTCTC GTTTATTAAT TCGGCAGGCC AGTATCAACT CGAGGCTAAA
AAATACGTTC GCCGCATCAG AGATAAAGTC CCGTACTCTG ACTGGGATAA AGAGCAACTT
CAGGACGCAA ACTCAAGCTG GATGGTCGAA GACTCTTTCC CGAGAGCATT ACGCGAGTAC
AACGAAATGG TTGATGACTA TAATAGCCTG CGTTAA
 
Protein sequence
MKRNLLSSAI IVAIMSLGLT GCDDKKAETE TLPPANSQPA APAPEAKPTE APVAKAEAKP 
ETPAQPVVDE QAVFDEKMDV YIKCYNKLQI PVQRSLARYA DWLKDFKQGP TGEERTVYGI
YGISESNLAE CEKGVKSAVA LTPALQPIDG VAVSYIDAAV ALGNTINEMD KYYTQENYKD
DAFAKGKTLH QTFLKNLEAF EPVAESYHAA IQEINDKRQL AELKNIEERE GKTFHYYSLA
VMISAKQINN LISQDKFDAE AAMKKVSELE TLVAQAKEAD KGGMNFSFIN SAGQYQLEAK
KYVRRIRDKV PYSDWDKEQL QDANSSWMVE DSFPRALREY NEMVDDYNSL R