Gene EcolC_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2237 
Symbol 
ID6067319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2456175 
End bp2457194 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content49% 
IMG OID641601642 
ProductLysR family transcriptional regulator 
Protein accessionYP_001725201 
Protein GI170020247 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.956085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCTT TAATTGTTAA TAATATTTTG CAATCAAGTT ATCATAATCA AACAACTTCA 
CTTGTCAGCG ACACCGCTTC GTTTTTAACA TCGCTTATGG AAAAAAATAG TCTGTTTAGT
CAGCGCATCC GTTTGCGCCA CCTTCATACA TTCGTAGCTG TCGCACAACA AGGAACTTTG
GGGCGCGCGG CTGAAACCCT TAATTTGAGT CAACCTGCGC TCTCTAAGAC ATTGAATGAA
CTGGAGCAGC TGACGGGCGC TCGCTTGTTT GAGCGTGGTC GTCAGGGGGC GCAACTTACC
TTACCCGGCG AACAATTTTT AACGCATGCA GTCAGAGTTC TTGACGCCAT CAACACTGCC
GGACAGTCGC TTCATCGTAA AGAAGGTCTT AATAATGATG TCGTCAGGGT TGGTGCACTA
CCTACTGCGG CACTGGGGAT ATTACCTTCG GTTATAGGTC AGTTTCATCA GCAACAAAAA
GAAACGACCT TGCAAGTTGC GACAATGAGT AACCCTATGA TTCTGGCGGG TTTGAAAACC
GGGGAAATCG ATATCGGCAT TGGTCGGATG TCAGATCCTG AACTGATGAC CGGGCTTAAT
TACGAACTGC TGTTTCTTGA ATCGCTGAAG CTGGTTGTCC GCCCTAATCA CCCGCTACTT
CAGGAGAACG TAACGCTAAG CCGGGTGCTG GAATGGCCGG TCGTTGTATC ACCAGAAGGC
ACTGCGCCAC GCCAGCATTC AGATGCATTA GTACAGAGCC AGGGATGTAA AATTCCTTCG
GGTTGTATCG AAACGCTGTC TGCTTCGCTA TCTCGTCAAC TTACGGTTGA ATACGACTAC
GTGTGGTTTG TCCCTTCTGG CGCGGTAAAA GACGACCTGC GTCATGCCAC GCTGGTGGCC
CTGCCTGTTC CGGGACATGG TGCAGGCGAA CCGATTGGAA TACTGACCCG CGTAGATGCG
ACGTTCTCTT CTGGTTGCCA GTTGATGATT AACGCTATTC GAAAATCAAT GCCGTTCTGA
 
Protein sequence
MIALIVNNIL QSSYHNQTTS LVSDTASFLT SLMEKNSLFS QRIRLRHLHT FVAVAQQGTL 
GRAAETLNLS QPALSKTLNE LEQLTGARLF ERGRQGAQLT LPGEQFLTHA VRVLDAINTA
GQSLHRKEGL NNDVVRVGAL PTAALGILPS VIGQFHQQQK ETTLQVATMS NPMILAGLKT
GEIDIGIGRM SDPELMTGLN YELLFLESLK LVVRPNHPLL QENVTLSRVL EWPVVVSPEG
TAPRQHSDAL VQSQGCKIPS GCIETLSASL SRQLTVEYDY VWFVPSGAVK DDLRHATLVA
LPVPGHGAGE PIGILTRVDA TFSSGCQLMI NAIRKSMPF