Gene EcolC_1639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1639 
Symbol 
ID6065580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1819885 
End bp1820862 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content44% 
IMG OID641601053 
ProductDeoR family transcriptional regulator 
Protein accessionYP_001724623 
Protein GI170019669 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.294849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00400864 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAAGA GAATGACGAA ACAGAAAAAA AACAGCCGCT GGCAGGGTTA CGATCCTCGC 
TACATATACT CGCTCGTGGT CAGACGTTAT TTTGCGGATA TGAAAACCAA GATTGAGATT
GCAGAGGAGC TTGGCGTTTC CCGCTTTAAA GTTGCCAGAT TGATCGATGA GGCGATCGAA
CAGGAATACG TGAGGTTTAT CTTCCCTAAG CAGCAAGCGA TGGATGAAGA AATCGCTAAT
AATCTACGGA AAAAATTTCA TCTGGAAGAT GCAATTGTTC TTTCAGTTGC TGAATCCTGG
ACGACGCAAG AAGAACTCAA TCACAAATTG GGTGAAGTCA CCGCTGAATA TCTTATGCAA
TCTCTTCATG AAGATATGAA AGTGGGGATC GCCTGGGGAC GTGTATTATC AAGCACGGTC
AGTAAGTTGA GCAAGTTGCC TCCTTTAGAC GTTGTGCAGT TATCTGGCGT ACATCCGGGG
ATCGAGTTTA GTCAGGGGCC AATAGATCTT ATTCATAAGA TCGCTGCCAT TTCGCAGGGA
AAAGCGCACC CAATGTACGT GCCGATGTGG GTCGATGACG AAGAGCTTGC TGCCAGACTG
GCAGGTGATC CTGCGGTATT AGATACACAG CAATATTACT CACAGTTGGA TGTGGTTATC
ACCGGGATAG GTGACTGGAA ATCAGGTTCT TCAAGCTTGT GTAAAATATT TCCGGATACC
TGGTGCGAAG CTTTGTTTCA ACAAGATATC GCTGCGGATG TGTGTATCTC GTTGGTCAGC
AGGGAAGGGA AGATTCTTCA TAGTCCTATT GAACGTCTGG GATTTGGCAT TTCGACGGAT
CAACTACAAA AAGCCAAAAA AGTGATTGGT GTCGCTGGAG GAGAAGAAAA ATATGAAGGC
ATTCTTGCTT CGCTGAAATC TGGACTTTTA AATGTCTTAA TTACTGATTT TGATACGGCC
ATTAAACTTC TGGATTAA
 
Protein sequence
MEKRMTKQKK NSRWQGYDPR YIYSLVVRRY FADMKTKIEI AEELGVSRFK VARLIDEAIE 
QEYVRFIFPK QQAMDEEIAN NLRKKFHLED AIVLSVAESW TTQEELNHKL GEVTAEYLMQ
SLHEDMKVGI AWGRVLSSTV SKLSKLPPLD VVQLSGVHPG IEFSQGPIDL IHKIAAISQG
KAHPMYVPMW VDDEELAARL AGDPAVLDTQ QYYSQLDVVI TGIGDWKSGS SSLCKIFPDT
WCEALFQQDI AADVCISLVS REGKILHSPI ERLGFGISTD QLQKAKKVIG VAGGEEKYEG
ILASLKSGLL NVLITDFDTA IKLLD