Gene EcolC_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1123 
Symbol 
ID6067969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1224120 
End bp1225454 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content55% 
IMG OID641600539 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_001724117 
Protein GI170019163 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATA AACCTGCGCA TTTATTATTG GTCGATGACG ATCCGGGATT GCTGAAACTG 
CTTGGCCTGC GCCTGACCAG CGAAGGCTAC AGTGTGGTCA CGGCGGAAAG TGGCGCTGAA
GGATTACGGG TACTGAATCG CGAAAAAGTA GATTTAGTCA TCAGCGACCT GCGGATGGAT
GAAATGGACG GTATGCAGCT GTTTGCTGAA ATCCAGAAAG TGCAGCCGGG AATGCCGGTA
ATTATTCTTA CCGCGCATGG TTCTATTCCC GATGCCGTTG CTGCAACACA GCAGGGCGTT
TTTAGTTTCC TCACCAAGCC TGTCGACAAA GACGCGCTAT ATCAGGCAAT TGACGATGCG
CTGGAGCAAT CCGCGCCAGC CACCGATGAA CGCTGGCGCG AGGCAATTGT CACCCGCAGC
CCGCTGATGC TGCGTTTGCT GGAACAGGCG CGGCTGGTGG CGCAATCAGA CGTCAGCGTT
TTGATTAACG GTCAGAGCGG CACCGGGAAA GAGATTTTCG CCCAGGCTAT CCACAACGCC
AGCCCGCGCA ACAGCAAACC ATTTATTGCT ATTAACTGTG GCGCATTGCC CGAGCAATTG
CTGGAGTCGG AGCTGTTTGG TCATGCGCGT GGCGCGTTTA CTGGCGCTGT CAGCAATCGC
GAAGGTTTAT TCCAGGCGGC GGAAGGCGGA ACGCTATTTC TCGATGAAAT TGGCGATATG
CCTGCGCCGT TACAGGTCAA ACTGCTGCGC GTGTTGCAGG AGCGTAAAGT GCGCCCGCTG
GGCAGTAACC GCGATATTGA TATCAATGTG CGGATTATTT CTGCCACTCA CCGTGATCTG
CCAAAAGCGA TGGCGCGCGG GGAATTCCGC GAAGATCTCT ATTACCGCCT CAACGTTGTC
AGCCTGAAAA TTCCGGCACT GGCGGAGCGC ACAGAAGACA TTCCGCTACT GGCAAATCAC
CTGTTGCGCC AGGCGGCAGA GCGACATAAA CCGTTTGTCC GCGCGTTCTC TACCGATGCG
ATGAAACGCC TGATGACCGC GAGCTGGCCG GGTAATGTGC GCCAGTTGGT CAACGTGATT
GAACAGTGCG TGGCGCTGAC CTCATCTCCG GTGATTAGTG ATGCGCTGGT GGAGCAGGCG
CTGGAGGGTG AAAATACGGC GCTGCCAACC TTTGTTGAGG CACGTAATCA GTTTGAACTC
AACTATTTGC GTAAGCTGCT GCAAATCACC AAAGGCAACG TCACCCACGC GGCGAGAATG
GCGGGGCGCA ACCGGACAGA ATTTTATAAA CTGCTTTCCC GACACGAGCT GGATGCAAAC
GATTTCAAGG AATGA
 
Protein sequence
MSHKPAHLLL VDDDPGLLKL LGLRLTSEGY SVVTAESGAE GLRVLNREKV DLVISDLRMD 
EMDGMQLFAE IQKVQPGMPV IILTAHGSIP DAVAATQQGV FSFLTKPVDK DALYQAIDDA
LEQSAPATDE RWREAIVTRS PLMLRLLEQA RLVAQSDVSV LINGQSGTGK EIFAQAIHNA
SPRNSKPFIA INCGALPEQL LESELFGHAR GAFTGAVSNR EGLFQAAEGG TLFLDEIGDM
PAPLQVKLLR VLQERKVRPL GSNRDIDINV RIISATHRDL PKAMARGEFR EDLYYRLNVV
SLKIPALAER TEDIPLLANH LLRQAAERHK PFVRAFSTDA MKRLMTASWP GNVRQLVNVI
EQCVALTSSP VISDALVEQA LEGENTALPT FVEARNQFEL NYLRKLLQIT KGNVTHAARM
AGRNRTEFYK LLSRHELDAN DFKE