Gene EcolC_3384 details


Gene Information

Locus tag: EcolC_3384
Symbol:
ID: 6067568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3703400
End bp: 3704386
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 52%
IMG OID: 641602798
Product: putative sigma54 specific transcriptional regulator
Protein accession: YP_001726330
Protein GI: 170021376
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
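
The coordinates and lengths in the Gene Information fields can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record states neither explicitly. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3703400
end_bp = 3704386
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)            # coordinates agree with the gene length
print(remainder == 0)                       # gene length is a whole number of codons
print(coding_codons == protein_length_aa)   # codons minus stop agree with protein length
```

Under these assumptions all three checks print True: 3704386 - 3703400 + 1 = 987, and 987 / 3 = 329 codons, one of which is the stop codon.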


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.399889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAC TTATTGCAAC TGCCGCTTCC AGCATTAACG CTTTTACTCT GGCAAAGCGT 
GTCGCCGCAT TTAACGTGCC GGTGCTCATT CAGGGCGAAA CCGGCGCGGG CAAAGAATGC
GTGGCGAAAT ATATTCACAC CGTAGCCTTT GGTGAAAATG ATAACGCGCC CTATATCGGC
GTGAACTGCG CGGCGATCCC AGAAAATATG CTGGAAGCGA CCTTATTTGG CTACGACAAA
GGCGCATTTA CCGGCGCAAT TGCCAGCGTA CCTGGAAAAA TGGAACTGGC GAATAACGGC
ACCTTATTGC TCGATGAAAT TGGCGATATG CCGCTGGCAT TACAGGCCAA AATATTACGC
GTATTGCAGG AACAGCTGGT TGAGCGATTA GGCAGCAACC GACAAATTAA ACTCAATTTT
CGCCTGATTG CCTGCACCAA TAAAAACCTT GAACAGGAAG TCGCTGCCGG GCGTTTTCGT
GAAGATCTCT ATTATCGCCT GGCGGTTATT CCTATTACCA TGCCGCCGCT GCGTGAACGT
CTGAACGATA TTATTCCGCT GGCAGAGTCA TTTATTAAAA AATACTCCAC GGTGCTGGTG
AAAAATATCA CCCTTTCAGA ATCTACCCGC CGGGCGCTGC TCAATTACCG CTGGCCCGGC
AACGTGCGCC AGCTGGAGAA CGCCATACAG CGGGGAATGA TCTTAAACCG CGACGGCGTA
ATTTACCTCG ATGCGTTAGG CCTGCCGGAA AATGACATTG CCGACCGCAG CGAACTGCAA
TGGCCTGTTC AGCCCGCCGT CCACATTGCC GAAACCAGCG ATTTGGGCCA GCACGGACGA
AGCGCCCAGT ATCAATATAT CGCTGACCTG ATGCGTAAAT ATCAGGGCAA CCGCAGCAAA
ATCGCCGACC TGTTAGGCAT TACCCCGCGC GCACTGCGCT ATCGACTGGC CTCCATGCGC
AAGCAAGGTA TCGAAGTTTT CTCCTGA
 
Protein sequence
MSELIATAAS SINAFTLAKR VAAFNVPVLI QGETGAGKEC VAKYIHTVAF GENDNAPYIG 
VNCAAIPENM LEATLFGYDK GAFTGAIASV PGKMELANNG TLLLDEIGDM PLALQAKILR
VLQEQLVERL GSNRQIKLNF RLIACTNKNL EQEVAAGRFR EDLYYRLAVI PITMPPLRER
LNDIIPLAES FIKKYSTVLV KNITLSESTR RALLNYRWPG NVRQLENAIQ RGMILNRDGV
IYLDALGLPE NDIADRSELQ WPVQPAVHIA ETSDLGQHGR SAQYQYIADL MRKYQGNRSK
IADLLGITPR ALRYRLASMR KQGIEVFS
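
The gene and protein sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard genetic code; translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGTCTGAAC TTATTGCAAC TGCCGCTTCC"
print(translate(first_blocks))  # MSELIATAAS

# Last line of the gene sequence above, which ends in the stop codon TGA.
last_line = "AAGCAAGGTA TCGAAGTTTT CTCCTGA"
print(translate(last_line))  # KQGIEVFS
```

The two printed fragments match the first block and the final eight residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the 52% reported in the record.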