Gene EcE24377A_3478 details

Gene Information

Locus tag: EcE24377A_3478
Symbol:
ID: 5587362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3487528
End bp: 3488655
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 50%
IMG OID: 640927105
Product: AraC family transcriptional regulator
Protein accession: YP_001464475
Protein GI: 157156780
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
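
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; the function name `cds_lengths` is hypothetical. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with one stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record: Start bp 3487528, End bp 3488655.
gene_len, protein_len = cds_lengths(3487528, 3488655)
print(gene_len, protein_len)  # prints "1128 375"
```

Under these assumptions the result agrees with the record's stated gene length of 1128 bp and protein length of 375 aa.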


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTTG GGGTGTGCAG ATTAAAGTTG TTCATTACTT GCTCCCTTTG CTGGGCCAAT 
ATGAGGGCAG AGAACGATCT GCCTGATGTT TTTCATTGTG ATCGCCAGCG CCCTGGCTCT
CAATGCTCAT TTCTGCCAAT GTCTTGCCTA TTTCTCCAGA GTGCTGGAGA AATGCTACAA
AATTGCGCAC AATCAAATTG CCGCATTATT CCTAAGAAAT TACGCGATAT GAAACGTGAA
GAGATTTGCC GTTTGCTGGC GAATAAAGTT AATAAAATGA AAAATAAAGA AAATAGTTTG
TCAGAACTGT TGCCCGATGT GCGTTTGTTG TATGGCGAGA CGCCTTTCGC ACGTACACCG
GTGATGTACG AGCCTGGCAT CATAATTCTC TTTTCCGGAC ATAAAATCGG TTATATCAAT
GAACGCGTGT TTCGTTATGA TGCCAATGAA TACCTGCTGC TGACGGTGCC GTTGCCGTTT
GAGTGCGAAA CCTATGCCAC GTCAGAGATG CCGCTGGCAG GGGTGCGTCT CAATGTCGAT
ATTTTGCAGT TACAGGAACT GTTGATGGAC ATTGGCGAAG ATGAGCATTT CCAGCCGTCG
ATGGCAGCCA GCGGGATTAA CTCCGCCACG TTATCAGAAG AGATTTTATG CGCGGCGGAG
CGGTTACTCG ACGTGATGGA GCGACCACTG GATGCGCGTA TTCTCGGCAA ACAGATCATC
CGCGAAATTC TGTACTACGT GCTGACCGGA CCTTGCGGCG GCGCGTTACT GGCGCTGGTC
AGTCGCCAGA CTCACTTCAG TCTGATTAGC CGCGTGCTGA AACGGATTGA GAATAAATAC
ACCGAAAACC TGAGCGTCGA GCAACTGGCG GCAGAAGCCA ATATGAGCGT ATCGGCGTTC
CACCATAATT TTAAGTCTGT CACAAGCACC TCGCCGTTGC AGTATTTGAA GAATTACCGT
CTGCATAAGG CGCGGATGAT GATCATCCAT GACGGCATGA AGGCCAGCGC AGCAGCGATG
CGCGTCGGCT ATGAAAGCGC ATCGCAATTT AGCCGTGAGT TTAAACGTTA CTTCGGTGTG
ACGCCGGGGG AAGATGCGGC AAGAATGCGG GCGATGCAGG GGAATTAA
 
Protein sequence
MRVGVCRLKL FITCSLCWAN MRAENDLPDV FHCDRQRPGS QCSFLPMSCL FLQSAGEMLQ 
NCAQSNCRII PKKLRDMKRE EICRLLANKV NKMKNKENSL SELLPDVRLL YGETPFARTP
VMYEPGIIIL FSGHKIGYIN ERVFRYDANE YLLLTVPLPF ECETYATSEM PLAGVRLNVD
ILQLQELLMD IGEDEHFQPS MAASGINSAT LSEEILCAAE RLLDVMERPL DARILGKQII
REILYYVLTG PCGGALLALV SRQTHFSLIS RVLKRIENKY TENLSVEQLA AEANMSVSAF
HHNFKSVTST SPLQYLKNYR LHKARMMIIH DGMKASAAAM RVGYESASQF SREFKRYFGV
TPGEDAARMR AMQGN