Gene EcolC_3709 details


Gene Information

Locus tag: EcolC_3709
Symbol:
ID: 6064706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4059436
End bp: 4060326
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 52%
IMG OID: 641603127
Product: AraC family transcriptional regulator
Protein accession: YP_001726647
Protein GI: 170021693
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID: [TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA
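
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 4059436
end_bp = 4060326
gene_length_bp = 891
protein_length_aa = 296

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 891 0 296
```

Under these assumptions the span matches the listed gene length, it divides evenly into codons, and the codon count minus the stop codon matches the listed protein length.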


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.680131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTGACC GTCAGATTGC CAATATTGAT ATCAGCAAAG AGTACGATGA AAGCCTGGGC 
ACGGACGATG TGCATTATCA GTCGTTCGCC CGCATGGCGG CCTTTTTTGG CCGCCATATG
CTGCCACATC GCCACGAACA GTACTTTCAG ATGCATTTCC TCAATAGCGG ACAGATTGAG
CTACAGCTTG ACGATCATCG CTACTCGGTG GAAGCGCCCC TGTTTGTCCT GACGCCGCCG
TCAGTACCTC ATGCGTTTAT TACGGAGTCT GATGCTGACG GTCATGTATT GACGGTACGG
GAAGATCTGA TCTGGCCCCT GCTGGAAGTT CTTTATCCGG GCACTCGGGA AACCTTCGGC
CTGCCGGGGA TTTGCCTGTC ACTGGCAGAT AAACCCGACG AACTGGCGGC GCTGGAACAC
TATTGGCAAC TGATAGAGCG GGAATCGGTA GAACAACTGC CTGGACGGGA ACACACCCTG
ACGTTACTGG CACAGGCAGT GTTCACCCTA CTGCTGCGTA ACGCAAAACT CGACGACCAT
GCCGCCAGCG GAATGCGCGG AGAATTAAAA CTGTTCCAGC GTTTTCATAT GCTTATTGAA
AGCCATTTTC ATCAGCACTG GACAGTACCG GATTACGCTA ACGAACTGCA TATCACCGAA
TCACGCCTCA CGGACATCTG CCGCCGCTTT GCCAACCGTC CGCCAAAACG GTTGATTTTC
GACAGGCAGC TACGAGAAGC CAAGCGGCTG CTGCTGTTTT CTGATAACGC CGTGAACAAT
ATTGCCTGGC AACTCGGTTT TAAGGATCCG GCTTATTTTG CGCGCTTTTT TAATCGCTTA
GTCGGTTGCT CGCCCAGTGC TTATCGTGCC AAAAAAGTAC CTGTGACGTG A
 
Protein sequence
MCDRQIANID ISKEYDESLG TDDVHYQSFA RMAAFFGRHM LPHRHEQYFQ MHFLNSGQIE 
LQLDDHRYSV EAPLFVLTPP SVPHAFITES DADGHVLTVR EDLIWPLLEV LYPGTRETFG
LPGICLSLAD KPDELAALEH YWQLIERESV EQLPGREHTL TLLAQAVFTL LLRNAKLDDH
AASGMRGELK LFQRFHMLIE SHFHQHWTVP DYANELHITE SRLTDICRRF ANRPPKRLIF
DRQLREAKRL LLFSDNAVNN IAWQLGFKDP AYFARFFNRL VGCSPSAYRA KKVPVT
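
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be analysed. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; the sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTGTGACC GTCAGATTGC CAATATTGAT ATCAGCAAAG AGTACGATGA AAGCCTGGGC"
print(translate(first_line))  # MCDRQIANIDISKEYDESLG
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.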