Gene EcolC_4206 details


Gene Information

Locus tag: EcolC_4206
Symbol:
ID: 6067713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4645791
End bp: 4647026
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 52%
IMG OID: 641603634
Product: radical SAM domain-containing protein
Protein accession: YP_001727130
Protein GI: 170022176
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
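
The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and a CDS length that includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4645791, 4647026)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both values agree with the Gene Length and Protein Length fields of the record.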


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000150338
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT 
TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG
CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATA TCCGTCACTA TATCGCCGCC
AGCGAACTGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GCTACTCGGG
CTGGAGTTTT ACCGCCGTGC CGTAGCGCTA CAGGCGAAAT ATGGTGCTGG CAGGAAGATA
AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTGGATGACG AATGGTGCGC GTTTCTCGCG
GAGCATCATT TTCTTGTTGG TTTATCGCTG GATGGTCCGC CTGAGATCCA CAATCAATAT
CGCGTGACTA AAGGTGGCAG ACCCACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG
CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTTA ATCGCACCAG CGCGCAGCAA
CCGTTGCAGG TATATGATTT TTTGTGCGAT GCGGGAGTGG AATTCATCCA GTTTATTCCG
GTGGTCGAGC GCCTGGCTGA TGAAACGGCT GCCCGCGAAG GACTGAAATT GCATGCGCCT
GGTGATATTC AGGGTGAGCT AACGGAATGG TCGGTGCGCC CCGAGGAGTT CGGTGAATTT
CTGGTGGCGA TATTCGACCA CTGGATTAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT
ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA
ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGCGACG TTTACGCCTG CGATCACTAT
GTTTATCCAC AATATCGGCT GGGGAATATG CACCAGCAAA CAATTGCAGA AATGATCGAT
TCCCCGCAAC AGCAGGCGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC
AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT
GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT
CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC
ATTATGCATG CGCATTTGCT GGTGGTGAGT AAGTAA
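
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and how the record rounds its reported GC content is not stated, so the sketch makes no claim about reproducing that figure exactly.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Passing the full 1236 bp sequence through the same two functions gives a value that can be compared with the GC content field.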
 
Protein sequence
MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYIRHYIAA 
SELQNEVAFT WQGGEPTLLG LEFYRRAVAL QAKYGAGRKI SNSFQTNGVL LDDEWCAFLA
EHHFLVGLSL DGPPEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ
PLQVYDFLCD AGVEFIQFIP VVERLADETA AREGLKLHAP GDIQGELTEW SVRPEEFGEF
LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY
VYPQYRLGNM HQQTIAEMID SPQQQAFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD
ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMHAHLLVVS K
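
The protein sequence can be re-derived from the gene sequence with translation table 11, as a consistency check. The sketch below is a minimal pure-Python translator, not the tool that produced this record. The amino acid assignments of table 11 are the same as those of the standard code, and the sketch ignores the special handling of alternative start codons because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for all 64 codons (NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT"))  # MLQQVPTRAF
print(translate("GTGAGTAAGTAA"))                      # VSK
```

Both fragments match the start and end of the protein sequence listed in the record.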