Gene Information |
Locus tag | EcolC_4206 |
Symbol | |
ID | 6067713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4645791 |
End bp | 4647026 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603634 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001727130 |
Protein GI | 170022176 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
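The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue; the stop codon encodes none.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4645791, 4647026)
    print(length)                  # 1236
    print(protein_length(length))  # 411
```

Under these assumptions the computed values agree with the Gene Length (1236 bp) and Protein Length (411 aa) rows of the record.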
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000150338 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATA TCCGTCACTA TATCGCCGCC AGCGAACTGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GCTACTCGGG CTGGAGTTTT ACCGCCGTGC CGTAGCGCTA CAGGCGAAAT ATGGTGCTGG CAGGAAGATA AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTGGATGACG AATGGTGCGC GTTTCTCGCG GAGCATCATT TTCTTGTTGG TTTATCGCTG GATGGTCCGC CTGAGATCCA CAATCAATAT CGCGTGACTA AAGGTGGCAG ACCCACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTTA ATCGCACCAG CGCGCAGCAA CCGTTGCAGG TATATGATTT TTTGTGCGAT GCGGGAGTGG AATTCATCCA GTTTATTCCG GTGGTCGAGC GCCTGGCTGA TGAAACGGCT GCCCGCGAAG GACTGAAATT GCATGCGCCT GGTGATATTC AGGGTGAGCT AACGGAATGG TCGGTGCGCC CCGAGGAGTT CGGTGAATTT CTGGTGGCGA TATTCGACCA CTGGATTAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGCGACG TTTACGCCTG CGATCACTAT GTTTATCCAC AATATCGGCT GGGGAATATG CACCAGCAAA CAATTGCAGA AATGATCGAT TCCCCGCAAC AGCAGGCGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC ATTATGCATG CGCATTTGCT GGTGGTGAGT AAGTAA
|
Protein sequence | MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYIRHYIAA SELQNEVAFT WQGGEPTLLG LEFYRRAVAL QAKYGAGRKI SNSFQTNGVL LDDEWCAFLA EHHFLVGLSL DGPPEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ PLQVYDFLCD AGVEFIQFIP VVERLADETA AREGLKLHAP GDIQGELTEW SVRPEEFGEF LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY VYPQYRLGNM HQQTIAEMID SPQQQAFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMHAHLLVVS K
|
| |
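The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons as far as this sketch is concerned. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is built in the code rather than taken from any library.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid string follows the usual
# NCBI layout for the standard code (assumed to apply to table 11 here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    head = "ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT"
    print(translate(head))  # MLQQVPTRAF
```

The first ten residues produced from the first 30 bases match the start of the Protein sequence row. Passing the full Gene sequence string to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.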