Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2236 |
Symbol | |
ID | 6066347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2454711 |
End bp | 2456054 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641601641 |
Product | hypothetical protein |
Protein accession | YP_001725200 |
Protein GI | 170020246 |
COG category | [S] Function unknown |
COG ID | [COG5383] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.953414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAACA GCATCACGGC GGATGAGATT CGGGAACAGT TTTCGCAGGC AATGTCAGCC ATGTACCAGC AAGAAGTTCC GCAGTACGGC ACGCTGCTGG AACTGGTAGC TGATGTGAAT CTGGCTGTGC TGGAAAACAA TCCTCAACTG CACGAAAAAA TGGTAAATGC AGACGAGCTG GCGCGACTGA ATGTTGAACG TCATGGGGCG ATTCGCGTTG GGACTGCACA AGAGCTTGCT ACTCTTCGGC GGATGTTTGC CATTATGGGG ATGTACCCGG TGAGCTATTA CGATCTCTCG CAGGCGGGGG TGCCGGTACA TTCGACAGCA TTTCGGCCCA TTGATGATGC TTCTCTGGCG CGTAATCCCT TCCGCGTTTT TACCTCGTTG CTCCGCCTTG AGCTTATCGA GAACGAAATT TTGCGCCAGA AAGCGGCGGA GATTCTACGT CAGCGCGATA TCTTCACCCC ACGTTGTCGA CTACTGTTAG AGGAATATGA GCAGCGGGGC GGTTTTAACG AAACACAGGC ACAGGAGTTT GTGCAGGAAG CCCTGGAAAC GTTTCGCTGG CACCAGTCAG CAACGGTAGA TGAAGAAACC TATCGCGCCT TGCACAACGA ACATCGGTTG ATTGCTGATG TGGTCTGTTT TCCTGGATGC CATATCAACC ATCTGACGCC ACGTACGCTG GATATTGACC GGGTGCAGTC GATGATGCCT GAATGCGGAA TTGAACCAAA AATTCTGATC GAAGGGCCGC CGCGCCGCGA GGTACCGATT TTACTACGCC AGACCAGCTT TAAAGCACTG GAAGAGACGG TGTTGTTTGC GGGGCAGAAA CAGGGCACGC ATACTGCGCG CTTTGGTGAA ATTGAGCAGC GTGGCGTGGC ATTAACGCCG AAAGGGCGAC AACTGTATGA TGATCTTCTG CGTAACGCTG GAACCGGGCA GGATAATCTC ACTCACCAAA TGCATTTACA GGAAACCTTC CGCACTTTTC CTGACAGTGA GTTTTTAATG CGTCAGCAAG GGCTGGCATG GTTCCGGTAC CGTCTGACGC CTTCGGGTGA GGCGCATCGT CAGGCGATTC ATCCCGGAGA CGATCCACAG CCCTTAATTG AACGTGGTTG GGTCGCGGCG CAACCCATTA CCTATGAAGA TTTCTTGCCC GTTAGCGCGG CGGGGATCTT CCAGTCAAAT CTGGGTAATG AAACGCAGGC ACGCAATCAC GGTAATGCCA GTCGCGAAGC ATTTGAGCAG GCGTTGGGTT GTCCGGTTTT GGATGAGTTC CAGCTTTATC AGGAAGCGGA AGAACGCAGT AAACGTCGCT GTGGTTTGCT TTAA
|
Protein sequence | MANSITADEI REQFSQAMSA MYQQEVPQYG TLLELVADVN LAVLENNPQL HEKMVNADEL ARLNVERHGA IRVGTAQELA TLRRMFAIMG MYPVSYYDLS QAGVPVHSTA FRPIDDASLA RNPFRVFTSL LRLELIENEI LRQKAAEILR QRDIFTPRCR LLLEEYEQRG GFNETQAQEF VQEALETFRW HQSATVDEET YRALHNEHRL IADVVCFPGC HINHLTPRTL DIDRVQSMMP ECGIEPKILI EGPPRREVPI LLRQTSFKAL EETVLFAGQK QGTHTARFGE IEQRGVALTP KGRQLYDDLL RNAGTGQDNL THQMHLQETF RTFPDSEFLM RQQGLAWFRY RLTPSGEAHR QAIHPGDDPQ PLIERGWVAA QPITYEDFLP VSAAGIFQSN LGNETQARNH GNASREAFEQ ALGCPVLDEF QLYQEAEERS KRRCGLL
|
| |