Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1848 |
Symbol | |
ID | 6065222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2046450 |
End bp | 2047733 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641601262 |
Product | hypothetical protein |
Protein accession | YP_001724824 |
Protein GI | 170019870 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0243705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGT TTATTGACCG GCGTCTGAAC GGCAAAAACA AAAGCATGGT GAATCGCCAG CGTTTTTTAC GCCGTTATAA AGCGCAAATT AAACAGTCGA TCTCCGAGGC CATTAATAAG CGTTCGGTGA CTGACGTCGA CAGCGGCGAA TCCGTATCCA TTCCCACGGA AGATATTAGC GAACCGATGT TTCATCAGGG GCGTGGCGGT CTGCGCCACC GCGTGCATCC GGGCAATGAC CATTTCGTCC AGAACGACCG AATTGAACGT CCCCAGGGTG GCGGCGGAGG TTCCGGCAGT GGTCAGGGCC AGGCCAGCCA GGATGGTGAA GGTCAGGATG AATTTGTCTT TCAGATTTCG AAAGATGAGT ATCTTGATCT GCTCTTTGAA GATTTGGCCT TACCGAATCT GAAACAAAAC CAACAACGCC AGCTGACCGA ATATAAAACG CATCGGGCGG GTTATACCGC TAACGGCGTT CCGGCCAATA TCAGCGTTGT GCGTTCATTG CAGAACTCAC TGGCGCGACG CACAGCCATG ACGGCAGGCA AGCGGCGGGA ACTTCATGCA CTGGAAGAGA ATTTGGCCAT CATCAGCAAC AGTGAACCTG CGCAACTGCT GGAAGAGGAA CGTCTGCGCA AAGAAATTGC AGAATTACGT GCCAAAATTG AACGCGTCCC TTTTATTGAC ACCTTCGATT TACGTTACAA GAACTACGAG AAGCGGCCCG ATCCCTCCAG CCAGGCAGTG ATGTTTTGCC TGATGGACGT TTCCGGTTCA ATGGATCAAT CCACTAAAGA TATGGCTAAG CGTTTTTATA TTCTGCTGTA TCTGTTCCTC AGCAGAACGT ATAAGAACGT GGAAGTCGTA TACATCCGCC ATCATACCCA GGCGAAAGAA GTCGATGAAC ATGAGTTTTT CTACTCGCAG GAAACAGGCG GCACCATTGT TTCCAGCGCC CTGAAACTGA TGGATGAGGT AGTGAAAGAG CGTTATAACC CGGCACAGTG GAATATTTAC GCTGCACAAG CATCGGACGG CGATAACTGG GCCGATGACT CTCCGCTTTG CCATGAAATC CTGGCGAAAA AATTATTACC TGTTGTTCGT TATTACAGCT ATATCGAAAT TACCCGTCGT GCACATCAGA CATTGTGGCG AGAATATGAG CATCTGCAAT CTACTTTCGA CAACTTTGCG ATGCAGCACA TCCGCGACCA GGATGATATT TATCCGGTGT TCCGTGAACT GTTTCATAAA CAAAATGCAA CAGCTAAAGA CTAA
|
Protein sequence | MTWFIDRRLN GKNKSMVNRQ RFLRRYKAQI KQSISEAINK RSVTDVDSGE SVSIPTEDIS EPMFHQGRGG LRHRVHPGND HFVQNDRIER PQGGGGGSGS GQGQASQDGE GQDEFVFQIS KDEYLDLLFE DLALPNLKQN QQRQLTEYKT HRAGYTANGV PANISVVRSL QNSLARRTAM TAGKRRELHA LEENLAIISN SEPAQLLEEE RLRKEIAELR AKIERVPFID TFDLRYKNYE KRPDPSSQAV MFCLMDVSGS MDQSTKDMAK RFYILLYLFL SRTYKNVEVV YIRHHTQAKE VDEHEFFYSQ ETGGTIVSSA LKLMDEVVKE RYNPAQWNIY AAQASDGDNW ADDSPLCHEI LAKKLLPVVR YYSYIEITRR AHQTLWREYE HLQSTFDNFA MQHIRDQDDI YPVFRELFHK QNATAKD
|
| |