Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1932 |
Symbol | |
ID | 6068607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2134005 |
End bp | 2135294 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601343 |
Product | hypothetical protein |
Protein accession | YP_001724905 |
Protein GI | 170019951 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0346212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC GCACTGGTCA TGGCACGAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC GGATGTAAAA ACATGACCGG CGGGCGCCTT TATGCCCACA CACTTGAAGC AATCATTCCA GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA TGCCGAGCAG GCGGGCGCAC AGTTTATCCC GGGAGTTCGC GTCGATGCGC TGGTTCGTGA AGGAAACAAG GTCACTGGCG TCCAGGCTGG GGATGATATT CTCGAAACGA ATGTGGTGAT TCTGGCTGAT GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG ATGGGTGGCG GATTCCTCTA TACCAATAAG GATTCCATAT CGTTGGGGCT GGTTTGTGGA TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG GTGCCGGAAG GCGGTCTGGC AATGGTGCCG CAACTGGTTA ACGAGGGCGT GATGATCGTT GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC ATTGCATCGG CTCAAGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGCGGATTTC TCCGCCAGCA GTCTTGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGTGAT ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC CCACGAATGG TAGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGCAA ACCAAACCAG CCGGTACGCA AAATGATCAT GGGACATGCG AAGAAAATCG GGCTGATCAA CTTGCTGAAA GATGGCATTA AGGGAGCAAC CGCGCTATGA
|
Protein sequence | MSDDKFDAIV VGAGVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEHAEQ AGAQFIPGVR VDALVREGNK VTGVQAGDDI LETNVVILAD GVNSMLGRSL GMVPASDPHH YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNEGVMIV GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK DGIKGATAL
|
| |