Gene Information |
Locus tag | EcolC_3175 |
Symbol | |
ID | 6066572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3479143 |
End bp | 3481275 |
Gene Length | 2133 bp |
Protein Length | 710 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641602591 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001726125 |
Protein GI | 170021171 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
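The coordinate, gene length, and protein length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the Start bp and End bp fields are 1-based and inclusive (the record does not state its convention) and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3479143
end_bp = 3481275
gene_length_bp = 2133
protein_length_aa = 710


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the 2133 bp span corresponds to 711 codons, one of which is the stop codon, which agrees with the listed 710 aa.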
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.659876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAT CAATAGTGGA AAATGGAAAA AAACGGGAAG AGACCGTCGA CGTTTCCAGA TTTGCCAAAG AAATTAAAAA GAACGCCTGT AAGATTGTAC TTGCGGGAAT TATCAGCGGT GCAGTTGCCT ACCCATTAAT CAGCATGCTG TCATCAAAAT ATGTCTCAAC AGCTACGGTG TTGCTAAAGG CTCAGGCTGA TAACGTTTCG CCGTTCCCAC AGGTGGAAGA TTTTGATTCC ACGCGCACCG GCTACTATGA GACGCAATAT GCCTTGATGC AGTCGCGTAT TGTTCTGGAG AAAGCGGTTC GCGAGTTAAA GCTGGATCAA AACCCAGACT TTATTGGCAA AAAAGCGGAT GAAAAGGCCA GCAACAGCGA AGATGCTGAA CAGCAGCGCA TTGAGCGCGC GCTGAACACG CTGCAAAAAA ATCTTACCGT TAGCGGTATT CGAACCACTA ATCTGGCGAC AGTCTCTTAT GAGTCGACAT CGCCACAACT CTCCTCTGAG ATTGCCAACG GCGTCGCACA GGCGTTTATC GATTATACGT TGGACCAAAA GCGGCTGAAG ACAGAAAAAG CCAGAGAAGT AAACCTTCAG AAAATGGAGG AAGTGCAGAA AGAGATCGCG CAGCAGAAAG CCGATATCGA TAACTTCCTG GCGAAAGAGG GCTTATTAAC GTTCCGCGGC ATTGATGGCT TCGAAACCGA GCAACTCAGC ATTGTTACCA ACCGTCTGGC CGATGCTACC CAACGGCGTA TTGCGGCAGA ATCTCTGGAA AAAGCCGTCA GCGCTGGGGG CCGGGTCTCT CTGGATAACA TCATTTCATT ACCGACGATC TCTAACCATG CGCAAATTCA GGATTTGCGT ATCGCCATGA TTCAGGCGCA GCGGTCTTTG TATGAGTTAC AAAAATCATA TGGCCCGAAA CATGCGAAGA TCCTGGAAGC GCAGGCTCAG GTGAAGGCTA TTCAGGATCA GATGGGCGTG GTGCTCAGTG AGCTTAAAAA AGGCATTCAT CAGCAATATC TGGCCGCGCT GGCGGATGAA AAGGATTATC AGGCGCAACT TGATCAACAG AAAGAAATTT TTCAGAAACT GGCTGAAAAA CGCAGCCTGT ATAACAGCCA GAAATTGTCA CTGGATAAAC TGGAAGATCT TTATAAAACC CTGTATCAGC GGACCCAGGA ACTGTCTCTG TCCGGCATTA ATGCGGATGC AGTGCTGTAC GATCCGGCTG TCCCGGCAGT GAAGCCATCT AAGCCAAATA AAGCGTTACT GTTAGTGATG GTGGTGGCGC TGGCCATGGC CTTCTTCTTT ATGTACGTCA TTGTAAAAGC GGCGATGGAT AATTCCATCA GGACGCTCGG ACAAGTGACA AAACGACTGG GCGTCGTCTC ACTGGGTGAG ATCCGCCGCA TTGCGGGGGC CGGGGACCGT GCACAGGTTC GCGATTTGAT CACGCGAAAC CCCTTGAACG CCGACATTAT CCACAGCATT CGTACACAGA TTTTGTTGGA TAACCGCCCG CAGCAGGTTC TGGCAATCTC CTCTGCAAAG CAGGGTGAGG GGCGCTCTTT ACTGGCCAGT CTGCTGGCAA ACTCCTTCAG CTTTGATCAG AAAACCTTAC TGCTTGATTT GGATTTCTTT AACCGTGATG GCCTGTCCGC CGAGTTTTCA ACATCGACCT CTGCGGGAGT TGCAGAGCTG TTGCGTGGAG AAGTGACACT TGACGCTGCG CGGATCACGC TTAGTGACAC GCTGGACTTT TTACCCCGCG GAAAAGCGAA CGCTTCGTCT 
TTGCTGATGC TGTCTTCGGA ACGTTTTGAA CCTCTCATTC GTGACCTGCG AAATCGCTAC CAGCGGATCA TCGTCGATGT CTCTGCGGTG AGCCAGAGTC AGGACATCGA GCTGATTAGT CGGGTGGTTG ATGGTGTGGT TTTCGTTGTG CAAGCGGGGG CTGCGTCCGT GGAGACGCTG CGCGCGGCGC TGGCGAAAGT TGACGCCAAC CAGGAAGTGG TCATGGGAGC GGTACTCAAT CTGGTTGAGG AAAAAAATCT GCAGACGAAA GAGAGTCTTC GCTCGCTCAA TATCACTACT GACGAATTGA TGAATACCAC AGGTCGGTTA TGA
|
Protein sequence | MKLSIVENGK KREETVDVSR FAKEIKKNAC KIVLAGIISG AVAYPLISML SSKYVSTATV LLKAQADNVS PFPQVEDFDS TRTGYYETQY ALMQSRIVLE KAVRELKLDQ NPDFIGKKAD EKASNSEDAE QQRIERALNT LQKNLTVSGI RTTNLATVSY ESTSPQLSSE IANGVAQAFI DYTLDQKRLK TEKAREVNLQ KMEEVQKEIA QQKADIDNFL AKEGLLTFRG IDGFETEQLS IVTNRLADAT QRRIAAESLE KAVSAGGRVS LDNIISLPTI SNHAQIQDLR IAMIQAQRSL YELQKSYGPK HAKILEAQAQ VKAIQDQMGV VLSELKKGIH QQYLAALADE KDYQAQLDQQ KEIFQKLAEK RSLYNSQKLS LDKLEDLYKT LYQRTQELSL SGINADAVLY DPAVPAVKPS KPNKALLLVM VVALAMAFFF MYVIVKAAMD NSIRTLGQVT KRLGVVSLGE IRRIAGAGDR AQVRDLITRN PLNADIIHSI RTQILLDNRP QQVLAISSAK QGEGRSLLAS LLANSFSFDQ KTLLLDLDFF NRDGLSAEFS TSTSAGVAEL LRGEVTLDAA RITLSDTLDF LPRGKANASS LLMLSSERFE PLIRDLRNRY QRIIVDVSAV SQSQDIELIS RVVDGVVFVV QAGAASVETL RAALAKVDAN QEVVMGAVLN LVEEKNLQTK ESLRSLNITT DELMNTTGRL
|
| |
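The gene and protein sequences above are printed in space-separated blocks of ten characters and wrapped across lines. A minimal sketch of how such text might be normalized before further analysis follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first three blocks of the gene sequence above, not the full record.

```python
def clean_sequence(text):
    """Remove spaces, tabs, and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the gene sequence listed above.
sample = "ATGAAATTAT CAATAGTGGA AAATGGAAAA"
cleaned = clean_sequence(sample)

print(len(cleaned))              # number of bases after removing the spacing
print(cleaned.startswith("ATG")) # the CDS begins with an ATG start codon
print(gc_fraction(cleaned))      # GC fraction of this fragment only
```

Applying the same two functions to the full gene sequence would give a value to compare with the GC content field, which the record reports rounded to a whole percent.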