Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1963 |
Symbol | |
ID | 6068321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2167956 |
End bp | 2169560 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641601375 |
Product | hypothetical protein |
Protein accession | YP_001724936 |
Protein GI | 170019982 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000030745 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCTATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT CTACAGCAAC AAACTCCACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT GAAATACCGC CGATTAATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGCCAGCCAT CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGATTAG TAGACCAGGC ACGAAAGCAA AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT GGCGTCATGC TCGCTACAAA TCAGGGTTTA CCCAGAGAGA CGTTTGATTT AGCGGTGATT GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCCAGCCCG TGGTCAGGCC TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATCACGTTG ATGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC TTACACATTG TCACCGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA AGAGGGCCTT TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGC CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG CGCAAACAAC GCGACCCATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC CGGGTTTTTA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG GCTGCTGGCC TTACGTGAAG CGGGGATCAT TCATATTCTC GCCCTCGGTG AAGACTACGA AATGGAAATT AATGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT GATGCCCGCG GACAGCGTCC GCTTAAAGTG AAAGATATTC CTTTCCCTGG GCTACGCGAG CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGCG AAGATTATAC GTTACAGCAA CCCGAAGATA TTCGTGGACG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC GTAAAACCTG CATCCCGTGC ACGTCGGCGT CTTTCGTTTG ATTAA
|
Protein sequence | MKKIAIVGAG PTGIYTLFSL LQQQTPLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI EIPPINCTYL EWLQKQEASH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARKQ KFAVAVYESC QVTDLQITNA GVMLATNQGL PRETFDLAVI ATGHVWPDEE EATRTYFPSP WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL MSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEEGL LDRVFRLIVE EIKFADPDWS QRIALESLNV DSFAQAWFAE RKQRDPFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYEMEI NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD
|
| |