Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2408 |
Symbol | |
ID | 5588981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2388074 |
End bp | 2390353 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640926070 |
Product | hypothetical protein |
Protein accession | YP_001463465 |
Protein GI | 157155415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC CGTTAATTGT CGGCATCCGG CATCATAGTC CGGCCTGCGC CCGGCTGGTG AAATCGTTAA TCGAAAGCCA GCGGCCACGA TACGTGTTGA TTGAAGGCCC GGCTGATTTT AATGACCGGG TAGACGAACT TTTTTTTTCC CACCAGCTTC CGGTAGCTAT TTACAGTTAT TGCCAGTATC AGGACGGTGC AGCCCCCGGG CGTGGTGCCT GGACGCCATT TGCTGAATTT TCGCCGGAGT GGCAGGCGCT ACAAGCCGCA CGTCGTATTC AGGCACAAAC TTACTTCATC GATTTGCCTT GCTGGGCGCA GAGTGAAGAA GAGGACGATT CGCCTGATAC GCAAGATGAA AGCCAGGCCT TACTGCTGCG TGCCACCCGC ATGGATAACA GCGATACCCT GTGGGATCAC TTGTTCGAAG ATGAAAGCCA GCAAACTGCA TTACCCTCTG CGCTGGCGCA CTATTTTGCC CAACTGCGGG GCGATTTCCC CGGCGATGCA CTCAATCGTC AGCGCGAAGC CTTTATGGCT CGCTGGATTG CATGGGCGGT GCAGCAAAAT AATGGCGACG TGTTAGTCGT CTGCGGTGGC TGGCACGCTC CGGCACTGGC AAAAATGTGG CGCGAATGCC CGCAGGACAT TAACACGCCG GAATTGCCCT CGCTGGCAGA TGCCATTACA GGTTGTTATC TCACGCCCTA CAGTGAAAAG CGCCTTGATG TGCTGGCAGG ATACCTTTCC GGTATGCCTG CCCCGGTCTG GCAAAACTGG TGCTGGCAGT GGGGCTTACA GCAGGCCGGT GAACAACTGC TAAAAACGGT TCTCACCCGT TTGCGCCAGC ACAACTTGCC TGCTTCGACC GCGGATATGG CTGCCGCACA TCTGCATGCA ATGGCACTGG CACAGTTGCG CGGTCATACA CTACCGTTAC GCACTGACTG GCTGGATGCC ATAGCAGGCT CGTTGATTAA AGAAGCCCTG AATGCACCGT TGCCGTGGAG CTATCGCGGC GTTATTCATC CCGATACCGA TCCGATTCTG CTAACGTTGA TAGACACATT AGCGGGTGAC GGATTCGGTA AACTTGCCCC TTCCACGCCA CAACCGCCTC TGCCAAAAGA TGTCACCTGC GAACTGGAAC GTACCGCAAT CTCTCTTCCG GCGGAGCTTA CCTTAAATCG CTTTACCCCC GATGGGCTGG CGCAAAGTCA GGTGTTACAT CGGCTGGCAA TACTGGAGAT CCCTGGGATT GTACGCCAGC AGGGAAGTAC ACTGACACTT GCAGGCAACG GTGAAGAACA CTGGAAATTA ACCCGCCCGC TTAGCCAGCA TGCGGCATTG ATTGAGGCCG CATGCTTTGG TGCAACACTT CAGGAAGCCG CACGCCATAA ATTAGAAGCC GATATGCTGG ACGCGGGTGG AATCGGCAGT ATCACCACAT GTCTTAGCCA GGCGGCGTTA GCGGGTCTGG CGTCCTTCAG TCAACAATTA CTGGAGCAAC TCACATTATT AATCGCCCAG GAAAATCAAT TTGCCGAAAT GGGCCAGGCG CTGGAAGTGC TATATGCCTT ATGGCGGCTG GATGAAATTA GCGGTATGCA AGGCGCGCAG ATATTACAAA CGACGTTATG CGCGGCTATC GATCGCACGC TGTGGCTGTG TGAATCTAAC GGCAGACCGG ATGAAAAGGA GTTTCACGCT CACCTGCATA GCTGGCAAGC GCTTTGCCAT ATTCTGCGCG ATCTACATAG CGGCGTTAAT TTATCCGGCG TTTCGCTTTC TGCGGCGGTA GCCTTACTGG AGCGACGCAG TCAGGCGATT CATGCCCCGG CGCTGGATCG CGGCGCGGTT CTTGGCGCTC TAATGCGTCT GGAACATCCC AACGCCAGTG CCGAAGCGGC GCTGACGATG CTGGCGCAGT TATCCCCGGC ACAATCCGGC GAGGCGCTGC AAGGTTTGCT GGCATTAGCC CGTCATCAAC TGGCCTGTCA GCCGACATTT ATCGCCGGTT TCAGCAGTCA TTTAAATCAA CTAAGTGATG CCGATTTTAT CAATGCCCTG CCCGATTTAC GCGCGGCGAT GGCCTGGCTA CCGCCACGAG AACGCGGAAC GCTGGCGCAT CAGGTGCTTG AGCATTATCA ACTGGCGCAA CTTCCCGTTT CGGCACTGCA AATGCCGTTG CATTGTCCAC CGCAAGCCAT TGCACATCAT CAACAACTCG AACAGCAGGC ACTGGCATCG CTGCAACACT GGGGAGTTTT CCATGTCTGA
|
Protein sequence | MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELFFS HQLPVAIYSY CQYQDGAAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE EDDSPDTQDE SQALLLRATR MDNSDTLWDH LFEDESQQTA LPSALAHYFA QLRGDFPGDA LNRQREAFMA RWIAWAVQQN NGDVLVVCGG WHAPALAKMW RECPQDINTP ELPSLADAIT GCYLTPYSEK RLDVLAGYLS GMPAPVWQNW CWQWGLQQAG EQLLKTVLTR LRQHNLPAST ADMAAAHLHA MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDTLAGD GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFTP DGLAQSQVLH RLAILEIPGI VRQQGSTLTL AGNGEEHWKL TRPLSQHAAL IEAACFGATL QEAARHKLEA DMLDAGGIGS ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWRL DEISGMQGAQ ILQTTLCAAI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LSGVSLSAAV ALLERRSQAI HAPALDRGAV LGALMRLEHP NASAEAALTM LAQLSPAQSG EALQGLLALA RHQLACQPTF IAGFSSHLNQ LSDADFINAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ LPVSALQMPL HCPPQAIAHH QQLEQQALAS LQHWGVFHV
|
| |