Gene Information |
Locus tag | EcE24377A_2214 |
Symbol | |
ID | 5588034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2180371 |
End bp | 2181519 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640925882 |
Product | hypothetical protein |
Protein accession | YP_001463282 |
Protein GI | 157158614 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
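The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon. The record states neither, and the helper names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp, end_bp = 2180371, 2181519
gene_length_bp = 1149
protein_length_aa = 382


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def residues_from_cds(length_bp):
    """Residues encoded by an unspliced CDS, assuming the length includes one stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(start_bp, end_bp) == gene_length_bp)          # True
print(residues_from_cds(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions the fields agree: the span covers 1149 bp, which is 383 codons, or 382 residues plus a stop codon.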
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAA ATAAAATTTA CGCTGTGTTA ACTGATCGTG GCGCGCAGTT AGAAGCTGCG GCGCTGGCGT CAGGAGTGCC GGTACTGCTA AATAAATTCG TTATTGGTGA CGCGAACGGA AACGACGACG TAACGCCAGA CCCGGCCCGA ACGGCATTAA TTCACGAGAC GTATCGCGGA GATATTAAAT CGTCAGAAAA TAGCGGTAAT CAAGTCATTT TTACACTATA CGTACCACCG GAAACCGGCG GCTATACTAT CCGTGAGGTG GGAATATTAA CCGACAAAGG TGAACTGTAC TCTGTAGCGC GTTCGCCGGA TATTTTAAAA CCTACGGACA GTAACGGCGC ACTGATTTCA ATCACGTATA AATACACCCT CGCGGTGTCC AGCACATCTA CTGTTAACGT AGTTATTGAT AACAGTAGCG GAATGAACCA GGCAGATGCT GATAAGCGCT ATTTGCAGAT AAGCAAAAAT TTATCTGAAA TTAAAAATAA GGGCGAATCC GCTCAACGAG CAGGGCGGGA GAATCTCGGT ATTGATTTAG ACGATTATTA CGATAAAACT GAGATTGATA GCAAATTTAC TGATATTGAT GAAGATATTA ATAATATAAA TAAAACAAAA CCTGTTCTTA CAGTAAATAA TATACAGCCT GATGCTACTG GGAATGTAAA TACAGGTTCC GGATTTGCAA AACCAAACGG CGATGGGGCG TTTAATCTGG TTATGTTATA CGGCGGTGAC CACGTCAGTG TTACTCCTGC GATGACAATC GTAACAGGTT ATGATGTATC CCCGTACGCG ATAAATCCAA CCAATGTAAA CGCCGATGTG GAAACTTATT TGTGCGGTGC GTGGATGACG TTAGCGGCAA CGAGTGGAAA TGCGTTAGTG ATGGCGCAGC GAATTCCCAT TGGGAATATA TCAAAAATGT TAAATATACG AGATCCGTAT TACCCGAATA AACACGGAAC CGTTAACTCA TATGATCACA ATTATATCTG CGTTAAATGT AATATTGAAG GAATAGATAA TGACATCATA TTTACGTCAA ATCTGAAAGA CGTTGAGGAA TATGGTGTGC AGGTTTTCCA GAACGCCAAA AATGGTATTT ACGGCACCGT TATAGACGAA GGTCATTGA
|
Protein sequence | MAENKIYAVL TDRGAQLEAA ALASGVPVLL NKFVIGDANG NDDVTPDPAR TALIHETYRG DIKSSENSGN QVIFTLYVPP ETGGYTIREV GILTDKGELY SVARSPDILK PTDSNGALIS ITYKYTLAVS STSTVNVVID NSSGMNQADA DKRYLQISKN LSEIKNKGES AQRAGRENLG IDLDDYYDKT EIDSKFTDID EDINNINKTK PVLTVNNIQP DATGNVNTGS GFAKPNGDGA FNLVMLYGGD HVSVTPAMTI VTGYDVSPYA INPTNVNADV ETYLCGAWMT LAATSGNALV MAQRIPIGNI SKMLNIRDPY YPNKHGTVNS YDHNYICVKC NIEGIDNDII FTSNLKDVEE YGVQVFQNAK NGIYGTVIDE GH
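The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch with hypothetical helper names, not the method the database itself uses. It strips the spacing, computes a GC percentage, and performs a rough CDS sanity check. The stop codons are those of translation table 11. Requiring an ATG start is a simplification, because table 11 also permits alternative start codons.

```python
def clean_sequence(text):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq, stops=("TAA", "TAG", "TGA")):
    """Rough check: whole codons, ATG start (a simplification), terminal stop codon."""
    seq = clean_sequence(seq)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First two blocks of the gene sequence above, used only as a short demonstration.
print(gc_percent("ATGGCTGAAA ATAAAATTTA"))  # 20.0
print(looks_like_cds("ATG GCT TGA"))        # True
```

Passing the full Gene sequence value to `gc_percent` should give a figure comparable to the GC content field, provided the database computes that field over the same span.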