Gene Information |
Locus tag | EcSMS35_2521 |
Symbol | |
ID | 6147487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2578843 |
End bp | 2579988 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641617393 |
Product | hypothetical protein |
Protein accession | YP_001744564 |
Protein GI | 170680633 |
COG category | [C] Energy production and conversion |
COG ID | [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
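The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2578843
END_BP = 2579988


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1146, matching "Gene Length"
print(protein_length(length_bp))  # 381, matching "Protein Length"
```

Under these assumptions both derived values agree with the table. The strand field does not affect the length calculation.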
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACAAATA ATGAAAGCAA AGGGCCGTTT GAAGGCTTAT TAGTTATCGA TATGACGCAT GTCCTTAATG GGCCTTTCGG AACTCAACTT CTTTGTAATA TGGGCGCAAG GGTAATTAAA GTTGAGCCGC CAGGTCATGG TGATGATACC CGCACATTTG GTCCCTATGT GGATGGACAG TCACTCTATT ACAGTTTTAT TAATCATGGC AAAGAGAGTG TGGTTCTTGA TTTAAAGAAT GATCACGATA AAAGTATATT TATAAATATG CTTAAACAAG CTGATGTATT AGCTGAGAAT TTTCGCCCAG GTACAATGGA AAAACTGGGG TTTTCATGGG AAAGGTTACA AGAAATCAAC CCGCGCCTTA TATATGCTTC ATCGTCAGGT TTCGGACATA CCGGTCCGCT AAAAGATGCT CCTGCCTACG ATACCATCAT TCAGGCAATG AGCGGGATAA TGATGGAAAC AGGATACCCT GATGCTCCGC CAGTGCGCGT TGGTACCTCT CTTGCGGATC TATGTGGTGG TGTTTATTTA TTCAGCGGAA TAGTGAGTGC ACTTTATGGC CGCGAAAAGA GCCAGAGAGG TGCGCATGTC GATATAGCGA TGTTTGATGC CACGCTGAGT TTTCTGGAGC ATGGACTGAT GGCATATATC GCGACTGGGA AGTCACCACA ACGCCTGGGA AATCGCCATC CCTACATGGC ACCTTTTGAT GTTTTTGATA CTCAGGATAA GCCGATTACA ATTTGTTGTG GTAATGACAA GCTTTTTTCT GCGTTATGCC AGGCACTGGA GCTTACAGAA CTGGTTAATG ATCCCCGATT TAGCAGCAAT ATTTTACGCG TACAAAACCA GGCTATTCTT AAACAATATA TTGAGCGAAC GTTAAAAACG CAGGCAGCTG AAGTTTGGTT AGCCAGAATA CATGAAGTTG GTGTACCCGT CGCGCCGTTA TTAAGTGTGG CTGAGGCCAT TAATTTGCCA CAAACTCAGG CGAGAAATAT GTTGATTGAA GCCGGAGGAA TAATGATGCC AGGTAATCCG ATAAAAATCA GCGGCTGCGC GGACCCGCAT GTTATGCCGG GAGCGGCAAC GCTCGACCAG CATGGGGAAC AAATTCGCCA GGAGTTCTCA TCATAA
Protein sequence | MTNNESKGPF EGLLVIDMTH VLNGPFGTQL LCNMGARVIK VEPPGHGDDT RTFGPYVDGQ SLYYSFINHG KESVVLDLKN DHDKSIFINM LKQADVLAEN FRPGTMEKLG FSWERLQEIN PRLIYASSSG FGHTGPLKDA PAYDTIIQAM SGIMMETGYP DAPPVRVGTS LADLCGGVYL FSGIVSALYG REKSQRGAHV DIAMFDATLS FLEHGLMAYI ATGKSPQRLG NRHPYMAPFD VFDTQDKPIT ICCGNDKLFS ALCQALELTE LVNDPRFSSN ILRVQNQAIL KQYIERTLKT QAAEVWLARI HEVGVPVAPL LSVAEAINLP QTQARNMLIE AGGIMMPGNP IKISGCADPH VMPGAATLDQ HGEQIRQEFS S
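The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is the standard amino-acid assignment (which translation table 11 shares for internal codons), and the code assumes the sequence is given in coding orientation, as it appears in this record.

```python
# Codon table built from the conventional TCAG ordering.
# Table 11 differs from the standard code only in its set of permitted
# start codons, which does not matter here because this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGACAAATA ATGAAAGCAA AGGGCCGTTT"
print(translate(prefix))  # MTNNESKGPF, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.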