Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2105 |
Symbol | |
ID | 6144740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2113758 |
End bp | 2114885 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616981 |
Product | hypothetical protein |
Protein accession | YP_001744156 |
Protein GI | 170682015 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2822] Predicted periplasmic lipoprotein involved in iron transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.258925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA ACTTCCGCCG TAACGCATTG CAGTTGAGCG TGGCTGCGCT GTTCTCTTCT GCTTTTATGG CTAACGCCGC TGATATTCCG CAAGTCAAAG TGACCGTAAC GGATAAGCAG TGCGAACCGA TGACCATTAC GGTTAACGCC GGGAAAACAC AGTTCATTAT TCAGAACCAC AGCCAGAAGG CGCTGGAGTG GGAGATCCTG AAAGGCGTGA TGGTGGTGGA AGAGCGGGAA AATATCGCCC CTGGCTTTAG CCAGAAAATG ACGGCGAATT TACAGCCTGG CGAATACGAT ATGACCTGCG GTCTGCTGAC TAACCCGAAA GGGAAGTTGA TCGTCAAAGG TGAGGCAACG GCGGATGCGG CGCAAAGTGA TGCGCTGTTA AGTCTTGGTG GTGCAATTAC TGCATATAAA GCGTATGTCA TGGCGGAAAC CACGCAGCTG GTGACCGACA CCAAAGCCTT TACCGACGCG ATTAAAGCAG GCGATATCGA AAAAGCGAAA GCACTGTATG CGCCGACGCG CCAGCACTAT GAGCGCATTG AACCGATTGC TGAACTGTTC TCCGATCTGG ATGGCAGCAT TGACGCCCGT GAAGATGATT ACGAGCAAAA AGCCGCCGAT CCAAAATTCA CCGGTTTCCA CCGTCTGGAA AAAGCATTGT TTGGCGACAA CACCACCAAA GGGATGGATA AGTACGCTGA CCAGCTTTAT ACCGATGTGG TCGATTTGCA AAAACGCATC AGTGAACTGG CTTTCCCACC TTCAAAAGTG GTCGGCGGTG CAGCCGGACT GATTGAGGAA GTGGCAGCCA GCAAAATCAG CGGTGAAGAA GATCGCTACA GCCACACCGA TCTGTGGGAT TTCCAGGCTA ACGTTGAAGG CTCGCAGAAA ATTGTCGATC TGCTGCGTCC ACAACTACAA AAAGCGAATC CGGAACTGCT GGCAAAAGTC GATGCCAATT TCAAAAAGGT CGATACCATT CTGGCGAAAT ACCGTACTAA AGACGGTTTT GAAACCTACG ACAAATTGAC CGATGCTGAC CGTAATGCGC TGAAAGGGCC GATTACTGCG CTGGCGGAAG ATCTGGCGCA ACTTCGCGGT GTGCTGGGGC TGGACTAA
|
Protein sequence | MTINFRRNAL QLSVAALFSS AFMANAADIP QVKVTVTDKQ CEPMTITVNA GKTQFIIQNH SQKALEWEIL KGVMVVEERE NIAPGFSQKM TANLQPGEYD MTCGLLTNPK GKLIVKGEAT ADAAQSDALL SLGGAITAYK AYVMAETTQL VTDTKAFTDA IKAGDIEKAK ALYAPTRQHY ERIEPIAELF SDLDGSIDAR EDDYEQKAAD PKFTGFHRLE KALFGDNTTK GMDKYADQLY TDVVDLQKRI SELAFPPSKV VGGAAGLIEE VAASKISGEE DRYSHTDLWD FQANVEGSQK IVDLLRPQLQ KANPELLAKV DANFKKVDTI LAKYRTKDGF ETYDKLTDAD RNALKGPITA LAEDLAQLRG VLGLD
|
| |