Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4737 |
Symbol | |
ID | 6143414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4834680 |
End bp | 4835903 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641619552 |
Product | hypothetical protein |
Protein accession | YP_001746660 |
Protein GI | 170682446 |
COG category | [S] Function unknown |
COG ID | [COG4269] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.243262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.802179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTT TAATTCAGGG AATTTTTATG GCTCAAGTTA TTAATGATAT GGATGTTCCG TCCCATTCGT TTGTTTTTCA TGGTACAGGT GAGAGATATT TTCTTATTTG TGTGGTGAAT GTGTTGTTAA CGATTATAAC GCTAGGTATC TATTTACCAT GGGCATTAAT GAAATGTAAG CGTTATCTCT ATGCTAATAT GGAAGTTAAC GGACAACGAT TTTCTTATGG GATTACCGGT GGGAATGTTT TTGTTAGTTG TCTTGTTTTT GTTTTTTTCT ATTTCGCAAT CTTAATGACA GTGTCAGCAG ATATGCCGCT TATTGGCTGT GTTTTGACTT TGTCACTGTT GGTTTTGCTT ATATTTATGG CAGCAAAAGG ACTGCGTTAT CAGGCCTTGA TGACCAGTCT CAACGGCGTA AGATTTAGTT TTAATTGCTC TATGAAAGGG TTCTGGTGGG TGACCTTTTT CTTGCCGATT TTAATGGCCC TTGGGATGGG GACTGTTTTC TTTATCTCGA CAAAGATGCT ACATGCCAAT AGTTCAAGTA GCGTTATTAT ATCTGTGGTT CTGATGACAA TAGTTGGTAT TGTTTCCATT GGTATTTTTA ATGGCACTTT ATATAGTCTG GTAATGAGTT TTCTCTGGAG TAATACCAGT TTCGGTATAC ACCGTTTCAA GGTGAAATTA GATACTACAT ATTGTATAAA ATATGCCATT CTCGCATTTT TAGCTTTATT ACCTTTTCTC GCTGTTGCTG GTTATATTAT CTTCGATCAA ATATTAAATG AGTATGATAG TTCTGGGTAT GCAAATGATA ATATTGAGAA TTTACAGCAA TTTATGGAAA TGCAACGTAA AATGATAATC GCGCAGTTAA TCTATTATTT TGGGATTGCT GTTAGCACCA GTTATTTAAC GGTGTCGTTG CGAAATCATT TTATGAGCAA CCTGTCACTG AATGATGGGC GTATTCGTTT TCGCTCAACT TTAACGTACC ACGGTATGCT TTATCGCATG TGTGCGTTGG TGGTGATATC CGGGATTACG GGTGGTCTGG CTTACCCACT GCTGAAAATA TGGATGATTG ACTGGCAGGC AAAAAATACG TATTTGCTGG GCGATTTGGA TGACCTTCCT TTAATCAATA AAGAAGAACA ACCAGATAAA GGCTTCTTAG CCAGTATTTC ACGGGGAGTT ATGCCTTCTT TACCATTTCT GTAA
|
Protein sequence | MDFLIQGIFM AQVINDMDVP SHSFVFHGTG ERYFLICVVN VLLTIITLGI YLPWALMKCK RYLYANMEVN GQRFSYGITG GNVFVSCLVF VFFYFAILMT VSADMPLIGC VLTLSLLVLL IFMAAKGLRY QALMTSLNGV RFSFNCSMKG FWWVTFFLPI LMALGMGTVF FISTKMLHAN SSSSVIISVV LMTIVGIVSI GIFNGTLYSL VMSFLWSNTS FGIHRFKVKL DTTYCIKYAI LAFLALLPFL AVAGYIIFDQ ILNEYDSSGY ANDNIENLQQ FMEMQRKMII AQLIYYFGIA VSTSYLTVSL RNHFMSNLSL NDGRIRFRST LTYHGMLYRM CALVVISGIT GGLAYPLLKI WMIDWQAKNT YLLGDLDDLP LINKEEQPDK GFLASISRGV MPSLPFL
|
| |