Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4885 |
Symbol | |
ID | 6146100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5002090 |
End bp | 5003046 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619689 |
Product | hypothetical protein |
Protein accession | YP_001746796 |
Protein GI | 170679958 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.287711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACT TCACGACCAG CACGCCGCAT GACGCATTAT TTAAATCTTT TCTCACCCAC CCTGACACCG CGCGGGATTT TATGGAGATC CACTTACCCA AAGATTTACG TGAACTGTGC GATCTCGACA GCTTAAAACT GGAATACGCC AGCTTTGTCG ATGAAAAATT GCGGGCGCTA CATTCCGATA TTCTGTGGTC GGTAAAGACC CGCGAAGGAG ATGGCTATAT TTATGTGGTG ATTGAACATC AGAGCCGCGA GGATATCCAT ATGGCCTTTC GCCTGATGCG ATATTCCATG GCGGTGATGC AGCGCCATAT CGAGCATGAT AAACGCCGGC CGCTACCGCT GGTCATCCCG ATGCTGTTTT ATCACAGTAG CCGTAGTCCT TACCCCTGGT CTTTGTGCTG GCTGGATGAA TTTGCCGACC CGGCTACCGC GCGGAAGCTT TATACCGCAG CGTTCCCGCT GGTGGATGTC ACTGTCGTGC CAGACGACGA GATTGTGCAG CATCGCAGAG TCGCCCTGCT GGAGTTGATC CAAAAGCATA TTCGCCAGCG CGATCTAATG GGGCTTATTG ATCAACTGGC AGTATTACTG GTTACAGGGT GCGCTAATGA CAGCCAGATA ACCGCGCTGT TAAATTACAT TTTACTGACT GGCGATGAAG CGCGTTTTAA TGAGTTTATC AGCGAACTTA TCCGTCGAAT GCCACAACAC AGGGAGCGAA TAATGACGAT TGCAGAGCGA ATTCATAATG ATGGATGGCT GTTGGGAAGG GAGAGGGGGA GGAAAGAAGG GAAAGAAGAA GGGGAAAAGA GCCTCCTCCG ATTGTTGTTG CAGAATGGGG CGGATCCTGA ATGGATACAA CGATATACCG GACTTTCGGC AGAGCAAATG CAGGCATTAG AGCAGCCCTT GCCTGAAAGC AAGCGCGATC CATGGATCGA GTACTAA
|
Protein sequence | MTNFTTSTPH DALFKSFLTH PDTARDFMEI HLPKDLRELC DLDSLKLEYA SFVDEKLRAL HSDILWSVKT REGDGYIYVV IEHQSREDIH MAFRLMRYSM AVMQRHIEHD KRRPLPLVIP MLFYHSSRSP YPWSLCWLDE FADPATARKL YTAAFPLVDV TVVPDDEIVQ HRRVALLELI QKHIRQRDLM GLIDQLAVLL VTGCANDSQI TALLNYILLT GDEARFNEFI SELIRRMPQH RERIMTIAER IHNDGWLLGR ERGRKEGKEE GEKSLLRLLL QNGADPEWIQ RYTGLSAEQM QALEQPLPES KRDPWIEY
|
| |