Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3909 |
Symbol | |
ID | 6145085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3979336 |
End bp | 3981306 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618735 |
Product | hypothetical protein |
Protein accession | YP_001745874 |
Protein GI | 170681898 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.921384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTT CGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGTCAG TACCAACAAC TGGTCCGCGA TGTGGTGATT CCTTATCAGT GGGACGCCTT GAACGATCGT ATCCCAGAAG CGGAACCCAG CCATGCGATT GAAAACTTTC GCATTGCCGC AGGACTGCAA GACGGTGAAT TTTACGGGAT GGTGTTTCAG GACAGCGACG TCGCCAAATG GCTGGAAGCG GTAGCCTGGT CGCTGTGCCA GAAGCCGGAC GCCGAACTGG AAAAAACCGC CGACGAGGTG ATTGAACTGA TCGCCTCCGC CCAGTGTGAA GATGGCTATC TCAATACTTA CTTTACGGTA AAAGCGCCCG AAGAACGCTG GAGCAATCTG GCGGAGTGTC ATGAACTTTA CTGCGCGGGT CATCTGATTG AAGCCGGAGT CGCCTTCTTC CAGGCTACGG GCAAGCGGCG CTTGCTGGAA GTGGTTTGCC GTCTGACCGA TCATATCGAC AGCGTATTTG GTCCAGATGA AAGTAAGTTA CACGGTTATC CTGGTCACCC GGAAATTGAA CTGGCACTAA TGCGCCTGTA TGAAGTGACC GAAGAGCCGC GCTACCTGGC GCTGACGAAC TATTTTGTCG AACAGCGTGG TGCGCAACCG CACTATTACG ACCAGGAATA TGAAAAGCGC GGGCAGACCT CGCACTGGCA CACCTACGGC CCGGCGTGGA TGGTGAAAGA CAAAGCCTAC AGCCAGGCAC ATTTGCCCAT CGCACAGCAG CAAACCGCCA TTGGTCACGC GGTACGTTTT GTCTACCTGA TGACCGGCGT CGCGCATCTC GCGCGTTTAA GTCACGATGA AAGCAAGCGT CAGGATTGCT TGCGGCTGTG GAACAATATG GCCCAGCGTC AGTTATATAT TACCGGCGGC ATTGGCTCAC AGAGCAGCGG TGAAGCGTTC AGCAGCGATT ACGATCTGCC GAATGACACG GTATACGCCG AAAGCTGTGC TTCCATCGGC CTGATGATGT TCGCCCGGCG AATGCTGGAA ATGGAAGGCG ACAGTCAATA TGCCGATGTG ATGGAGCGCG CACTGTACAA CACCGTGCTC GGCGGCATGG CGCTGGATGG CAAACATTTC TTCTATGTGA ATCCGCTGGA AGTACATCCA AAATCGCTGA AATTCAACCA TATCTACGAT CACGTTAAGC CGATCCGCCA GCGTTGGTTT GGTTGCGCTT GTTGTCCGCC AAATATCGCC CGCGTGCTAA CCTCGATTGG TCATTATCTC TACACGCCGC GTGAAGATGC GTTGTATATC AACATATACG CAGGAAACAG CATGGAAGTG CCGGTAGAAA ATGGCACGTT GCGCCTGCGG GTTAGCGGAA ACTATCCGTG GCAGGAACAG GTGACAATTG CGGTTGAATC GCCCCAGCCG GTGCGTCATA CGCTGGCTTT ACGTCTGCCG GACTGGTGCA CACAGCCGCA GATTACATTG AATGGGGAAG AGGTCGAGCA GGATATTCGT AAAGGGTATT TGCACATTAC CCGCGAATGG CAGGAGGGCG ACACGCTGAA TCTGACGTTG CCAATGCCGG TACGTCGCGT TTACGGTAAC CCGCTGGTGC GTCACGTCGC CGGAAAAGTG GCGATTCAGC GCGGCCCGCT GGTGTATTGC CTGGAACAGG CCGACAACGG CGAGTCACTG CATAACCTGT GGCTGCCCGC CGATGCGCCA TTTACGACAT TTGAAGGCAA GGGATTGTTC CGCCATAAGA TCTTAATCCA GGCACCGGGT TACCGGTATG AACAGAGCAA TCCAGAGCAG CAACCGCTGT GGCATTACGA CTATGCCCCA GCCAAACGCC AGCCGCAAAC TCTGACATTT ATCCCGTGGT TTAGCTGGGC TAACCGGGGT GAAGGCGAAA TGCGGATCTG GGTGAATGAG GAAAAGCATT GCCATCCGTA G
|
Protein sequence | MNISEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGLQ DGEFYGMVFQ DSDVAKWLEA VAWSLCQKPD AELEKTADEV IELIASAQCE DGYLNTYFTV KAPEERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLE VVCRLTDHID SVFGPDESKL HGYPGHPEIE LALMRLYEVT EEPRYLALTN YFVEQRGAQP HYYDQEYEKR GQTSHWHTYG PAWMVKDKAY SQAHLPIAQQ QTAIGHAVRF VYLMTGVAHL ARLSHDESKR QDCLRLWNNM AQRQLYITGG IGSQSSGEAF SSDYDLPNDT VYAESCASIG LMMFARRMLE MEGDSQYADV MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA RVLTSIGHYL YTPREDALYI NIYAGNSMEV PVENGTLRLR VSGNYPWQEQ VTIAVESPQP VRHTLALRLP DWCTQPQITL NGEEVEQDIR KGYLHITREW QEGDTLNLTL PMPVRRVYGN PLVRHVAGKV AIQRGPLVYC LEQADNGESL HNLWLPADAP FTTFEGKGLF RHKILIQAPG YRYEQSNPEQ QPLWHYDYAP AKRQPQTLTF IPWFSWANRG EGEMRIWVNE EKHCHP
|
| |