Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0846 |
Symbol | |
ID | 6146036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 849475 |
End bp | 850740 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615734 |
Product | hypothetical protein |
Protein accession | YP_001742926 |
Protein GI | 170683552 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTTTTACCAG CGACACATTG CCTGCCGATC ACAAAGCAGC TATCCGTCAG ATGAAGCACG CGCTGCGGGC GCAGCTTGGC GACGTCCAGC AGATCTTTAA TCAGCTAAGC GATGACATTG CCACGCGAGT GGCTGAAATC AACGCACTCA AAGCACAGGG CGATGCAGTC TGGCCGGTGC TGTCTTATGC CGATATCAAA GCAGGTCATG TTACTGCAGA GCAACGCGAA CAGATTAAAC GTCGCGGTTG TGCGGTGATA AAAGGCCATT TCCCCCGCGA ACAAGCGCTA GGCTGGGATC AGTCGATGCT GGACTATCTG GACCGCAACC GCTTTGATGA GGTCTACAAA GGCCCCGGCG ATAATTTCTT CGGGACGCTC AGCGCTTCAC GTCCCGAGAT TTACCCCATC TACTGGTCGC AGGCGCAAAT GCAGGCCCGC CAGAGTGAAG AAATGGCGAA TGCGCAGTCG TTTCTCAATC GTCTGTGGAC ATTTGAAAGT GATGGAAAGC AATGGTTTAA CCCGGATGTG AGCGTCATCT ACCCTGACCG TATCCGCCGC CGTCCGCCCG GAACGGCCTC CAAAGGTCTC GGAGCGCATA CCGACTCCGG GGCACTGGAA CGCTGGCTGC TTCCAGCGTA TCAGCGCGTT TTCGCCAACG TCTTTAATGG CAATCTGGCG CAATATGATC CCTGGCATGC GGCACATCGT ACAGAAGTTG AAGAGTACAC GGTGGACAAC ACCACCAAAT GTTCCGTGTT TCGGACATTC CAGGGCTGGA CAGCGCTCTC TGATATGCTG CCTGGTCAGG GACTGCTGCA CGTCGTGCCC ATTCCTGAAG CCATAGCGTA CGTGCTGTTA CGTCCGCTGC TTGATGATGT GCCGGAGGAT GAACTGTGCG GCGTAGCGCC CGGAAGAGTG TTGCCGGTAT CAGAGCAATG GCATCCACTG TTGATTGAGG CATTAACCAG CATTCCACAA CTCGAAGCCG GAGACTCCGT CTGGTGGCAC TGCGACGTCA TCCATTCCGT TGCCCCCGTT GAAAATCAAC AGGGCTGGGG CAACGTGATG TACATTCCTG CGGCACCGAT GTGCGAGAAA AATCTTGCCT ACGCGCACAA GGTGAAGGCC GCACTGGAAA AAGGCGCATC ACCTGGCGAC TTCCCGCGGG AAGATTATGA AGCAAACTGG GAAGGCCGCT TTACGCTGGA GGATCTGAAC ATTCACGGTA AGCGCGCACT GGGCATGGAT GTTTGA
|
Protein sequence | MASTFTSDTL PADHKAAIRQ MKHALRAQLG DVQQIFNQLS DDIATRVAEI NALKAQGDAV WPVLSYADIK AGHVTAEQRE QIKRRGCAVI KGHFPREQAL GWDQSMLDYL DRNRFDEVYK GPGDNFFGTL SASRPEIYPI YWSQAQMQAR QSEEMANAQS FLNRLWTFES DGKQWFNPDV SVIYPDRIRR RPPGTASKGL GAHTDSGALE RWLLPAYQRV FANVFNGNLA QYDPWHAAHR TEVEEYTVDN TTKCSVFRTF QGWTALSDML PGQGLLHVVP IPEAIAYVLL RPLLDDVPED ELCGVAPGRV LPVSEQWHPL LIEALTSIPQ LEAGDSVWWH CDVIHSVAPV ENQQGWGNVM YIPAAPMCEK NLAYAHKVKA ALEKGASPGD FPREDYEANW EGRFTLEDLN IHGKRALGMD V
|
| |