Gene Information |
Locus tag | EcSMS35_1209 |
Symbol | |
ID | 6147283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1214150 |
End bp | 1215190 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641616087 |
Product | hypothetical protein |
Protein accession | YP_001743270 |
Protein GI | 170682042 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
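The length fields above agree with the coordinates if the start and end positions are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the helper names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Start bp and End bp as listed for EcSMS35_1209.
length = gene_length_bp(1214150, 1215190)
print(length, protein_length_aa(length))  # 1041 346
```

Both values match the Gene Length (1041 bp) and Protein Length (346 aa) rows.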
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.852584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.278976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACAT CGACAGTAAT TCCTGAAGAC ATCAAAACGC TAAAATCCGA CGTTAGCAAA TTAAAAAACG ATCAAGGAAG CTACGCAACA AAATCATATG TAGACAATAA AACAACATGG AATAGTTATT GCAATGTAAT CTATGATCAA AAGACATTGC CGACCACTGG AACTATATTT AGCGGTAAGA TTCACTTGTC AAATAAGACA GGAGAAACCG AAAACGCTTA TAGTGAGATG TACACTAGAA AAAATATTGA CGGTACAAAA GATACAATGA CAAGGATTGT CACACACAAT GGAACAAAAG GTATCTTTTG GGATTTTAGC GATCTTTACG GTGGAACATT AATTTTTCCA GGCAGTGATG GTTACCTTAA GATGGGGAAC TGTCTCATGT CGTATGGTGT GCGGGGAAGT AACGCGCTTA TTAAGTTTGA CTGCACAGAC ACATTACAAA TCAAATATGC CAATCATGGG TCAACCATGA CAATCAACAC ACAGGGAACC GCTCATTCTG GCGCTACTAC TAGTTTGTGG GGTAACTCTA CCCGTCCGGT TGTATATGAA GTTGGTGCTG ATGGTGGCGC TTATATGTTC TATGCGCAGA AAAATACCGA TAACACCTAT ATGTTAAGCG TTAATGGTGC ATGTCATGCC ACCGCATTTA ACCAGCATTC CGACCGGGAT CTGAAAGACA ACATTCAGGT GATCGATAAT GCAACCGACC GCATCCGTAA AATGAACGGC TATACATACA CGCTTAAAGA AAACGGTATG CCCTATGCTG GTGTCATTGC ACAGGAAGCT CTGGAAGCAA TCCCAGAAGT TGTAGGTTCC GCAATGAAAT ATCAGGACGG TGCAAGCGGA TCGGAAGGTG AAGAAGGTGA ACGTTATTAC ACAGTAGATT ATTCTGGTGT TACTGGCTTG CTTGTTCAGG TAGCCAGAGA GTCAGACGAC AGAATAACAG CACTGGAAGA AGAAAACGCA GAATTAAGAC AAAGATTATC TGCAATTGAG GCGGCGCTTG CGTCTAAATA A
|
Protein sequence | MATSTVIPED IKTLKSDVSK LKNDQGSYAT KSYVDNKTTW NSYCNVIYDQ KTLPTTGTIF SGKIHLSNKT GETENAYSEM YTRKNIDGTK DTMTRIVTHN GTKGIFWDFS DLYGGTLIFP GSDGYLKMGN CLMSYGVRGS NALIKFDCTD TLQIKYANHG STMTINTQGT AHSGATTSLW GNSTRPVVYE VGADGGAYMF YAQKNTDNTY MLSVNGACHA TAFNQHSDRD LKDNIQVIDN ATDRIRKMNG YTYTLKENGM PYAGVIAQEA LEAIPEVVGS AMKYQDGASG SEGEEGERYY TVDYSGVTGL LVQVARESDD RITALEEENA ELRQRLSAIE AALASK
|
| |
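The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the database's own method: the function names are hypothetical, and the codon map is the standard amino acid assignment, which translation table 11 shares (table 11 differs in which start codons it permits).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
prefix = clean_sequence("ATGGCTACAT CGACAGTAAT TCCTGAAGAC")
print(translate(prefix))  # MATSTVIPED
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.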