Gene Information |
Locus tag | EcSMS35_1033 |
Symbol | |
ID | 6144602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1054799 |
End bp | 1055779 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641615920 |
Product | chain length determinant protein |
Protein accession | YP_001743112 |
Protein GI | 170680449 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3765] Chain length determinant protein |
TIGRFAM ID | |
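
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1054799, 1055779)
print(length)                  # 981
print(protein_length(length))  # 326
```

Both values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields in this record.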
Plasmid Coverage Information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.205095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAGTAG AAAATAATAA TGTTTCTGGG CAAAACCATG ACCCGGAACA GATTGATTTG ATTGATTTAC TAGTGCAGTT GTGGCGTGGC AAGATGACAA TCATCATTTC CGTCATTGTG GCTATTGCCC TGGCTATTGG ATATTTGGCA GTAGCGAAGG AGAAATGGAC GTCAACAGCA ATTATCACTC AGCCCGATGT GGGGCAAATT GCTGGCTATA ACAATGCCAT GAATGTTATC TATGGTCAGG CTGCACCGAA AGTATCGGAT TTGCAGGAGA CGTTAATTGG TCGCTTCAGT TCTGCCTTCT CTGCATTAGC AGAAACGCTG GATAATCAGG AAGAACCAGA AAAACTTACC ATCGAACCTT CTGTTAAGAA CCAGCAATTA CCATTGACTG TTTCTTATGT TGGGCAAACT GCAGAGGGCG CACAAATGAA GTTGGCCCAA TACATTCAGC AAGTTGACGA TAAAGTGAAT CAAGAGTTAG AAAAGGATCT CAAGGACAAC ATTGCTCTGG GACGGAAAAA CTTGCAGGAC TCTTTAAGAA CGCAGGAAGT GGTTGCGCAG GAGCAGAAAG ATCTGCGTAT CCGTCAGATT CAGGAAGCGT TGCAGTATGC GAATCAGGCG CAGGTGACAA AACCGCAGAT TCAACAGACT GGCGAAGATA TCACACAAGA TACGTTGTTC CTTTTGGGGA GCGAAGCGCT GGAGTCGATG ATTAAGCATG AGGCGACCCG TCCGTTGGTG TTCTCACCAA ACTACTATCA GACTCGTCAA AACCTGCTTG ATATCGAAAG CTTAAAGGTT GATGATCTTG ATATTCATGC TTACCGCTAT GTAATGAAAC CGACGTTACC TATTCGTCGT GATAGCCCGA AAAAGGCAAT TACCTTGATT CTGGCGGTGC TGCTGGGTGG CATGGTTGGC GCGGGGATTG TGCTGGGGCG TAATGCTCTA CGCAATTACA ACGCGAAGTA A
Protein sequence | MRVENNNVSG QNHDPEQIDL IDLLVQLWRG KMTIIISVIV AIALAIGYLA VAKEKWTSTA IITQPDVGQI AGYNNAMNVI YGQAAPKVSD LQETLIGRFS SAFSALAETL DNQEEPEKLT IEPSVKNQQL PLTVSYVGQT AEGAQMKLAQ YIQQVDDKVN QELEKDLKDN IALGRKNLQD SLRTQEVVAQ EQKDLRIRQI QEALQYANQA QVTKPQIQQT GEDITQDTLF LLGSEALESM IKHEATRPLV FSPNYYQTRQ NLLDIESLKV DDLDIHAYRY VMKPTLPIRR DSPKKAITLI LAVLLGGMVG AGIVLGRNAL RNYNAK
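
The gene and protein sequences are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; the two tables differ only in which codons are accepted as starts. The helper names are hypothetical, and input containing characters other than A, C, G, and T would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for ten-character grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAGAGTAG AAAATAATAA TGTTTCTGGG"
print(translate(first_blocks))  # MRVENNNVSG
```

The translated prefix matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.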