Gene Information |
Locus tag | EcSMS35_4904 |
Symbol | mdoB |
ID | 6147481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5025483 |
End bp | 5027774 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619707 |
Product | phosphoglycerol transferase I |
Protein accession | YP_001746814 |
Protein GI | 170682467 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
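The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(5025483, 5027774)
prot_len = protein_length_aa(gene_len)
print(gene_len)  # 2292
print(prot_len)  # 763
```

Both results agree with the Gene Length (2292 bp) and Protein Length (763 aa) fields in the record.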
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0182447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGTCAGAAC TACTCTCTTT CGCCCTTTTT CTCGCCTCTG TGCTGATTTA CGCATGGAAA GCGGGACGTA ACACCTGGTG GTTTGCAGCC ACGTTAACGG TGCTGGGGCT ATTTGTCGTT TTAAATATCA CCCTGTTTGC CAGCGACTAT TTTACTGGCG ATGGTATTAA CGACGCGGTT CTCTATACCT TAACCAACAG CCTGACCGGT GCTGGCGTCA GCAAATACAT TCTGCCGGGT ATCGGCATTG TGCTGGGGCT GACAGCGGTG TTCGGTGCGC TGGGCTGGAT CCTGCGCCGT CGTCGTCATC ATCCGCACCA TTTTGGTTAC AGCCTGCTGG CACTCTTACT GGCGCTGGGT TCAGTGGATG CCAGCCCGGC ATTTCGTCAG ATAACGGAAC TGGTGAAATC CCAGTCACGC GACGGCGACC CGGACTTTGC GGCTTATTAT AAAGAGCCGT CGAAAACTAT CCCTGACCCG AAACTCAACC TGGTTTATAT CTACGGCGAA AGTCTCGAGC GAACCTATTT TGATAACGAA GCTTTCCCGG ATCTCACGCC TGAACTGGGC GCGTTGAAAA ATGAAGGCCT GGATTTCAGC CACACGCAGC AGCTGCCAGG AACGGATTAC ACGATTGCGG GCATGGTGGC TTCTCAGTGC GGCATACCGC TGTTTGCCCC CTTTGAAGGC AACGCCTCCG CCTCTGTCTC CAGCTTCTTC CCGCAGAACA TCTGTCTGGG CGATATCCTG AAAAACTCGG GTTATCAGAA CTATTTCGTG CAGGGCGCGA ATCTGCGTTT TGCCGGTAAA GATGTGTTCC TGAAATCGCA CGGCTTCGAC CACTTATTCG GCTCAGAAGA GCTGAAAAGC GTGGTGGCCG ACCCGCACTA TCGCAACGAC TGGGGATTCT ACGACGATAC CGTTCTCGAT GAAGCGTGGA AAAAGTTTGA AGAGCTTTCC CGCTCAGGTC AGCGATTCTC ACTGTTTACC CTGACAGTCG ATACCCATCA CCCGGATGGT TTTATCTCGC GTACCTGTAA CCGCAAAAAA TATGATTTTG ACGGTAAGCC GAATCAGTCA TTCAGCGCGG TAAGTTGCAG TCAGGAGAAT ATCGCGACGT TTATCAACAA AATCAAAGCG TCACCGTGGT TTAAAGATAC TGTTATCGTC GTCTCTTCTG ACCATTTAGC GATGAACAAC ACGGCGTGGA AATACCTTAA TAAACAGGAT CGCAATAACC TGTTTTTTGT CATCCGTGGC GACAAGCCGC AGCAAGAGAC GCTGGCGGTA AAGCGTAACA CGATGGATAA CGGCGCGACG GTGCTGGACA TTCTCGGTGG CGATAACTAT CTCGGACTTG GTCGTAGCAG TTTATCCGGG CAGTCGATGT CGGAAATCTT CCTCAATATC AAAGAGAAAA CATTGGCGTG GAAGCCGGAT ATCATCCGCC TGTGGAAATT CCCTAAAGAG ATGAAAGAGT TCACCATCGA CCAGCAGAAA AACATGATTG CCTTCTCGGG TAGCCATTTC CGTTTGCCGC TGCTGTTGCG GGTTTCAGAC AAACGCGTGG AACCGCTGCC GGAAAGTGAA TACTCAGCAC CGCTGCGTTT CCAGCTGGCC GATTTCGCTC CACGCGACAA TTTCGTCTGG GTTGACCGTT GCTACAAGAT GGCACAACTT TGGGCTCCGG AACTGGCACT CTCCACCAAC TGGTGTGTCT CGCAAGGGCA GCTTGGCGGT CAGCAAATTG TCCAGCATGT TGACAAAACA ACATGGAAGG GCAAAACGGC ATTTAAAGAT 
ACGGTCATCG ACATGGCGCG TTACAAAGGC AATGTCGATA CGCTGAAGAT TGTTGATAAC GATATTCGCT ACAAAGCCGA CAGTTTCATC TTTAACGTCG CCGGTGCGCC GGAAGAGGTG AAACAGTTTA GCGGGATTTC TCGTCCGGAG TCGTGGGGCC GCTGGTCCAA CGCTCAGCTG GGCGATGAAG TAAAAATCGA GTATAAGCAT CCGCTGCCGA AGAAATTTGA CCTGGTGATT ACCGCCAAAG CATACGGCAA TAACGCCAGC CGTCCTATTC CGGTACGCGT AGGCAATGAA GAACAAACCC TTGTACTGGG CAATGAAGTG ACCACCACCA CGCTGCATTT CGATAACCCA ACCGATGCCG ACACGCTGGT AATTGTGCCG CCGGAACCTG TCTCAACCAA CGAAGGGAAT ATCCTCGGAC ACTCGCCGCG TAAGCTCGGG ATCGGCATGG TGGAGATTAA AGTGGTAGAA CGTGAAGGGT AA
|
Protein sequence | MSELLSFALF LASVLIYAWK AGRNTWWFAA TLTVLGLFVV LNITLFASDY FTGDGINDAV LYTLTNSLTG AGVSKYILPG IGIVLGLTAV FGALGWILRR RRHHPHHFGY SLLALLLALG SVDASPAFRQ ITELVKSQSR DGDPDFAAYY KEPSKTIPDP KLNLVYIYGE SLERTYFDNE AFPDLTPELG ALKNEGLDFS HTQQLPGTDY TIAGMVASQC GIPLFAPFEG NASASVSSFF PQNICLGDIL KNSGYQNYFV QGANLRFAGK DVFLKSHGFD HLFGSEELKS VVADPHYRND WGFYDDTVLD EAWKKFEELS RSGQRFSLFT LTVDTHHPDG FISRTCNRKK YDFDGKPNQS FSAVSCSQEN IATFINKIKA SPWFKDTVIV VSSDHLAMNN TAWKYLNKQD RNNLFFVIRG DKPQQETLAV KRNTMDNGAT VLDILGGDNY LGLGRSSLSG QSMSEIFLNI KEKTLAWKPD IIRLWKFPKE MKEFTIDQQK NMIAFSGSHF RLPLLLRVSD KRVEPLPESE YSAPLRFQLA DFAPRDNFVW VDRCYKMAQL WAPELALSTN WCVSQGQLGG QQIVQHVDKT TWKGKTAFKD TVIDMARYKG NVDTLKIVDN DIRYKADSFI FNVAGAPEEV KQFSGISRPE SWGRWSNAQL GDEVKIEYKH PLPKKFDLVI TAKAYGNNAS RPIPVRVGNE EQTLVLGNEV TTTTLHFDNP TDADTLVIVP PEPVSTNEGN ILGHSPRKLG IGMVEIKVVE REG
|
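The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is consistent with translation table 11 (listed in the record), which uses the standard codon assignments but accepts several alternative start codons, each read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases and the last 12 bases of the gene sequence. The helper name `translate_cds` and the hand-built codon table are assumptions made for illustration and are not the database's own tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
TABLE11_STARTS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a table 11 start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in TABLE11_STARTS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate_cds("TTGTCAGAAC TACTCTCTTT CGCCCTTTTT"))  # MSELLSFALF
# Last four codons of the gene sequence, ending in the TAA stop.
print(translate_cds("CGTGAAGGGTAA"))  # REG
```

The two outputs match the first ten residues (MSELLSFALF) and the last three residues (REG) of the protein sequence in the record.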