Gene Information |
Locus tag | Dtox_4113 |
Symbol | |
ID | 8431127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4282416 |
End bp | 4283516 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645036308 |
Product | glycosyl transferase family 4 |
Protein accession | YP_003193406 |
Protein GI | 258517184 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase |
TIGRFAM ID | |
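
The coordinate and length fields above are linked by simple arithmetic. The coordinates are inclusive, so the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the stop codon. A minimal Python sketch of that check, using the values from this record (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(4282416, 4283516)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results agree with the Gene Length and Protein Length fields of the record.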
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000368413 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000004636 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCTTA AGCCGGTATT GCTTGTTCTG GCTTCCTTAT TTATTTCTCT GCTGGCAACT CCTCAAGTAA TTAAGCTGGC TTATCGCTGG GGGGCTCTGG ATCAGCCGGA TCCGCGCAAA GTTCACCGGA AGATTATGCC AAGACTGGGT GGTCTGGCAG TTTATTTAAG TTTTATTGCG GGTATTTTGT TAATGGGGCA GCTCACCTCT CCGGTGATAG GACTTATAAT AGGGTCAACC TTGATTGTTT TGTTGGGTAT TGTCGATGAT ATCAAAGGAA TTTCACCTCG AGTAAAATTA TTGGGACAGG TGCTGGTGGC TTTCTCAGTA CTCCCGTTTG GTATAAGTGT TGATTTTATA ACAAATCCTA TAAACGGAGA TATTCTGCAT CTGGGCTTTT TGAGTATCCC GGTAACCGTC TTCTGGCTGG TAGCCGTCAC CAACGCTGTT AACCTGATAG ACGGCCTGGA TGGTCTGGCC GGCGGTACCT CACTTATTTC GGCTGTTACT TTAGCTGTGG TATCATGGAC CCAGTGGCGG GTTTTTGGCC TGCCGGAACA AATGCAGGTA ATTTTGATGG CGTTGATATT GGCAGCCTCG CTGTTAGGAT TTTTGCGCTA CAACTTTAAT CCGGCCAAGA TATTTCTGGG CGATACAGGC TCAATGATGC TGGGTTTTTG CCTGGCTGCT ATGTCTGTCA TGGGTTTGAC CAAGAGTACT ACGGCTATTT CCGTAATTAT ACCTCTGGTT ATTCTGGGTA TTCCCCTGCT GGACACAGTA TTTGCCGTTG TGCGGCGCTA CAATATGCAT CAACCTATTT TTAAGGCGGA CAAGGAGCAC CTGCACCATC GCTTGCTGGC CCTGGGACTG AGCCATAAGC AGGCTGTTTT GGCTATTTAC GGTGTGAGTG CTTTTTTAGG ATTGAGTGCG GTCATGTTGA ATTTAATAAC GACTAACCAG GCAATACTGG TTCTGGTTGT ACTGGCAGTT GTGATTATTA CCGCAGCCAA TAAAATAGGC ATTATCGGAC ATAAAAGACA GCCTGCGTAT CAAATTTCGT CTGGGGCGGT CGAAATGGAA AAACGGTCCT CGGAAATATA G
|
Protein sequence | MVLKPVLLVL ASLFISLLAT PQVIKLAYRW GALDQPDPRK VHRKIMPRLG GLAVYLSFIA GILLMGQLTS PVIGLIIGST LIVLLGIVDD IKGISPRVKL LGQVLVAFSV LPFGISVDFI TNPINGDILH LGFLSIPVTV FWLVAVTNAV NLIDGLDGLA GGTSLISAVT LAVVSWTQWR VFGLPEQMQV ILMALILAAS LLGFLRYNFN PAKIFLGDTG SMMLGFCLAA MSVMGLTKST TAISVIIPLV ILGIPLLDTV FAVVRRYNMH QPIFKADKEH LHHRLLALGL SHKQAVLAIY GVSAFLGLSA VMLNLITTNQ AILVLVVLAV VIITAANKIG IIGHKRQPAY QISSGAVEME KRSSEI
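
As a consistency check between the two sequences, the gene sequence can be translated codon by codon. The sketch below assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TAG, even though the gene lies on the minus strand of the replicon). It uses the standard codon assignments, which translation table 11 shares with table 1; the two tables differ only in which start codons are permitted. The helper names are illustrative, not taken from any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGGTGCTTA AGCCGGTATT GCTTGTTCTG"))  # MVLKPVLLVL
print(translate("AAACGGTCCT CGGAAATATA G"))           # KRSSEI
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the whole protein sequence and the reported GC content.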