Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3833 |
Symbol | |
ID | 4074983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | - |
Start bp | 78296 |
End bp | 79729 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638004491 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_611226 |
Protein GI | 99077967 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.00330118 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.252696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ACTATTTTGA TATCACGGAT ATCGTTCTCT ACGTCAAAAA AGAAACGACA ATATCTGGGA TACAGCGCGT TGCTCTGGAA GTTATTAGGC GTGCAGTAGG CAAGCTCGGA TCTGAGCGCG TCAAGGTAGG CATGTGGGAC AAAGGTTCTC AAAGCTATCT GGCGATGGAT GCCGATTTCC TTCTTGAACG GGAAGAATTT GACCCAGATT TTCTAGCTGC TGCTTTGCTG GGGAAGGCTG TGCGCAAGGT GCAAACAGTA CCCAGTATTC TTACCCGGTA TCGCAACCAG CGCGCGAAAT ACTTGTATCA TTGGCTGCGA GCTAACCTGG AAGAATACAA GCGAAACCAT CGTTATTTTG AGCGACGCGG CACTACTTTA GAAGGCTGGG TGCTCGAGAA AGAAAGAGCC CATAGAAACT CAAAGGAGCT GCCGGTTCTC GCGCCTGCAG CATCGGTACA AAAGCGGGTT CCGCTCGAAG ACGTGGCGCA ATCAGGCGAC CGTCTGATTA TTTTAGGTGC GACTTGGGGG CTGCAAGACC TCAACGACCA TCTTATTGAA TTAAAAGAGA AACTTGGAGT TGAAGTTGAT CTCCTTATTC ACGATCTAAT CCCGCTGGTC GCGTCAGAAC ATCTTGCGGA TGATTTTTCT GAGACATTCT ACCGCTGGCT GGAGGGATCT ACTCTCTATT GTTCGCGTTA TTTTGCAAAT TCTCAGAACA CAGGGAAGGA TTTACGATGC TTCCTAAGCG AAATTAACTC CGACCTTCCA ATCGATGTTG TGCCGTTGGC ACAAGCACTG GGAGATGGCC CTGCGGCCCT AGATACTCAA AGCTTTAGAT CCAAACTGAA CGCGACAAAG GGTGTGCGCA GATCGTTTCT GAACATGACA AAGGTACCGT ATGTTCTCGT TGTTGGTACG CTTGAAACAA GAAAGAATAT CTGGCGTTTA GCTCAGGCCT GGCAGCAACT CATACAAGAT CCTCAAGTAG AACCTCCGCG CCTAATTTTT GCGGGGAAGA AGGGGTGGTA CATCGATGAA TTTCTGGATT GGATGAAGGC GAGTGGAAAT CTTGATGGCT GGATTAGTAT CGCGGATAGG CCGACTGACA AGGAATTAGC CTTCCTCTTC CACAATTGCG TATTTACAGC AAACGTCTCC ACATATGAGG GATGGGGTTT GCCAGTGGGA GAAGGCCTAA GCTTCGGCAA AACAGGTGTT GTCGCCGAAA ATTCCTCACT GACAGAGGTC GGTGGAGACA TGGTCGAGTA CTGTGATGCT CATTCGATCA GTAGTATTCG AGACGCGTGT AAGCGCTTGA TCATGGAAGA TGGTAGGCGA ACGGAGCTAG AGCAACGGAT TAAGAACACG CAGTTACGTA GTTGGGATGA TGTGACCAAC GATTTGCTGG AGTACCTTAA CTAG
|
Protein sequence | MQTYYFDITD IVLYVKKETT ISGIQRVALE VIRRAVGKLG SERVKVGMWD KGSQSYLAMD ADFLLEREEF DPDFLAAALL GKAVRKVQTV PSILTRYRNQ RAKYLYHWLR ANLEEYKRNH RYFERRGTTL EGWVLEKERA HRNSKELPVL APAASVQKRV PLEDVAQSGD RLIILGATWG LQDLNDHLIE LKEKLGVEVD LLIHDLIPLV ASEHLADDFS ETFYRWLEGS TLYCSRYFAN SQNTGKDLRC FLSEINSDLP IDVVPLAQAL GDGPAALDTQ SFRSKLNATK GVRRSFLNMT KVPYVLVVGT LETRKNIWRL AQAWQQLIQD PQVEPPRLIF AGKKGWYIDE FLDWMKASGN LDGWISIADR PTDKELAFLF HNCVFTANVS TYEGWGLPVG EGLSFGKTGV VAENSSLTEV GGDMVEYCDA HSISSIRDAC KRLIMEDGRR TELEQRIKNT QLRSWDDVTN DLLEYLN
|
| |