Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_2119 |
Symbol | |
ID | 5058582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 2396617 |
End bp | 2397834 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640474382 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001158948 |
Protein GI | 145594651 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.111121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0181031 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG GGATCCTGGC GTATCACTTC CCACCGGAAC CGGCATTCAT CCCGGGCAGC CTCGCGGAGG AACTGGCCCG CCGCGGCCAC GAAGTCCGGG TGCTGACCGG ATTTCCCGAC TATCCGGGTG GGTACGTCTA CCCGGGCTGG CGGCAGCGTT GGCGCCACCA GACCCGCAGC GAGCGGCTGA CCGTGCGGCG GGTGCCCCGC TACGTCGGCC GCAGTGGCTC CGAGCGCGGC CGGATGGCCG GTCACCTCTC CTTCGCGGGC AGTGTGTCGC TGGTCGGCCG GCGGTTCTTC GCCGGTGTCG ACGCGCTCTA CGTTCATCAG CCGCCGGCCA CCGCCTTCGC CGCGGCCCGC CTGCTTCGGG CGCTTCGTCG GGTGCCGGCC GTCGTGCACG TTCAGGACGT GTGGGCTGGT CCGAAGCCGG CGGCCGGCGG GGGTGATCGG TGGGCCGCCC GGCTTGCCGG TGCGATGGCC GCTACCTACC GCCACGCCGA CCGGATCGTG GTGGCGGCGC CCTCGCTGCG GGACGTTGTG GTGACCGAGG GAGCCGACCC GGGCCGCGTC GAGGTGGTGC TCAACTGGAC CGACGAGCGG ATCTTCCAGC CGGCTCCGCC GAGCCCGGCC GCTGGTCAAC TAGTCCGGCG CGACGGCCGC TGCGTGGTCA TGTACGCCGG CACCATCGGT GCCCGGCAGG GGCTGGATAC GGCGGTGCGG GCGGCGGCAG CGCTCGACCA CAGGATGGAG CTGGTGCTGG TCGGGTCGGG TGAGCAGGAG CGGCGGGTGC GGGGGCTCGC CGCCGAGCTG GGCGCCGACA ACGTGCGGTT CGTCGAACGG CGCTCGCCGT TGGACATGCC GGAGCTGTAC GCGGCTGCCG ACTACCAGTT GGTCATGCTC CGGGACCTGC CCGAACTACG CAGCACCCTG CCCGGCAAGC TGCCTACCGC CCTGTCGTGC GGGGCGCCGG TCATCGCCTC GGCCGGCGGC GACACCGCCG AGGTGGTGGA GAGTGCTCGC GCCGGACTGT CGTGTCCGCC GGAGGAGTGG GAGACCCTTG CCGACCGGTT CTGGTTGGCC GCCACCATCC CTCCGGCCGC CCGTGCCGAG ATGGGCCGGC GGGGCCGGGA GGCGTACCTG CGGCAGATGT CGATGCCGGC CGGAGTGGAA CGGATCGAAT GCCTGCTGGA CGAGGCCGCC AGCGGACGCC GACGATGA
|
Protein sequence | MKIGILAYHF PPEPAFIPGS LAEELARRGH EVRVLTGFPD YPGGYVYPGW RQRWRHQTRS ERLTVRRVPR YVGRSGSERG RMAGHLSFAG SVSLVGRRFF AGVDALYVHQ PPATAFAAAR LLRALRRVPA VVHVQDVWAG PKPAAGGGDR WAARLAGAMA ATYRHADRIV VAAPSLRDVV VTEGADPGRV EVVLNWTDER IFQPAPPSPA AGQLVRRDGR CVVMYAGTIG ARQGLDTAVR AAAALDHRME LVLVGSGEQE RRVRGLAAEL GADNVRFVER RSPLDMPELY AAADYQLVML RDLPELRSTL PGKLPTALSC GAPVIASAGG DTAEVVESAR AGLSCPPEEW ETLADRFWLA ATIPPAARAE MGRRGREAYL RQMSMPAGVE RIECLLDEAA SGRRR
|
| |