Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_2132 |
Symbol | |
ID | 5058595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 2409728 |
End bp | 2411887 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640474395 |
Product | glycosyl transferase family protein |
Protein accession | YP_001158961 |
Protein GI | 145594664 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00802467 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00453375 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACAGAA CCGAGACAAC CTACGGGCCC CAGGCGGCCG CCCCGGCGAC GAGCGTCACT TCGACGTCGC CGGCAGAGTC GGAGGCAGAG CCGACCTCCA CGCCGCGTCG GTCGGCCCGT TGGGCCCGCC TCTGCCTCGG TGGGCTCCTG CTCGCCACCG CTTGGCTCTA CCTGTGGGGG CTCGACGTCT CCGGTTGGGC GAACGCCTAC TACTCGGCGG CGGCGCAGGC CGGCGCAGAG AACTGGACCG CCCTCTTCTA CGGCTCGTCG GATGCCGCCA ACTCCATCAC CGTTGACAAG ACACCCGCCG CGCTGTGGCT GATGGCGCTC TCGGTGCGGC TGTTCGGCCT GAACAGCTGG GCGGTGCTGC TGCCGCAGGC GCTGTGCGGG GTGGCCGCGG TCGCGGTGCT CTATGCCACG GTACGGCGCT GGTATGGCCC GGCAGCAGGG TTGATCGCCG GCGCGGTCCT CGCCGTCACG CCGGTGGCCA CGCTGATGTT CCGGTTCAAC AACCCGGACG CGCTGTTGGT GCTGCTTCTG GTCGGCGCCG CCTACGCCAC CGTACGAGCG ATCGAGACGG CTGCCACCCG TTGGCTCGTA CTCGCCGGGG TGCTGGTCGG GCTCGGCTTC CTCACGAAGA TGCTGCAGGC GTTCCTGGTG GTACCAGTGC TCGCCGGCGT CTACCTGCTG GCCGCGCCGA CCGGGCTCGG CCGGCGGATT CGTCAGACCC TGCTGGCCGG CCTCGCGGTC GTGCTGTCGG CGGGGTGGTG GGTGGCCATC GTCGAATTGG TCCCTGCCAG CGCTCGCCCG TACGTCGGCG GCTCGCAGAC CAACAGTGTC CTCGAGTTGA CCCTTGGCTA CAACGGTCTC GGCCGCATCA CCGGCCGCGA GGTGGGCAGC GTGGGGCAGT CCGGTGGAGG GAGGTTCGGT GACGGGACCG GACTGCTGCG CATGTTCGAC GACCGGGTCG GCGGGCAGAT CGCCTGGCTG TTGCCAGCCG CGCTGATTCT CCTTGTGGTC GGCCTGCTGA TGGCCGGCCG GGCTCCGCGT ACCGACCGGA CCCGCGCCGG GCTGCTGCTC TGGGGCGGCT GGCTGCTGGT TACCGGCGCG ATCTTCAGTT TCATGTCCGG GATCTTCCAC GAGTACTACA CCGTGGCCCT GGCACCGGCG GTTGGTGCCC TGGTCGGGAT CGGTGTCACG CTGCTGTGGC GGGTGCGAGC CGCTCCGCGC GGCGCTGTCT GGCGTCGGCT GCCGTACGCC GCCACCGCGG TTCTGGCCGG GACGCTGGCT GTCACCGTGG GATGGTCCTG GCTCCTGCTG GGCCGTAGCC CTGACTGGTA TCCGTGGTTG CGTACCACGA TCTTGGTGGG TGGCATCGTG GCGGCGGCGT TGCTGGTGCT TTCTCCTCGG CTACCCCGGT CAGTTGGGGC GGCGGGTGTC GCGCTGGGCG CCGCGGCGGC CCTCGCCGGG CCGGTGGCGT ACTCGGTGCA CGCCTCCGCT ACGGCGCACA ACGGTGGGGT TCCCACCGCG GGCCCGGCGT TGGCTGGCGA TGTCGGCACC AGGTCTGGTG GCGCCGGGGA CGGGCCCGGC GCCCCCGGTG GCGGTCAGCC TCCCGGTGCC GGGCAGTTGC CGCAGAGGCC GGATCGGCAA CCCGGCACGG CGGCCGACGG CCGCCAGCAG GGTGGGTCGG GTCAGCCGGG TGGGTCGGGT CAGCCGGGTG GGTCGGGTCA GCCGGGTGGG TCGGGTCAGC TCGGCCAGCC GGTCGGCGAC GGCGGGCGCG GCGACGGCGG GCTGTTGGGT GCCCGCGTTC CCAGTGCGCA ACTGCGCGAG CTACTCGAGC TCGACAGCGA CAGGTACACC TGGGTGGCGG CCACGGTGGG TGCGAACAAC GCCGCCGGCT ACCAGCTGGC CACCGGTGAT CCGGTGATGC CCGTGGGCGG GTTCAACGGC ACCGACCCCT CCCCGACCGT CGCCGAGTTC CAGCGCTATG TCGCTGACGG AAGGATCCAC TGGTTCATCG GTGGAGGTGG CTTCCGGGGT GCCAACGGTG GCAGCTCCGC CTCCTCCGAG ATCGCCGCCT GGGTGGCGCA GACGTTCGAG GCGCGAACCG TGGACGGAGT CACGATCTAT GACCTGAGCA ACGGGGGGGC CAACGCATGA
|
Protein sequence | MDRTETTYGP QAAAPATSVT STSPAESEAE PTSTPRRSAR WARLCLGGLL LATAWLYLWG LDVSGWANAY YSAAAQAGAE NWTALFYGSS DAANSITVDK TPAALWLMAL SVRLFGLNSW AVLLPQALCG VAAVAVLYAT VRRWYGPAAG LIAGAVLAVT PVATLMFRFN NPDALLVLLL VGAAYATVRA IETAATRWLV LAGVLVGLGF LTKMLQAFLV VPVLAGVYLL AAPTGLGRRI RQTLLAGLAV VLSAGWWVAI VELVPASARP YVGGSQTNSV LELTLGYNGL GRITGREVGS VGQSGGGRFG DGTGLLRMFD DRVGGQIAWL LPAALILLVV GLLMAGRAPR TDRTRAGLLL WGGWLLVTGA IFSFMSGIFH EYYTVALAPA VGALVGIGVT LLWRVRAAPR GAVWRRLPYA ATAVLAGTLA VTVGWSWLLL GRSPDWYPWL RTTILVGGIV AAALLVLSPR LPRSVGAAGV ALGAAAALAG PVAYSVHASA TAHNGGVPTA GPALAGDVGT RSGGAGDGPG APGGGQPPGA GQLPQRPDRQ PGTAADGRQQ GGSGQPGGSG QPGGSGQPGG SGQLGQPVGD GGRGDGGLLG ARVPSAQLRE LLELDSDRYT WVAATVGANN AAGYQLATGD PVMPVGGFNG TDPSPTVAEF QRYVADGRIH WFIGGGGFRG ANGGSSASSE IAAWVAQTFE ARTVDGVTIY DLSNGGANA
|
| |