Gene Information |
Locus tag | Sare_2279 |
Symbol | |
ID | 5706038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2616289 |
End bp | 2618391 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271758 |
Product | glycosyl transferase family protein |
Protein accession | YP_001537129 |
Protein GI | 159037876 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
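The length fields in the record above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Sare_2279.
length_bp = gene_length(2616289, 2618391)
print(length_bp, protein_length(length_bp))
```

With the coordinates listed for Sare_2279 this reproduces the stated 2103 bp gene length and 700 aa protein length.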
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.286519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00538135 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACAGAA CCGAGACGAC GACCCACGGG CCCCGTGCCT TCGCCCCGGC GGGGAGCGTC GCTCCGACGC CGTCGGCCGA GGCCGACGCA CAGCCGACAC CCGCGCTGCG GCGGCCGGCC GGCTGGCCCC GCCTCTGCCT CGGCGGGCTG CTACTTGCCA CCGCTCTTCT CTACCTGTGG ATGCTCGACG TCTCCGGCTG GGCGAACGCC TACTACTCGG CGGCAGCACA GGCCGGCGCG CAGAACTGGA CCGCCTTCTT CTACGGCTCG TCGGACGCCG CCAACTCCAT AGCCGTCGAC AAGACGCCTG CCGCCCTGTG GCTGATGGCG TTGTCGGTGC GGCTGTTCGG CCTGAGTAGT TGGGCGGTGC TGCTGCCGCA GGCGTTGTGC GGGGTGGCCG CGGTCGGGGT GCTGTACGCC GCGGTGCGGC GCTGGCACGG CCCGGTAGCG GGTCTGATCG CCGGTGCGGT CCTCGCCGTC ACGCCGGTGG CCACGCTGAT CTTCCGGTTC AACAACCCGG ACGCGCTGCT GGTGCTGCTC CTGGTCGCCG CCGCGTACGC CACCGTACGG GCGGTCGAGA CGGCCGCCAC CCGGTGGCTC GTGCTTGCCG GCGTGCTGGT CGGGCTCGGC TTCCTCACGA AGATGCTGCA GGCGTTCCTG GTGGTGCCGG TGCTGGCCGG CGTGTACCTG CTGGCCGCGC CGACCGGGCT TGGCCGGCGG ATCCGCCAGA CGCTGCTGGC CGGTCTCGCG GTCGTGCTGT CGGCGGGGTG GTGGGTTGCC ATCGTCGAGT TGGTCCCGGC CAGCGCCCGT CCGTACGTCG GTGGCTCACA GACCGACAGC GTCCTCGAAC TGACCCTCGG CTACAACGGG CTTGGCCGTA TCACCGGCCG GGAGGAGGGT AGTGTCGGCC GGCCCGGCGG AGGGCCGTTC GGTGATGGGG CCGGACTGCT GCGCATGTTC GACGACCGGG TCGGCGGGCA GATCTCCTGG TTGTTGCCGG CCGCGTTGAT TCTGCTCGCG GCCGGTCTGC TGCTGGCCGG ACGGGCGCCG CGTACCGACC GGACCCGCGC GGGGCTGCTG CTGTGGGGCG GCTGGCTGCT GGTCACTGGT GCGATTTTCA GTTTCATGTC CGGGATCTTC CACGAGTACT ACACCGTTGC CCTGGCGCCG GCGGTCGGTG CCCTTGTCGG GATCGGTGTC ACGCTGCTGT GGCGGGTGCG GTCCGTTCCG GACGGCACCG CCTGGCGCCG GTTGGCTGCC AGCGCTGTTC TGGTCGGGAC ACTGGCCGTT ACCGCTTGGT GGTCCTGGCT GCTGCTGGGT CGGAGCCCTG ACTGGCATCC GTGGTTGGGT ACCACGGTTC TGGTGGTCGG CCTCGCAGCG GCGGCGTTGC TGGCGCTCTC TGCGCTGCTG CCCCGCGCCG TTGGGGCGGT GGGGCTCGCG CTGGGCGCCG CGGCCGCCCT CGCCGGGCCG GTGGCGTACT CGGCGCACAC CGCCGCGACG GCACACAGTG GTGGGATTCC CACCGCCGGC CCGGCGGTGG CTGGCGATGC CGGCGCCCGA CCAGGTGGTC CTGGGGGCGG TCCCGGCGCC GGTCAGCCTC CCCGTGGCGG GCAGTCGCCG CAGGGACCGG ACAGGCAGTC CGACGCGGGA CTCATTGGCC GCCAGCCCGG CCGACCAGAT CAGTCGGGTC AGCCGGGCCG GCCCGCGGGT GGCGGCGGGC GGGGCGACGG CGGGCTGCTG GGTGCCCGCG TTCCCAGTGC GCAGCTGCGC GAGCTTCTCG AACAGGACAG CGACAACTAC 
ACCTGGGTGG CGGCCACGGT GGGGGCGAAC AATGCCGCCG GATACCAGCT GTCCACGGGC GACCCGGTGA TGCCCGTCGG TGGGTTCAAC GGCACCGACC CCTGGCCGAC CGTGGCTAGG TTCCAACGGT ACGTCGCCGA CGGAGAGATC CACTGGTTCA TCGGCGGGGG TGGCTTCCGG GGCGCCAACG GCGGCAGCTC CGCGTCGTCC GACATCGCCA CCTGGGTGGC GGAGACGTTC GAGGCGCAGA CCGTGGACGG AGTCACCATC TACGACCTGA GCAGCGGGGA GGTCGAGCGA TGA
|
Protein sequence | MDRTETTTHG PRAFAPAGSV APTPSAEADA QPTPALRRPA GWPRLCLGGL LLATALLYLW MLDVSGWANA YYSAAAQAGA QNWTAFFYGS SDAANSIAVD KTPAALWLMA LSVRLFGLSS WAVLLPQALC GVAAVGVLYA AVRRWHGPVA GLIAGAVLAV TPVATLIFRF NNPDALLVLL LVAAAYATVR AVETAATRWL VLAGVLVGLG FLTKMLQAFL VVPVLAGVYL LAAPTGLGRR IRQTLLAGLA VVLSAGWWVA IVELVPASAR PYVGGSQTDS VLELTLGYNG LGRITGREEG SVGRPGGGPF GDGAGLLRMF DDRVGGQISW LLPAALILLA AGLLLAGRAP RTDRTRAGLL LWGGWLLVTG AIFSFMSGIF HEYYTVALAP AVGALVGIGV TLLWRVRSVP DGTAWRRLAA SAVLVGTLAV TAWWSWLLLG RSPDWHPWLG TTVLVVGLAA AALLALSALL PRAVGAVGLA LGAAAALAGP VAYSAHTAAT AHSGGIPTAG PAVAGDAGAR PGGPGGGPGA GQPPRGGQSP QGPDRQSDAG LIGRQPGRPD QSGQPGRPAG GGGRGDGGLL GARVPSAQLR ELLEQDSDNY TWVAATVGAN NAAGYQLSTG DPVMPVGGFN GTDPWPTVAR FQRYVADGEI HWFIGGGGFR GANGGSSASS DIATWVAETF EAQTVDGVTI YDLSSGEVER
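The gene and protein sequences above can be compared by translating the DNA. The following is a minimal sketch, not the database's own pipeline. It uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code, strips the 10-base spacing used in the record, and stops at the first stop codon. It ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG. The helper names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blank-separated 10-letter grouping and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGGACAGAA CCGAGACGAC GACCCACGGG"))
```

Applied to the first 30 bases of the gene sequence, `translate` gives `MDRTETTTHG`, which matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same check against the complete protein sequence and the reported GC content.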