Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1645 |
Symbol | |
ID | 5705908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1886435 |
End bp | 1889311 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641271153 |
Product | LamG domain-containing protein |
Protein accession | YP_001536528 |
Protein GI | 159037275 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.143921 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGT GCGAGGAGTG GGTGTTGGTG GGCGGCATGG TGCGGTCGCG TGCCCGACGA CGGTCTGTCG GTTGGGCTGT CGTGTCGGTG TTGGCGGCGG TCGGCGGGTT GGCCGTGCCG GCGTCGGCGG CGGTCGCGTG TGTGGGTGAG GTGGCGACCG TCGAGGCCGC GATGACCACT GCGGAGGCCT GTGACACGCA GGTCGAGGTG GGCTCGATGC GGTCGGAAAC ACTGGTGGCC TACGCGAATC CGGATGAAAC GATGACGGCG GTGATGTCAC CGGTTCCGGT ACGAACGTTG AAGGGGTCCG AGTGGGTCGA TATCGACCCG ACCCTGGAAC GCAATGGCGA CGGTACGGTG TCGCCGGCGG CCACGACTCA GGATCTGACG TTGTCCGGCG GTGGTTCAGG CCCGCTGTTG ACGTTGGCTG ATTCTGGCCG GCGGATTTCC CTGACCTGGC AGCCGGGTCT GCCGGCACCG GTGTTGAGTG GAGATCTCGC GACCTATCCG GAGGTCCTCC CGGGTGTGGA TCTTCAGGTT CGTGTCGGCG ACAGCTGGTA TCAGCAGTTG TTCGTGGTGA AGTCGGCGCA GGCGGCGGCG AATCCGGCGT TGGCCGCGTT GTCGTTCGAC GTCGTGACGG AGGGTGTGAC GCTGCGGGAG CAGTCAGATG GGGCAATTGA GGCTGTCGAT ACCGCGGGCG AGGTGGTGTT TGCGGCCACG GCACCGTCGA TGTGGGCATC TCCCGGCCCA TCGGCGGTGG AAGTGGCCGT TCGGCCGGGG GAGTGGTCGC GGCCGTCGTT GCAGGCCATG GTTGGTGAGG GCGTTGCCGG TGAGCCGGCT GAGCCGCATC GGGTGCGGAT GGAGGCGGAG GTTTCCGACT CGCGGCTGAC GGTGGTGCCA GTGCCCGAGT TGCTGCGGTC GCCTGACACG GTCTACCCGG TCTATATCGA TCCGAACTTC GCCTATCCGT CGCCGACGTA CTGGACGAAC GTGATGGACG ACAATCCGAA TCACTCGTAT TTCAACGAGC ATGACGAGTT GAAGGTCGGC CGGCAGTGGA ATACCTCGAA CGTGTGGCGT ACGCACATGC AGTTTCACAA CTTTGGGGTG ATGTCCGGGT CGAGGATCGT GTCGGCGGAG TTCAGGGTAA CCGCTGACCA CACAGCGGAC TGTGACGGTA CGGACATTCA GCTGTGGGAG ACGGAGCACA TCACTCACTT CTACCAGTAC ACGTGGAACA CCGCGGCCAA CGGCTGGTTG AAGTATCTGG ACACCAAGCA TTTCGATGCG AATGAGGCGT CATGTCCCAA GGGCGACGAC CAGGAGGTGT TCAACGGGGC GTTGCAGTCG GCGGTGCAGG CCAAGGTAAG CGCTGGTGCC AGCGCCATGA CGTTCGGTAT GCGAGCCGCC AGCGAGTCTG ACTACTACCA GTGGACGAAT TTCCTGCCTA ATAAGACCGC GTTGATCGTG CAGTACGACA TGGTGCCGAT GAGACCGGTC GGTCTGTCGT TCACGACGAC CTCGGACTGC TATGTGCAGT GTTCGTCCCC GGCGATGGTG CGGAATCTGA CGCCGACGTT GCGGGCGCGC GTGCAGGACG CGGACGGCGG CGTTCTGGAG ACGGCGTTCG AGATCCGGAC GGCGGCGAGC CTTTCGGCAC CGATCGTGGT GGAAAGCACG ACGATGCCAC GGTCCGTCGT GACGACGTCG GGTAACGCGA CGGCGACCGC GACGTCTCAG GTTCCGGCCG GCGAGTTGAC CTCTGGGGTC ACCTATTACT GGCATGCCAC CACGATGGAC GAGTTGGGTT TTTGGAGTGG ATGGGGTCCG TGGTACAGCT TCACGGTCGA CACGTCTCCA CCTGGGGTGT CGACGGTCAC GTCCAGTGAG TTTCCCGATC GGCAGTGGGG CGCTGAGGTG GGTACGGCGG GAACGTTCAC GCTGAGCGCT GCCTCGGACG CGGCGGAGTT CACCTGGCAG GTGGATGCGG GGCCGGTGAC GACGGTGGCG GCGACCGGCG GTGATCCGGC CACGGCGACG ACCGGTTCGT ACACACCGGC TACGGACATG GTGCACACGT TGTACGCGAA GGCCAAGGAC GTGGCGGGCA ACGTCGGCCC GACTGTTGAG CATCAGTTCT GGGTGACGCC GCTTGCGAAC CGGTGCTGGA ACTGGCGGTT GGACGAGACG GCCGGGGCGA CCGCGAAGGA CTGGGGTAAC CAGGATCCCG CCGACGTCGC GTGCCTGCCG ATCGGGTCGT CGGTGACGCC GATGCCCGGC GCCGTCTCGT CAGGAGTGAC GTGGACGGCA GATGCGGAGC GCGGGCAGGT GGCCAGCTTC GACGGTACGG GAGAGGTCGC CACCTCGGGT GCGGTGCTCG ACACGACGAA GGCGTTCACG GTGACCGCGT GGGTGAAGCT GACGGACCTG GCGTCGGGCA GTGTGCAGAC GGTCGTTTCG CAGGCCGGTG ACGACGTCAG TGATCTGAGT CTGGAGTACC GGCGGGACGT GAACGGCGGC GCGGGTGGGT TCTGCTTCAC GATGGCCGCA GGTGACGGTG AGTTGACGAC GGCTTGTGCG GATCCGGTGT CGTGGCCGGT CAGTGAGGGC CAGTGGGTGC ACCTGGCCGG TGTGTATGAC CCGACGCTGA ACGCGATCCG GGTGCACGTG ATGGGTGATC CAGTGCGCTG TGCGGGAGAT TTCGGCGAAA GTGCACTCAC ATCGTCGCGG GCGGCCAGTG GGGCGTTTTT GATCGGTCGC GGAACCCAAG GCGTCGATGA CACACCTGCC CATTCGCTTC TTGGGCAGGT ATCCGATGTG TATGCCTTCC AGCGGGTGTT GAGCAACCAG GAAATCTGTC AAATGAGTTT GCCTTAG
|
Protein sequence | MSMCEEWVLV GGMVRSRARR RSVGWAVVSV LAAVGGLAVP ASAAVACVGE VATVEAAMTT AEACDTQVEV GSMRSETLVA YANPDETMTA VMSPVPVRTL KGSEWVDIDP TLERNGDGTV SPAATTQDLT LSGGGSGPLL TLADSGRRIS LTWQPGLPAP VLSGDLATYP EVLPGVDLQV RVGDSWYQQL FVVKSAQAAA NPALAALSFD VVTEGVTLRE QSDGAIEAVD TAGEVVFAAT APSMWASPGP SAVEVAVRPG EWSRPSLQAM VGEGVAGEPA EPHRVRMEAE VSDSRLTVVP VPELLRSPDT VYPVYIDPNF AYPSPTYWTN VMDDNPNHSY FNEHDELKVG RQWNTSNVWR THMQFHNFGV MSGSRIVSAE FRVTADHTAD CDGTDIQLWE TEHITHFYQY TWNTAANGWL KYLDTKHFDA NEASCPKGDD QEVFNGALQS AVQAKVSAGA SAMTFGMRAA SESDYYQWTN FLPNKTALIV QYDMVPMRPV GLSFTTTSDC YVQCSSPAMV RNLTPTLRAR VQDADGGVLE TAFEIRTAAS LSAPIVVEST TMPRSVVTTS GNATATATSQ VPAGELTSGV TYYWHATTMD ELGFWSGWGP WYSFTVDTSP PGVSTVTSSE FPDRQWGAEV GTAGTFTLSA ASDAAEFTWQ VDAGPVTTVA ATGGDPATAT TGSYTPATDM VHTLYAKAKD VAGNVGPTVE HQFWVTPLAN RCWNWRLDET AGATAKDWGN QDPADVACLP IGSSVTPMPG AVSSGVTWTA DAERGQVASF DGTGEVATSG AVLDTTKAFT VTAWVKLTDL ASGSVQTVVS QAGDDVSDLS LEYRRDVNGG AGGFCFTMAA GDGELTTACA DPVSWPVSEG QWVHLAGVYD PTLNAIRVHV MGDPVRCAGD FGESALTSSR AASGAFLIGR GTQGVDDTPA HSLLGQVSDV YAFQRVLSNQ EICQMSLP
|
| |