Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1360 |
Symbol | |
ID | 5705574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1568335 |
End bp | 1572078 |
Gene Length | 3744 bp |
Protein Length | 1247 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270871 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001536252 |
Protein GI | 159036999 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.858944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000129137 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACCTGT TCCGCAGACT CCGGATCCCC TGGGCCGAGC GACCCGCCAC CAACCACATG GATGCCACCG ATGACGACAG GCCGCTGGTG CCGGCGACAC GGAGCGCGCA GGACACCGCG ACGAGCCCGC CATCGGAGGC CCCGAACCCG CCACCACCGA CACTCCTTGA CCCGATGGCA CCACCAACGG CCCTGGACCC GGCGGCACCA CCAACGGTCC TGGACCCGAT GGCACCACCG ACGGCCCTGG ACCCGGTGGC GCTGCCCCGC TGTTTCGGCC CGGTGCCGAA CCACGCGCAC AGCCCGCTGC CGGCCCTGAA CGCCGACGGC GAGGTGATTC CCGGCACCGG CATCCGAAAG TTCGTCGACC CGCTGCCCAC GCTCGACTCG GGCTACCACC CAGCCGGCGG CCTGCCGGTG GCGGTCCCGG ACACCATCAC CTACCCGGGC TGCGACTACT ACGAAATCGG TCTCCAGGAG TACGCGCAGC GGCTGCACCG GGACCTTCCG GCAACCCGGC TGCGCGGGTA CCGGCAGCTC AACCTCGGCA CCGACCCGAC CGGGCACAAC ACGGTCGCAC CACCGGAGCA CCCGGTTCAC CTCGGCCCGG TGATCGTCGC GCGGCGCGGC CGGCCGGTGC GGGTCAAGTT CATCAACCAG CTTCCCACCG GGCAGGCCGG TGAGCTGTTC CTACCCGTCG ATCTGAGCAT CGACGGCGCC GGCCGGGGAC CGCTCGACGG ACCGGCACCG TACCCGCAGA ACCGGGCGGT GCTGCACCTC GCGGGCGCGC AGACCGGTTG GATCAGCGCC GGAACCCCAT GGCAGTGGAT CACACCCGCC GCAGAGGTCA CCCCGTACCC GACCGGTCCT GGCCTGGCCC ACGTGCCGGA CATGCCGCCG CCCGGCATCG GGGCCACCAC GCTGTACTAC CCGAACGAAC AGAGCGGGCG GCTCCTCGCG TTCCACGACA ACACCGCCGG ACTCGCCCGA CTCACAATCT ACTCTGGTCA GCTCGGCCTT TACCTGCTCA CCGATCCCGC CGAAGCTGGC CTGGTCGCCG ACGGCGTGCT GCCCGCCGAC CAGATCCCCC TGGTCTTCGA GGACAAGACG TTCGTGCCCG ACGACAGCCA GCTGGCCGAC ACCGACCCGA CCTGGGACCG GGACCGGTGG GGCGCCCGAG GCAGCCTCTG GCACCCGCAC GTCTACCAAC CCCGGCAGAA CCCGTACCGA GACGACGGCA TCAACCCGAC CGGACGCTGG GACTACGGAC CGTGGACGCG CACCGCCGAG GAGATGGGGG TGAACGGCAC GACCGGGGCG GACGGCATCG GAGCGGGCCC GGTGCCGAAC CCACACCACG ACCCGGTCGG TGACACGCCG GATGAGCCAC CGATGACACC AGGCGTACCG CACCCGTCGG CGGTACCCGC CGCATACGGC GACACCGTGC TGGTCAACGG GGTGGCGTAC CCGTGGCTGA CGGTCGAACC GCGCACCTAT CGGTTCCGCG TCCTCAACGC CTGCCTGGAC CGCAGCCTGA ACCTTCAGCT CTATCGGGCG CGCGGCACCG GACCGATGTG GGACCGCAAC GGCGAACTGG CCGACCCGGA CGCCGGCGAG GTTCCGATGG TGGCGGCGAC ACGGACACCA GACCGACCCG CCCGATGGCC GACAGACGGA CGGGAAGGCG GAGTGCCGGA CCCGCTCGCC GCCGGCCCCG ACCTGGTGCA GATCGGCAAC GAAGGAGGCC TCCTGCCGGC CCCGGCGGTA CTCCCCAGCC AACCGGTGAC CTACCGACAC GACCGGCGCG ACCCCACCGT CCTCAACATC GACCGGCACG CGCTGCTGCT CGCCCCCGGA GAACAGGCCG ACGTGCTGGT CGACTTCGCC ACCGTCGCCC CCGGCAGCAC CGTGCTCCTC TACAACGACG CCCCGGCCCC GCTCCCCGAC TTCGACCCCC GGTACGACCA CCACACCGAA GCCCCGGACC ACACCGCCGA AGGTGGGACA CCACCGAGCC GACCCGGGTA CGGGCCGAAC ACCCGCACCC TGCTGCAGTT CCGGGTGGCC GGCACCCCGG CGCCCCACTA CGACCTGGCC CGGCTCCGGG AACGGCTACC CCGGGCGTAC GCGGCCGGTC AGCTTCCGCC GATCGTGCCG CAACCCGCCT ACGACGCGGC GTTCGGCACC CGCACCGCCC GCGAAACCAC AGTCCCGGTG CACGCCACCA CAGTCGGTTT CACCCCGGCC GGGGGTACCG ATCCCGTCCT GCTGCCGCTC GCCGTGAAGG CCGTCCAGCA GGTCTTCGAA CCGGCACACG GCCGGCTCGC CGGCCGGCTC GGCGTCGGGC CCCCACCCGG TGGCCCGCTC GCCCCCGCCG TGGTGTCGCT CAACCCCACC GACCCGGCCA CCGAGATGGT GCAGGTCGGT GAGCCGACGG TCCCGATGGG CCCGGCCGTC GACGGCACAC AGCTGTGGCG GATCCGTGGC GCCAGCCGGC AGACCCAGCC GATCCGCCTC GACGGTGTCG ACCTGCAACT GATCAACCGG GCAGGGTGGG ACGGCACGCT CCGCCCACCG GACCCGAACG AGGTGGGCTG GAAACAGGTC ATCCGGGTCA ACCCGCGCGA GGACGTGGTG GTCGCGCTGC ACCCGGTCGC GCCGCCCCTC CCATTCAAGA TCGCCGACAG CGTACGGCCG CTGGACCCGA GCCGGCCGGG CGACGTCCAC ACCGGAGGAT TCTCGCTGAC CTGTCGCGCC GCCCCGGCGG TCAACCAGCC CGTCAACCTG GGCTGGGAAT ACCGCTGGCA CAGCCAGCTC GCCGGATACC GGGACCAGGG CATGTGCCGA CCACTGGTGT TGCGGGTCGC CCCACAGGCT CCGACCGGGT TGACCGCGAC CCCGGGCCCC GGCTCGGCCA CCGCCCTGCC GGCCATCCTG CTCGCCTGGA CCGGCAACGG CAACCCGCCC GCCGCCACCA GCCACCTGCT GCAACGGGCC ACCGACCCGA CGTTCACCGA CGGGCTGACC GCGATCACGG TGGCAGCCAG CGCCACCCAC TACACCGACG CGACCGTCAC TCCGGGAGTG ACCTACCACT ACCGCATCCG GGCGGAGAAC GCGGTCAGCT GCTCGGCATG GTCGAACTGT GCGCCCGCGT CGGTGCGGCT CGCCGCACCG ACCAGCCTGG CCGCGGTGGT GCCGCCGACG GCCCCGCTGC GGGTGGCGCT GCACTGGCGT AACCGTTCCT TCGCCACCGG CGTCGACGTG CAGCGCGCCA CCAACCCCAC TTTCACCAGC GGGCCCGGCA CCACGGCCAT CAGCGTGGGC GACACCCACC TCGACCCCGC CGTCGTACCG AACACCCGCT ACTACTACCG GGTCCGCACC ACCTACCTGG GGGCGGCGTC ACCGTTCTCC ACTGTGGCGC AGGTGACCAC ACCGCCCCGG CCCGGCACAC CGGAGGCGGT GACCGTCACC GCGACCGCGT CCGCCCCGGA CACCGCGACC GTCATCCTGG GCTGGGCCGC GAACGCCCCC GCCGGGCCGG GCGGCGGATT CACCGTGCAG CGGGCCGCGG ACCCGAACTT CACCCGCGAG GTCGCCACCT TCACCGTCAA CGGGCGGGGG TTCACCAACA CCGGCCTGGC CCGCAGGGCC ACCTACCACT ACCGGATCCG CGCGTTCAAC GTCGTCGGGA CGTCCCCCTT CACTAACCCG GTCGCGGTGA CGACGCCGGA CTGA
|
Protein sequence | MDLFRRLRIP WAERPATNHM DATDDDRPLV PATRSAQDTA TSPPSEAPNP PPPTLLDPMA PPTALDPAAP PTVLDPMAPP TALDPVALPR CFGPVPNHAH SPLPALNADG EVIPGTGIRK FVDPLPTLDS GYHPAGGLPV AVPDTITYPG CDYYEIGLQE YAQRLHRDLP ATRLRGYRQL NLGTDPTGHN TVAPPEHPVH LGPVIVARRG RPVRVKFINQ LPTGQAGELF LPVDLSIDGA GRGPLDGPAP YPQNRAVLHL AGAQTGWISA GTPWQWITPA AEVTPYPTGP GLAHVPDMPP PGIGATTLYY PNEQSGRLLA FHDNTAGLAR LTIYSGQLGL YLLTDPAEAG LVADGVLPAD QIPLVFEDKT FVPDDSQLAD TDPTWDRDRW GARGSLWHPH VYQPRQNPYR DDGINPTGRW DYGPWTRTAE EMGVNGTTGA DGIGAGPVPN PHHDPVGDTP DEPPMTPGVP HPSAVPAAYG DTVLVNGVAY PWLTVEPRTY RFRVLNACLD RSLNLQLYRA RGTGPMWDRN GELADPDAGE VPMVAATRTP DRPARWPTDG REGGVPDPLA AGPDLVQIGN EGGLLPAPAV LPSQPVTYRH DRRDPTVLNI DRHALLLAPG EQADVLVDFA TVAPGSTVLL YNDAPAPLPD FDPRYDHHTE APDHTAEGGT PPSRPGYGPN TRTLLQFRVA GTPAPHYDLA RLRERLPRAY AAGQLPPIVP QPAYDAAFGT RTARETTVPV HATTVGFTPA GGTDPVLLPL AVKAVQQVFE PAHGRLAGRL GVGPPPGGPL APAVVSLNPT DPATEMVQVG EPTVPMGPAV DGTQLWRIRG ASRQTQPIRL DGVDLQLINR AGWDGTLRPP DPNEVGWKQV IRVNPREDVV VALHPVAPPL PFKIADSVRP LDPSRPGDVH TGGFSLTCRA APAVNQPVNL GWEYRWHSQL AGYRDQGMCR PLVLRVAPQA PTGLTATPGP GSATALPAIL LAWTGNGNPP AATSHLLQRA TDPTFTDGLT AITVAASATH YTDATVTPGV TYHYRIRAEN AVSCSAWSNC APASVRLAAP TSLAAVVPPT APLRVALHWR NRSFATGVDV QRATNPTFTS GPGTTAISVG DTHLDPAVVP NTRYYYRVRT TYLGAASPFS TVAQVTTPPR PGTPEAVTVT ATASAPDTAT VILGWAANAP AGPGGGFTVQ RAADPNFTRE VATFTVNGRG FTNTGLARRA TYHYRIRAFN VVGTSPFTNP VAVTTPD
|
| |