Gene Information |
Locus tag | Sare_1279 |
Symbol | |
ID | 5706504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1480497 |
End bp | 1483610 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641270794 |
Product | putative large secreted protein |
Protein accession | YP_001536175 |
Protein GI | 159036922 |
COG category | |
COG ID | |
TIGRFAM ID | |
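
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The function names are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1480497
END_BP = 1483610


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced, complete CDS.

    Assumes the final codon is a stop codon that is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3114 1037
```

Both results agree with the Gene Length (3114 bp) and Protein Length (1037 aa) fields in the record.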
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.97722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000946553 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGCGC CTCGCCTGAT CAGTCCCTTC GTGCGCACGC GCACCCGGCT GGCCCTCACC CTCGGGCTCC TGCTGACCGC TGTCGTCACG GCCCTGCTGC CGTGGTGGCC GACAGCCGAC GAGCCCCCGA AGGGAGTCGT CAGCATCGCG GCGGCCCCGC TGAAGGACGA GGCCGCCGCG ATGGCAAAAG CGCTCAGCAC CGGCAAGGAG GTGCTGGTCG AGACAGCCAC CAGCGCCACC TCACTCACCT GGGCGCTGCC GAACGGGCAA CTGCGCTCCA CCTTTCACGC CACGCCACAG CGGACGAAGA GCACCGCAGG TCGATGGAGG CCAGTCGACA CCACGCTGAC CCGCACCGAC ACGACGCCCG ACGGCCTCGG CATCCGACCG GTCAACGCTG TCTCCCCCGT CCGGTTCTCG GCCGGCACCC GCGCGTCCGA CCAAGCGGAC GGCGCGGGCG AGGGCCAGGC TCCTGTCGGT GACGGTGAAA CCGTCCTGGC CGAAGCGGAC GTCGACGGCC ACACCATCGC CTTCACCTGG CCCGGTCACC TGCCAGAACC GGTCCTCGAC GGCCCGCGTG CCCTCTACCC CGACGTTCTC CCCGGCGTGG ACCTGCTCGT CGTCGCCCGC GACGTGGGCG GATTCGGCCA GCTGCTGATC GTCAAGAACC GCGCGGCCGA GACGATCAAG GCCGCCGGCG CCGTGACCTA CGGACTGCGG TCGGAAACGG CGGTCTTCCG CCACAACGCC ACCACCGGGG GGATTCAGGT CCTGGACCGG ACGGGCCAGG AGGTCGGCTC GGTCCCCACC CCGTTCGCCT GGGACTCCGC CGGTCGAGTG GACCCCGACA CCAGGATCCG CACCGCGGTG GACACTCCCG CCGACGTGCT GGAGCTCACC GGCCTCACCG GCAGCGAGCC TGGCGCCCGG AACGCCCAGA TCCCCACCCG GGTGGACGGC GACGGCACCG GCGCCCTCCA CCTGCACCTG GATGCCGCTG CCACCGGGCT CCTGAGCGAC CCGGACACGC TCTTCCCCGT CTTCCTGGAC CCGACGCTCA ACAGCGGCGT GGTCGACTGG GCGACCGTCT ACTCGCAGTA CCCCACCACC AACACCTGGA ACGGAACCAA CTTCAACTCC GGAACGACCG ACGCGCGGGT GGGATACATA TCGAGCGTCC CGCTGCGGAC CCGCTCGTTC TGGCGGATGG GCTTCAGCAG CTCCCTACGG GGCGCGACGG TCAGCTCGGC GACCTTCAAG GTGCTGAACA ACCACTCCTA CAACTGCGAA CGACGGGAGA TGCAGCTGTG GCTCGTCGGC TCCATCTCCT CCGGCACGAC CTGGAACGCC CAGCCCAGCT ACATGGCCTT GCAGCAGAAG CTCGCGTTCG CGCACGGCTA CGGCAGCAGT TGCGCCGACG AATATGTGAG CTTCAACGTG AAGAACGCGG CCCAGCGGGG CGCGGACGGA GGGTGGTCGA GCTTCAACCT GGGAATGCGG GCCACGAGTG AGTCCGACAC CAAGACCTGG CGCAAGTTCA AGGCGAGCTC CGCGAGCCTG TCGGTCACCT ACAACCGCGC GCCCAACACC CCGACCAGCC TCACCGCCTC CCCTGGTGGT GCCTGTGCTC CCACCGGGGT CACAGTCGCC AAGACGGACC TCACCCTGTC CGCGACCGCG ACCGACCCCG ACGGCAACCT GAAGGGCCTA CGCTTCCGCT TCTGGAAGAG TGGCTCGGCG GTTCCCACCG GCACGCTGGT CACCACCACC AGCGCCGGCA AGGCCAGCCT GACGGTCCCC 
AGCACCACCC TGGTCGACGA GGGCGTCTAC CTGTGGAACG TGCGCGCCGA GGACACCTCC AACGCGGCCT CCGGCTGGAA CCCGCCCAGC ACACCGTGCA CGCTCACCGT GGACGCCTCG GCACCACCGG CGCCAGTCGT CGACAGCGAC GTGTTCCTGG AAGCCACCCC CGACGGGGCC ACCTGGGCGA CCGTGAAGTT CGGGCAGACC GGACCGGTCA CCTTCACCGC CGCCGGGGCA GCAAGGTTCA GCTACGCCTT CGAGGCGATC GGCACCACGT ACGTGGACGC CACCGACGGC ACCGCTACTG TGCCGGACCT GAAACCCCGG CACGCCGGAC CCACCACCCT GCACGTCTAC GCCTACGACA ACGTCGGCAA CAAGAGTGCC CGGACGGACT ACTCCTTCTA CGTACCGCCC CGCGACACCG CGGACGGGCC CGGAGACACC GGCGGCGATG GGATCCCCGA CCTGCTCCTC GTCGATTCCA CCGGCAACCT ACGGAACTAC GCGGGTGACG TGGACGGCGA ACTGTACGCC TGGCAGGCCG CCTCCTACAC CGGGGAGGGA ACGCTCAACC CGCCCGGTCA CTGGTACGAC CCGGAAACCG ACACGGCCGC GCTGATCACC AAACACTCCG ACGCCTACCC GGGTGACGGC TCCACCGACC TGTTCGCCCG AACCCCGGAC GGTGGCTTCT GGCTCTACCC CGGCGACGGG TACGGCACCT TCAACGTCGA CGACCGGCTA CGCGTCCTGC TGCCGGACAA CACACCCGAT CCCGCGACCT GGACCCAGAT CAAGGCGCTC GGCGACGTCA CCGGCGACGG GCACCCCGAT CTGGTCCTAC GGGCCGGGAC TGCGTTCTGG ACGCTGAGCG GTTACACGGG CGCCAGCTTC CAGGAAGCGA TCCTGATGAA CGGGAACGCG TGGGCGCGCC GGGAGATCGT CAACGTCGCG GACATCGACC TGGACAGCAC CCCGGACCTG CTCTGGCGGA ACCTGGACAA CGGCAACATG TACATCCGCC ACGGGAAACC GGGCGCGGTC ACCGGCAGCG TCGATCTGGA TTCGATCAAA CTCGCGGCGA ACTCCCGTGA GGGCGACGTC TCCTACGGCG TCAGCTGGAC GGAAACCAAC GTCAACGCGG TGATCGGTAT CCCCGATGTG AACGGGAACG GCGTCCCTGA CCTGTGGGCC CGATTCGGTC AGGACGGCAT GATGCGGATC TACCATCCGT CGACCATCAA CACCCACGGC CCAGTGAAGA TCGTGCTGGG GGACGACTGG AACGGCGTCA AGGCCTTCGG CTGA
|
Protein sequence | MTAPRLISPF VRTRTRLALT LGLLLTAVVT ALLPWWPTAD EPPKGVVSIA AAPLKDEAAA MAKALSTGKE VLVETATSAT SLTWALPNGQ LRSTFHATPQ RTKSTAGRWR PVDTTLTRTD TTPDGLGIRP VNAVSPVRFS AGTRASDQAD GAGEGQAPVG DGETVLAEAD VDGHTIAFTW PGHLPEPVLD GPRALYPDVL PGVDLLVVAR DVGGFGQLLI VKNRAAETIK AAGAVTYGLR SETAVFRHNA TTGGIQVLDR TGQEVGSVPT PFAWDSAGRV DPDTRIRTAV DTPADVLELT GLTGSEPGAR NAQIPTRVDG DGTGALHLHL DAAATGLLSD PDTLFPVFLD PTLNSGVVDW ATVYSQYPTT NTWNGTNFNS GTTDARVGYI SSVPLRTRSF WRMGFSSSLR GATVSSATFK VLNNHSYNCE RREMQLWLVG SISSGTTWNA QPSYMALQQK LAFAHGYGSS CADEYVSFNV KNAAQRGADG GWSSFNLGMR ATSESDTKTW RKFKASSASL SVTYNRAPNT PTSLTASPGG ACAPTGVTVA KTDLTLSATA TDPDGNLKGL RFRFWKSGSA VPTGTLVTTT SAGKASLTVP STTLVDEGVY LWNVRAEDTS NAASGWNPPS TPCTLTVDAS APPAPVVDSD VFLEATPDGA TWATVKFGQT GPVTFTAAGA ARFSYAFEAI GTTYVDATDG TATVPDLKPR HAGPTTLHVY AYDNVGNKSA RTDYSFYVPP RDTADGPGDT GGDGIPDLLL VDSTGNLRNY AGDVDGELYA WQAASYTGEG TLNPPGHWYD PETDTAALIT KHSDAYPGDG STDLFARTPD GGFWLYPGDG YGTFNVDDRL RVLLPDNTPD PATWTQIKAL GDVTGDGHPD LVLRAGTAFW TLSGYTGASF QEAILMNGNA WARREIVNVA DIDLDSTPDL LWRNLDNGNM YIRHGKPGAV TGSVDLDSIK LAANSREGDV SYGVSWTETN VNAVIGIPDV NGNGVPDLWA RFGQDGMMRI YHPSTINTHG PVKIVLGDDW NGVKAFG