Gene Sare_1279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1279 
Symbol 
ID5706504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1480497 
End bp1483610 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content69% 
IMG OID641270794 
Productputative large secreted protein 
Protein accessionYP_001536175 
Protein GI159036922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.97722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000946553 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCGCGC CTCGCCTGAT CAGTCCCTTC GTGCGCACGC GCACCCGGCT GGCCCTCACC 
CTCGGGCTCC TGCTGACCGC TGTCGTCACG GCCCTGCTGC CGTGGTGGCC GACAGCCGAC
GAGCCCCCGA AGGGAGTCGT CAGCATCGCG GCGGCCCCGC TGAAGGACGA GGCCGCCGCG
ATGGCAAAAG CGCTCAGCAC CGGCAAGGAG GTGCTGGTCG AGACAGCCAC CAGCGCCACC
TCACTCACCT GGGCGCTGCC GAACGGGCAA CTGCGCTCCA CCTTTCACGC CACGCCACAG
CGGACGAAGA GCACCGCAGG TCGATGGAGG CCAGTCGACA CCACGCTGAC CCGCACCGAC
ACGACGCCCG ACGGCCTCGG CATCCGACCG GTCAACGCTG TCTCCCCCGT CCGGTTCTCG
GCCGGCACCC GCGCGTCCGA CCAAGCGGAC GGCGCGGGCG AGGGCCAGGC TCCTGTCGGT
GACGGTGAAA CCGTCCTGGC CGAAGCGGAC GTCGACGGCC ACACCATCGC CTTCACCTGG
CCCGGTCACC TGCCAGAACC GGTCCTCGAC GGCCCGCGTG CCCTCTACCC CGACGTTCTC
CCCGGCGTGG ACCTGCTCGT CGTCGCCCGC GACGTGGGCG GATTCGGCCA GCTGCTGATC
GTCAAGAACC GCGCGGCCGA GACGATCAAG GCCGCCGGCG CCGTGACCTA CGGACTGCGG
TCGGAAACGG CGGTCTTCCG CCACAACGCC ACCACCGGGG GGATTCAGGT CCTGGACCGG
ACGGGCCAGG AGGTCGGCTC GGTCCCCACC CCGTTCGCCT GGGACTCCGC CGGTCGAGTG
GACCCCGACA CCAGGATCCG CACCGCGGTG GACACTCCCG CCGACGTGCT GGAGCTCACC
GGCCTCACCG GCAGCGAGCC TGGCGCCCGG AACGCCCAGA TCCCCACCCG GGTGGACGGC
GACGGCACCG GCGCCCTCCA CCTGCACCTG GATGCCGCTG CCACCGGGCT CCTGAGCGAC
CCGGACACGC TCTTCCCCGT CTTCCTGGAC CCGACGCTCA ACAGCGGCGT GGTCGACTGG
GCGACCGTCT ACTCGCAGTA CCCCACCACC AACACCTGGA ACGGAACCAA CTTCAACTCC
GGAACGACCG ACGCGCGGGT GGGATACATA TCGAGCGTCC CGCTGCGGAC CCGCTCGTTC
TGGCGGATGG GCTTCAGCAG CTCCCTACGG GGCGCGACGG TCAGCTCGGC GACCTTCAAG
GTGCTGAACA ACCACTCCTA CAACTGCGAA CGACGGGAGA TGCAGCTGTG GCTCGTCGGC
TCCATCTCCT CCGGCACGAC CTGGAACGCC CAGCCCAGCT ACATGGCCTT GCAGCAGAAG
CTCGCGTTCG CGCACGGCTA CGGCAGCAGT TGCGCCGACG AATATGTGAG CTTCAACGTG
AAGAACGCGG CCCAGCGGGG CGCGGACGGA GGGTGGTCGA GCTTCAACCT GGGAATGCGG
GCCACGAGTG AGTCCGACAC CAAGACCTGG CGCAAGTTCA AGGCGAGCTC CGCGAGCCTG
TCGGTCACCT ACAACCGCGC GCCCAACACC CCGACCAGCC TCACCGCCTC CCCTGGTGGT
GCCTGTGCTC CCACCGGGGT CACAGTCGCC AAGACGGACC TCACCCTGTC CGCGACCGCG
ACCGACCCCG ACGGCAACCT GAAGGGCCTA CGCTTCCGCT TCTGGAAGAG TGGCTCGGCG
GTTCCCACCG GCACGCTGGT CACCACCACC AGCGCCGGCA AGGCCAGCCT GACGGTCCCC
AGCACCACCC TGGTCGACGA GGGCGTCTAC CTGTGGAACG TGCGCGCCGA GGACACCTCC
AACGCGGCCT CCGGCTGGAA CCCGCCCAGC ACACCGTGCA CGCTCACCGT GGACGCCTCG
GCACCACCGG CGCCAGTCGT CGACAGCGAC GTGTTCCTGG AAGCCACCCC CGACGGGGCC
ACCTGGGCGA CCGTGAAGTT CGGGCAGACC GGACCGGTCA CCTTCACCGC CGCCGGGGCA
GCAAGGTTCA GCTACGCCTT CGAGGCGATC GGCACCACGT ACGTGGACGC CACCGACGGC
ACCGCTACTG TGCCGGACCT GAAACCCCGG CACGCCGGAC CCACCACCCT GCACGTCTAC
GCCTACGACA ACGTCGGCAA CAAGAGTGCC CGGACGGACT ACTCCTTCTA CGTACCGCCC
CGCGACACCG CGGACGGGCC CGGAGACACC GGCGGCGATG GGATCCCCGA CCTGCTCCTC
GTCGATTCCA CCGGCAACCT ACGGAACTAC GCGGGTGACG TGGACGGCGA ACTGTACGCC
TGGCAGGCCG CCTCCTACAC CGGGGAGGGA ACGCTCAACC CGCCCGGTCA CTGGTACGAC
CCGGAAACCG ACACGGCCGC GCTGATCACC AAACACTCCG ACGCCTACCC GGGTGACGGC
TCCACCGACC TGTTCGCCCG AACCCCGGAC GGTGGCTTCT GGCTCTACCC CGGCGACGGG
TACGGCACCT TCAACGTCGA CGACCGGCTA CGCGTCCTGC TGCCGGACAA CACACCCGAT
CCCGCGACCT GGACCCAGAT CAAGGCGCTC GGCGACGTCA CCGGCGACGG GCACCCCGAT
CTGGTCCTAC GGGCCGGGAC TGCGTTCTGG ACGCTGAGCG GTTACACGGG CGCCAGCTTC
CAGGAAGCGA TCCTGATGAA CGGGAACGCG TGGGCGCGCC GGGAGATCGT CAACGTCGCG
GACATCGACC TGGACAGCAC CCCGGACCTG CTCTGGCGGA ACCTGGACAA CGGCAACATG
TACATCCGCC ACGGGAAACC GGGCGCGGTC ACCGGCAGCG TCGATCTGGA TTCGATCAAA
CTCGCGGCGA ACTCCCGTGA GGGCGACGTC TCCTACGGCG TCAGCTGGAC GGAAACCAAC
GTCAACGCGG TGATCGGTAT CCCCGATGTG AACGGGAACG GCGTCCCTGA CCTGTGGGCC
CGATTCGGTC AGGACGGCAT GATGCGGATC TACCATCCGT CGACCATCAA CACCCACGGC
CCAGTGAAGA TCGTGCTGGG GGACGACTGG AACGGCGTCA AGGCCTTCGG CTGA
 
Protein sequence
MTAPRLISPF VRTRTRLALT LGLLLTAVVT ALLPWWPTAD EPPKGVVSIA AAPLKDEAAA 
MAKALSTGKE VLVETATSAT SLTWALPNGQ LRSTFHATPQ RTKSTAGRWR PVDTTLTRTD
TTPDGLGIRP VNAVSPVRFS AGTRASDQAD GAGEGQAPVG DGETVLAEAD VDGHTIAFTW
PGHLPEPVLD GPRALYPDVL PGVDLLVVAR DVGGFGQLLI VKNRAAETIK AAGAVTYGLR
SETAVFRHNA TTGGIQVLDR TGQEVGSVPT PFAWDSAGRV DPDTRIRTAV DTPADVLELT
GLTGSEPGAR NAQIPTRVDG DGTGALHLHL DAAATGLLSD PDTLFPVFLD PTLNSGVVDW
ATVYSQYPTT NTWNGTNFNS GTTDARVGYI SSVPLRTRSF WRMGFSSSLR GATVSSATFK
VLNNHSYNCE RREMQLWLVG SISSGTTWNA QPSYMALQQK LAFAHGYGSS CADEYVSFNV
KNAAQRGADG GWSSFNLGMR ATSESDTKTW RKFKASSASL SVTYNRAPNT PTSLTASPGG
ACAPTGVTVA KTDLTLSATA TDPDGNLKGL RFRFWKSGSA VPTGTLVTTT SAGKASLTVP
STTLVDEGVY LWNVRAEDTS NAASGWNPPS TPCTLTVDAS APPAPVVDSD VFLEATPDGA
TWATVKFGQT GPVTFTAAGA ARFSYAFEAI GTTYVDATDG TATVPDLKPR HAGPTTLHVY
AYDNVGNKSA RTDYSFYVPP RDTADGPGDT GGDGIPDLLL VDSTGNLRNY AGDVDGELYA
WQAASYTGEG TLNPPGHWYD PETDTAALIT KHSDAYPGDG STDLFARTPD GGFWLYPGDG
YGTFNVDDRL RVLLPDNTPD PATWTQIKAL GDVTGDGHPD LVLRAGTAFW TLSGYTGASF
QEAILMNGNA WARREIVNVA DIDLDSTPDL LWRNLDNGNM YIRHGKPGAV TGSVDLDSIK
LAANSREGDV SYGVSWTETN VNAVIGIPDV NGNGVPDLWA RFGQDGMMRI YHPSTINTHG
PVKIVLGDDW NGVKAFG