Gene Sare_1924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1924 
Symbol 
ID5708277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2220394 
End bp2222244 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content70% 
IMG OID641271429 
Producthypothetical protein 
Protein accessionYP_001536800 
Protein GI159037547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.348588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00176923 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCCGTC ATGGCTGGCC GAAATCAACG GGCAGCAAAG CCCGCTACGG CCCCGCGGCG 
ATGGAGAACA TGCGTCAAGC CGTCGCCGCT GGCGGGTGGC GCCCGACACC GGAACTGATC
GAGCAGCTCA TGGGCTTTCC ACCCGGCTGG ACGTCGATCG ACTCCAAGCC CTCGGCAACG
CCGTCGTCCC CCACGTCGCC GAACACATCG GCCGCATCAT CCTTGACCAC CACGAAAGCC
AGTGACCTCA TGACACCAGC ACGACTGCAC CTGGTCACCG ACCAGCCGCA ACCCACCACC
AGCAAGGAAA CGACAGGTGG CCCACTCTGG GATGTTCCAG TGCCCCTAAC CGGGCCGTCC
ACCGCGCCGC CGCCGTTCCC GGCCGACGTG TTTCCGACCT GGCTGCGCGA CATGGTGACC
GGCGTCGCCC GGTTCACCCA AACCGACCCG GCCATGGCCG GCACCCTCGC CGTTGCCGTG
CTCTCTGCCT GTGCGGGCGG CCGGTTGGAG GTCGAACCGG TGCCCGGATG GCGGGAGCCG
GTCAACGTGT TCGCCGCCGT CATCGCCGGA CCCGGTGAAC GCAAATCCCC GGTGCACCGC
ACCATGACCG CACCCCTGTT CAGCGCCCAA TCAACCCTCG CCGAAGCGGT ACGCCCGAGG
ATCGCCGAAG CCTCCGCGCT GCGCGACATT GCCGACCGGC AGGCCGAACA GGCCAAAGCC
CAAGCGGCCA AGGCCACCGA CCCGGGCAAG CGCGATGAGG CCGCCGCCGA GGCGGTAGCC
GCCGCTATCT CCGCTGAGGC CATTACCGTG CCCGGCCTGC CCCGGCTGAT CGTGGACGAC
GCCACCCCCG AGGCCCTGAT TGGGTTGATG GCCGCCAATG GCGGACGCAT GGCGATCATC
TCCGATGAGG GCGGCATCTT CGACACCCTC GCCGGCCGCT ACTCCGGCGC ACCCAACCTC
GACCCCTACC TGAAAGGCCA CGCCGGACAA CCGATGAGCA ACGAACGGCA AACCCGCGAA
GGAGCCACCG TCGACAAACC CGCCCTCACC GTCTGCGTCA TGGCGCAGCC CTCGGTGCTG
CGCAAGTTCG GCGGGAACAC CGAGCTCGCC GGACGCGGAC TGCCCGCCCG GTTCCTGTTC
GCCCTGCCCC GCTCCCTCGC TGGCTACCGG GCGGTCGACA CGCCGCCGAT CCTGGAGACG
GTGACCGCCG GCTACCGGCG CCGGGTGCAC GATCTGGCCG CCACCCTCGC CGATCGGGAA
GACCCCGCCG TCGTGGTCCT CACCGAGGAA GCCGGCAGGG TGCGGCGGGC CGCCGCCGAA
CAGGTGGAGG CCGAGCTACG GCCCGGCGGG AGCCTCTACG ACATGCGGGA GTGGGGCAAC
AAGCTCTCCG GGGCAACCCT CCGGCTGGCC GGGCTGCTGC ACGTCGCCCA CCACCCCGCC
GACGCCTGGC GATGCCCCAT CGACGCCGAC CGCATGGCCG ACGCCGTACG CCTCGCCGAG
TTCTTCGCCG CCCACTACCG GGCCGCGCTC ACCACGATCG GCAGTGACAC CGCAATCGAA
CACGCCCGGT ACGTGCTCGG CGTACTCACC ACCAAGGGCA TGAGCACCTT TACCCGCCGG
GAGCTGCACC GCAAGGTGTC CCGCCGGCTC CCGAAGTCCG ACGAGGTGTC GGCAGTGCTC
GCCGAGTTGG CCGCCCTCGG GTGGGTCCGC AACGGACCGG ACGGCCGATA CGAACTGCAC
CCCCGCGCCG TCGCGGAGGA CCCCGAAAGC GTTGACACGC TGACACCCGT CCCGACCGGC
GACGTTTCCG CAGCTCACAG CACGTCCGAG GGTGTCAACG CCCCCCGTTG A
 
Protein sequence
MGRHGWPKST GSKARYGPAA MENMRQAVAA GGWRPTPELI EQLMGFPPGW TSIDSKPSAT 
PSSPTSPNTS AASSLTTTKA SDLMTPARLH LVTDQPQPTT SKETTGGPLW DVPVPLTGPS
TAPPPFPADV FPTWLRDMVT GVARFTQTDP AMAGTLAVAV LSACAGGRLE VEPVPGWREP
VNVFAAVIAG PGERKSPVHR TMTAPLFSAQ STLAEAVRPR IAEASALRDI ADRQAEQAKA
QAAKATDPGK RDEAAAEAVA AAISAEAITV PGLPRLIVDD ATPEALIGLM AANGGRMAII
SDEGGIFDTL AGRYSGAPNL DPYLKGHAGQ PMSNERQTRE GATVDKPALT VCVMAQPSVL
RKFGGNTELA GRGLPARFLF ALPRSLAGYR AVDTPPILET VTAGYRRRVH DLAATLADRE
DPAVVVLTEE AGRVRRAAAE QVEAELRPGG SLYDMREWGN KLSGATLRLA GLLHVAHHPA
DAWRCPIDAD RMADAVRLAE FFAAHYRAAL TTIGSDTAIE HARYVLGVLT TKGMSTFTRR
ELHRKVSRRL PKSDEVSAVL AELAALGWVR NGPDGRYELH PRAVAEDPES VDTLTPVPTG
DVSAAHSTSE GVNAPR