Gene Sare_1293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1293 
Symbol 
ID5706255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1504353 
End bp1505801 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID641270806 
Producthypothetical protein 
Protein accessionYP_001536187 
Protein GI159036934 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0149064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGGAA GAACGGGCAT GGCGCGTCGT ACGCTCCACA TTCTCGCTGT CGCCCTATCC 
TTGTCCGGGG TTTTGCTCAC TGCCACTATG TCGCCCGTTT CGGCGAAGTC ACCCTCCGCG
AAGAATGCGG TGATCGATGA GCGGGTGGCG GAAATCTATG ATGTAACGCG TCGGTATGTG
ACAAAGTTCT ACCCTCGATG GTTCACGCAC GAACAACAGC GCATCCTGTT GCCCAACATT
CTTTTTGGGC CGGATCGGAT GTCCACGGTG TGGCGTGCCG TGGTGGCGCC CAATGATGAC
ACGTTGTACA CATCTGCGGT CATTGGTGTG CGCCACGAGC CAGCCATTTT GGTGATACCG
GAAACGACTG CTACCTACGG TGTCCTTGTC TTGGATGTCT ACGGGGACAT CATCAACACG
AACATCTCGA CGGATTCTGG CGGCACGTAC GCCTTTACGG GTCCAGGCTG GCGAGGGAAA
CTCCCTCCAA ACGTCATTAG GGTGCCACTT CCTGTGACGA GTGCAACCTT GATCATTCGG
ACTGACAAGT TCTCCTCGGA CGGGGAGAAC CAGATCGCAA TTGCTGAGGA GTTTCGGCGG
TCGCTGCGCA TGGGGCCGCT GTTGGCATAC CTGAGGGGTT CGTTGCGGGA AACCCTGATC
GTCCCCGTTT TTTTCTATGG TCGTTCGTTC AAGCGTGACG CGGATACCTT GATCGCAACC
AATCCGATTC GCTTTCTGGA GGAGCTTCAG CGGGCGGTCA AGAGCATCAG AACTCCACCG
CTCACTGGAG AAAACAGAAT CTTGTCGGAG AGGTTCGACA GGTTGTTCAG CAGGGGAGGC
TTCGAGAGGG TGTTCGCGCG AGCGACGCAG GACGCACACG CTGCGATCAT CCGTAATTAT
CTTTCGAACA CCGACGAGAA CAACTGGATC AATTTCCGTA ACATCGGCTT CTGGGGGCGC
AACTATCTCG ACCGGTCCTC GATCACCCAA TTCTGCCAGT ACTGCAATGA CATCGAGAAG
TCGGCGTACT ACGACGTGTT CCTTGACAAA AATGGTCGAC GGCTCAATGG TGCTTCCGGC
GGATACACTA TCACGTTCGA AAGGGATCAG ATTCCTCAGG CGGAACAGTT CTGGTCGGTC
ACCTCGTATA TCGCGGGTAC GATAAGTCTG GTCCCCAACC CGAAGGACAA GTGGGTTGTG
GCCAGTTATA CGCCAGGTCT CGAGCGAAAT AGAGATGGGT CTATTACCAT CTATGCAACA
CCAACCCCTC CGCGCGGGGT GAGCAAGGCC AACTGGCTAC CCATTCCCAA GGGGGACTTC
AGTATGATGC TCCGTGTCTA CGGTCCGCAG GGACGTGTCG CCGACGGCAC GTACGTCCCG
CCGCCGGTGA AGAAGATTGA CCGGCGCGAC ACCGGGCGAC GGCCGTATTC GGGCGGTGGT
AGCTGGTAG
 
Protein sequence
MVGRTGMARR TLHILAVALS LSGVLLTATM SPVSAKSPSA KNAVIDERVA EIYDVTRRYV 
TKFYPRWFTH EQQRILLPNI LFGPDRMSTV WRAVVAPNDD TLYTSAVIGV RHEPAILVIP
ETTATYGVLV LDVYGDIINT NISTDSGGTY AFTGPGWRGK LPPNVIRVPL PVTSATLIIR
TDKFSSDGEN QIAIAEEFRR SLRMGPLLAY LRGSLRETLI VPVFFYGRSF KRDADTLIAT
NPIRFLEELQ RAVKSIRTPP LTGENRILSE RFDRLFSRGG FERVFARATQ DAHAAIIRNY
LSNTDENNWI NFRNIGFWGR NYLDRSSITQ FCQYCNDIEK SAYYDVFLDK NGRRLNGASG
GYTITFERDQ IPQAEQFWSV TSYIAGTISL VPNPKDKWVV ASYTPGLERN RDGSITIYAT
PTPPRGVSKA NWLPIPKGDF SMMLRVYGPQ GRVADGTYVP PPVKKIDRRD TGRRPYSGGG
SW