Gene Sare_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5041 
Symbol 
ID5707312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5708162 
End bp5710039 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content71% 
IMG OID641274434 
Producthypothetical protein 
Protein accessionYP_001539775 
Protein GI159040522 
COG category[R] General function prediction only 
COG ID[COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.025652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCGC GCCGTCCCAC CGTCGTCGCG GGCACGCTGC TGGCGCTCCT GCTGCTCGCG 
GGCAGCGCCG CCGCGACGCG TCCACCGGCA CCGAGCAGGC CGCCCGCCGG GCCGGGGTCA
CCGATCCAGT TGGTCTCCTT CACCTCCTGC GCCGACGCGC TGGCCGAACT CCGCGCCGTG
ACCACCGCCG CCGTCAATCC GCGAGGGCTC CCCGGCGAGG CCCTCCCCCT TTCGTCGGGC
CCCACCGACG ACGTCGCCAA GTCGGCGCCA GCCAGCGAGC ACTCGGTCAC CAACAGCTAC
GAGCCCGGCG TCGACGAACC GGATCTCGTC AAGACCGACG GGCAGCGAAT CGTCATGCTC
AGCCAACAGG GTGTGCTTCA TGTCGTTGAC CCCGTCACCA ACCGGTTCAC CGGGAGGCTG
AACATCAGCC GTGCCTCTTA CTGGGGTCGG TACGACCTGC TCCTGCACGG CGATCACGCC
CTGATCCTCA CCGACGCGGA ACTTATGGTC CGCCCAGCGG TCGACGCCGG TGGCGATGAA
ACTGCCAGCG AGCCGGCCGG AATGAGCTTT CACGCCGAGC CCTCCACGAC CCGACTCCTC
CTGGTCGACC TGAGCGGCCC ACCCCGGGTG CTGGGCACAT ACAAGATCCG AGGCCGCACC
GTTGACGCCC GGCAGACCGG GAGCACCGTC CGGGTGGTGG TCCGGTCTCA CGCTCAGGTG
CCCTTCCCGG AGCTGCCCGC CACCGCCGAC GAGGCGGCCC GCGAGGCGGC CAACCGGGCC
GCGGTGGCTA CCGCGGGCAT CGAGGCGTGG CTGCCAACCT ACGAGTGGAC GGCCGGAACG
CAAAAGGGGA GCGGTCGAGT CGACTGCGAC CGGCTCAGCC GCCCGCAAAC CGGCACGGGC
TCCACCATGC TGACCGTACT CAGTTTCGAC CTCACCGCCG ACCGGCTCAC CGACGGAAAC
CCCGTCAGCG TGGCCGCCGA CGCGGACACC GTCTACAGCA CGGGCGGCAG CCTCTACCTG
GTGGGCCAGC GATGGGTGGA GGTGCCGCCG GCCCCGGACC GACGGCCCGG CCAGATCGGC
GAGGCGATCA CCGACATCTA CCAGTTCGAC ACCGCCGCTG CCGGCCGTCC CCGGTACGTC
GCCGCCGGCA CGATTCCCGG CCGCCTGATC AACCAGTACG CGCTGTCGGA GTGGCAGGGC
CACCTACGCG TCGCCACCAC CACAGGACAG GACGAACGCA CCTCGGAATC CGGCGTACAC
GTGTTGCGCC GGCAGGGCGA CACGCTGACC CCGACGGGCG CGGTCACCGG CCTGGGCCCG
GGGGAATGGA TCCGGTCGGT GCGCTATCTC GGCGACACCG CCTACGTGGT GACGTTCCGG
CAGACCGACC CGCTCTACGC GCTCGACCTG AGCGACCACA CCGCACCCCG AGTCACGGGC
GAGTTGAAGA TCACCGGCTA CTCGGCGTAC CTGCACCCGA TCGCGGACGG CCGGCTGCTC
GGCATCGGGC AGGAGGCCGA CCTCGACGGA CGCGTACAAG GTGTCCAGGT CTCACTCTTC
GACGTCCGGG ATCCGGCCCG ACCGCTCCGG TTGGATCACT GGCACCGCCC GAACGCCTGG
TCCGTGGCCG AGCACGACCC GCACGCCTTC CGATACGACC CGAAAACCGG GCTACTCGCC
GTTCCGGTCG ACGCCGGCCT GCGCCTGCTG CGGGTCTCCG GGGACACCCT CACCGACCGG
GGCGAGGTGA CTCACCCGGA GGGGGTCATC AGCCGGTCGT TGCTCGTCGG TGACACGCTG
TGGACGGTGT CGGACGTGGG CCTGCGGGCC AGCGACCCAA CGACCGGACG GAGCCTGGCC
TGGCTCCCCA CCACCTGA
 
Protein sequence
MRSRRPTVVA GTLLALLLLA GSAAATRPPA PSRPPAGPGS PIQLVSFTSC ADALAELRAV 
TTAAVNPRGL PGEALPLSSG PTDDVAKSAP ASEHSVTNSY EPGVDEPDLV KTDGQRIVML
SQQGVLHVVD PVTNRFTGRL NISRASYWGR YDLLLHGDHA LILTDAELMV RPAVDAGGDE
TASEPAGMSF HAEPSTTRLL LVDLSGPPRV LGTYKIRGRT VDARQTGSTV RVVVRSHAQV
PFPELPATAD EAAREAANRA AVATAGIEAW LPTYEWTAGT QKGSGRVDCD RLSRPQTGTG
STMLTVLSFD LTADRLTDGN PVSVAADADT VYSTGGSLYL VGQRWVEVPP APDRRPGQIG
EAITDIYQFD TAAAGRPRYV AAGTIPGRLI NQYALSEWQG HLRVATTTGQ DERTSESGVH
VLRRQGDTLT PTGAVTGLGP GEWIRSVRYL GDTAYVVTFR QTDPLYALDL SDHTAPRVTG
ELKITGYSAY LHPIADGRLL GIGQEADLDG RVQGVQVSLF DVRDPARPLR LDHWHRPNAW
SVAEHDPHAF RYDPKTGLLA VPVDAGLRLL RVSGDTLTDR GEVTHPEGVI SRSLLVGDTL
WTVSDVGLRA SDPTTGRSLA WLPTT