Gene Sare_4736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4736 
Symbol 
ID5704561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5359357 
End bp5361288 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content71% 
IMG OID641274134 
Producthypothetical protein 
Protein accessionYP_001539480 
Protein GI159040227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00027921 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGGGG CCCGGACGCG GACGGTGGCG CTTCTGCTGG CCCTCGGGGT CCTCGCCGGG 
GCCTCGATCG GCTGGCCGGG GGTCGGCGCG GCGGCGGCAC CGGCCACCGG AGTGACCGCG
CTAGCCCCCT CCGGCCCCTG CACCACCGAG GAGTGGCAGG CCGACTTCCG CTCCTGCGTG
TCCCGGCTCA CGGAGGTCGC CGAGGACGAG GTCACCTGCC GTAACGCCCC GACTCCGACG
GCGCCGGACT CCGGCCTCGC GGGTTGGTTC GCAACCCGCC CGGATTCTGC GAAGCAGGAC
GGCCGAAAGG GACTCTACAG CGACTACGGC TACGCAGGGT ACAGCTACAG CACGTACGAC
ATCGACACCG GCTGCGCCAC CGCGGTGCTG CACCCCGACT ACAAGTTCAC CAACACGCTC
GCCAATGGCG AGTTCATGAT CGCCACCGCG ATCGTCGGCG CCTCGAACGC TCTCCGGGAG
CGGGCCTGGG ATCCGCGGTC GATGTGGGGC TGGGCCGATC CGCTGGTGGA CCAGGCCACC
AAGGCGGTCT ACCAGAAGGT GTTCAGTGTC TTCGGGATCG TCACGCTCTG CGTGGTCGGG
CTGTACCTGC TCTGGCGCTC CCGCCAGTCG GACATGAGCA ACGCCATGAC GACCGCGGGC
TGGGCCCTGC TCGTAATGGT GGCGGTGACC GCGTTGGCCG CCTGGCCGGT CAAGTCCGCC
AACGTCGCCG ACGGCACCCT CATCACCACG CTGGGCGTCG TCCACGACGC CGTTGGTCCG
ACGTCCAAGG ACACGCCGCC GGACCAGTGC CCGGTCCCGA ATCCGGAGGC CTGCGTCGAC
AACCGACCCC CGGCGGTACG GGCGAGCGAC ACGGCCACCG AGACCATGCT GTACCGCAAC
TGGCTGCGGG GAGTGCTGGG CTCCGCCGAC AGCGAGACCG CGAAGAAGTA CGGGCCAGCA
CTGTACGATG CCAAGTCCTT CTCCTGGGAG GAGGTGGAGA GGCTCCGCTC CAACCCAGCG
ACGCGTGATC TGACGATCAA ACTCAAGAAG CAGCAATGGG CGCGGGTCGC CCAGCAGATC
GCGCAGGAGG ATCCGGAGGC GTACGAGTAC CTCCAGGGGG TCCGGGACAT GGACCGGGTC
GGCGCCGGGT TCATCGCGGT GCTCGCCGCG CTGCTCTTCG CGATGTTCGA CCTCACCGCG
TCGGTGCTGG TGCTGTTGGG CTTCCTGATC TTCCGATGGG CGGTGATCGT CGCACCCATC
CTCGGCACCA TCGGCCTGCT GCGCCCAGCG AGCACCGGCC TGCGTCGGCT GGGCAACGCC
GTGGTCGCGG CCGTCTTCAA CATCGCCATC TTCGGCACCG GTGCCGCGAT CTACCTGTTC
GCCGTGGACC TCATCATGAG CACGCCAACC CTGCCCGGCT GGCTCCAGGT GGTCCTGGTC
TGGCTCTGCG GGCTGGTCGG CTGGCTGCTG CTGCGCCCCT ACCGGCGGAT CACCCAGCTC
GGCGGCAAGG GCGGCAGCGA CGCGGGTAGC TCCCCCGGCT CCTGGCACCG CCGGTTCTTC
CGCGACATGC GCGCCGCGGC TCGCCTCGAC GCCGCTGAAC CCCGCGGCAC CGGCGAGCCG
ACGTCGGGCC GTCGCAGGCC GGTGGTCGCG GAGCAGAGCA GGCTACGCCC GGAGGCCCGA
CACGAGGACC CGGTGCCCTC GTCCGATCGC GGTGACCGGC CGGACGGCCG CGGGCGGACG
GAGGACGTGG CGAGGTCCGG GGAGCGGACC ACCGAGGGCC GCACCCCCGC CCCACGGCAA
CGGCGGCAGC CGGCCACCTG GACCGAACCG GACGGCCCGC AGGAAAGCCC CTCCTTCGTC
ATCTACCGAC CGGGCACGGC GAAGCCCGCG CCCGAACGCA GCACTCCCCG AATTCGCTCC
GAGGCCCGGT GA
 
Protein sequence
MAGARTRTVA LLLALGVLAG ASIGWPGVGA AAAPATGVTA LAPSGPCTTE EWQADFRSCV 
SRLTEVAEDE VTCRNAPTPT APDSGLAGWF ATRPDSAKQD GRKGLYSDYG YAGYSYSTYD
IDTGCATAVL HPDYKFTNTL ANGEFMIATA IVGASNALRE RAWDPRSMWG WADPLVDQAT
KAVYQKVFSV FGIVTLCVVG LYLLWRSRQS DMSNAMTTAG WALLVMVAVT ALAAWPVKSA
NVADGTLITT LGVVHDAVGP TSKDTPPDQC PVPNPEACVD NRPPAVRASD TATETMLYRN
WLRGVLGSAD SETAKKYGPA LYDAKSFSWE EVERLRSNPA TRDLTIKLKK QQWARVAQQI
AQEDPEAYEY LQGVRDMDRV GAGFIAVLAA LLFAMFDLTA SVLVLLGFLI FRWAVIVAPI
LGTIGLLRPA STGLRRLGNA VVAAVFNIAI FGTGAAIYLF AVDLIMSTPT LPGWLQVVLV
WLCGLVGWLL LRPYRRITQL GGKGGSDAGS SPGSWHRRFF RDMRAAARLD AAEPRGTGEP
TSGRRRPVVA EQSRLRPEAR HEDPVPSSDR GDRPDGRGRT EDVARSGERT TEGRTPAPRQ
RRQPATWTEP DGPQESPSFV IYRPGTAKPA PERSTPRIRS EAR