Gene Sare_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0453 
Symbol 
ID5705450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp519725 
End bp521992 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content76% 
IMG OID641269978 
Producthypothetical protein 
Protein accessionYP_001535373 
Protein GI159036120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.968901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000555714 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGTGATG CAGGGCAACC CGGTGCCGCG ACCCCGGGAG AGCACGGTGA CGGGACCGGT 
CCGCGGCCGG ACCGACCCCG TTCTGGCATT CCGTGGGCGC CCGCCACGGG CGGCTGGTCG
TCGGACTCGA CCCCACCCTG GCACCGATCC GATCCGCCCG CCAGTTGGGC CAACTCCTCG
TCGCGCCACG GGGACCTGCC AGCTCCTTCG CCCGGTGTGC CCGACAGCAC CTCTCGCCCT
CGCCTCAACG GGTTCCGGGT CAACGGCCAC TCGTACCCCG AGCCGGAGGC CTCGGCCGGC
CGGCCGGGTC CGGTGAGCGC GCCGCCCCAG GACGCCGCCG GTGTCGGGTC CGCGGTGACC
GCGCCCGCCG TCGAGCAGCC ACCCGTGCCG GCCGAACCGT ATCCGGTCCC TGGCCGGCGT
GCCTCCGACG AGTCGGCCGG GCCACCCCGC CGCTCCGAAC CCGGCTCGGT GCCGCCCACC
GTCCGGGTGT CACCGCACGA CACGGCGGCG AAGGGCTTCG AGGTGCCGCC CGGGTTCCAC
CCGTCAGCCA GTGGCTGGAA AACCGAGTCC GCGGGAGGCG GCGGGTCGGA CCATCCGGCC
GACACCGGCC CCACGCCGCT ACCCGAACCG GATTGGTCCG ACCCGAGCTG GAACCGGCCG
AGCTGGGGGT CCGAGTGGGC GCCGCCGTGG GCAGACGGTG AGTCGACCTC GTTCGGTGGC
CAGGCTGTAG CAGAGCCCGA CCCGACCGGG TGGGACGACG CGGAGTCGCG TGGCCGGCGG
TTCCGGGGTG GGATGGAGCG GGACGAGTCA GCGGAGCAGT CGGAGACGAC CGGCCGCCGG
TTCCGTCCGG ACCCCGACGT CACCGAGGCG GGCGAGGGGC GTCGGGTGCG TGCGGAGCGT
CCCGCCTGGG CCGTCGAGCC GCAGCGGGCG TACCAGCCGA TCCCGGCGCA CCGCTCACCG
GAGGTCCCTG CGGCCGGAGC CGCCGAGCAG CCGGCCGGGC CGGCGGAGGC CGCGCCCGCC
GAGAGTGGCC CCGGCGGCTG GGACCGCGTC CCGTCCGGGT CCGTTCCGAC GAGCACACCC
CCCTACACCG CCCGTCGATC TGCCCCCGAA CCGGCACCGA CGGCACAGTC GGCCCAGGCC
GACCGACTGC CGGATTCGGC GTCGGCGATG TTGCCGCAGC GTGTCCCCGC CAAACCGGAT
GTGCCCGTCG TGCCGGAGCC GCCAGCCGTG GAGCCACCCG CCCAGACACC GGAACTCGCC
CGTATCGCCA CCCACCTCCG TCAGGCCGAC GAGCCTCCTT CGCTGCGTGA GCGTCCGGAG
GGCTTCGACG TGGACGCCAT CCTCGGTGCG GTTCGTGGCG TGGCGGGTGT CCGCGACGCG
GCGTTGCGCC GTACCCCGGC CGGCGCACAC AGCCTGCGGC TGGACCTCGC CGACGGTGCG
GACCCGGCCG AGGTGAGCCG GCACGTCGCC CGGCTCCTGC AGGAACGGAT GGGCCTCGCC
GCCGCCCCGC AGACCATGTC CGACGAGCAC CCCGAACCAG TGCCGCGCGC CCGCCGGTGG
GCGGCCGAGC CGGCCCGCGA CGAACCGGGG CCGGCAACGA CCGGATGGTC TCCGGGTGGC
CTGCCGTCGC GCACCGAACG TCGGGGCGCG GCGCTGTCGG AGCCTCCGCG CCGTCGGCGG
CACCCGGGCG GAACGCACCG GGGACGGGCC GTCGTGGACG CGTTGACGGA TACTCCCGGC
GGAGCCACCG ACGGCCCGAC GACGCTCGAG GCGTCGTACA CCGGCGGGCA GGTCAGCACC
ACCGAGACGG CCCCGTCGCG CCCGCTGGAC ACGGGTGGTC GGCCCGGTCC CCGGGTAGTG
CTCGACCACG TGCAGGTGAG CACGTTCGGG CTCGACGCCA CGGTGGAGGT CCGCCTGGTC
TCCGCCGGGT CGCCCGCGGC CGGGTACGCG ACCGGGCCGG CGGTTGACGG GTACGTGTTG
CGGCTCTGCG CGGCGGCCGC CGCCGCAGCG GTGGACGAGT TGCTGCGCGA ACCAGGGCCG
AAGCCGGACC GGGGGCGCTG CTTCGTCGAG CACGCCACGT TGGTGCCGCT TGGCACCTGC
GAGGTGGCGA CGGTCGTGGT ACTGCTGGTC TGTGACGGTT GGGTCGAGCA GCTCGCCGGC
TCCGCGTTGG TCGCAGGTGA CCCTCGGCAG GCGGTCGTAC GGGCCACGCT GGCCGCGGTG
AATCGTCGCC TCGAGGCGTT GCTCGTCGAA ACCGGGGACC TCGGGTAG
 
Protein sequence
MSDAGQPGAA TPGEHGDGTG PRPDRPRSGI PWAPATGGWS SDSTPPWHRS DPPASWANSS 
SRHGDLPAPS PGVPDSTSRP RLNGFRVNGH SYPEPEASAG RPGPVSAPPQ DAAGVGSAVT
APAVEQPPVP AEPYPVPGRR ASDESAGPPR RSEPGSVPPT VRVSPHDTAA KGFEVPPGFH
PSASGWKTES AGGGGSDHPA DTGPTPLPEP DWSDPSWNRP SWGSEWAPPW ADGESTSFGG
QAVAEPDPTG WDDAESRGRR FRGGMERDES AEQSETTGRR FRPDPDVTEA GEGRRVRAER
PAWAVEPQRA YQPIPAHRSP EVPAAGAAEQ PAGPAEAAPA ESGPGGWDRV PSGSVPTSTP
PYTARRSAPE PAPTAQSAQA DRLPDSASAM LPQRVPAKPD VPVVPEPPAV EPPAQTPELA
RIATHLRQAD EPPSLRERPE GFDVDAILGA VRGVAGVRDA ALRRTPAGAH SLRLDLADGA
DPAEVSRHVA RLLQERMGLA AAPQTMSDEH PEPVPRARRW AAEPARDEPG PATTGWSPGG
LPSRTERRGA ALSEPPRRRR HPGGTHRGRA VVDALTDTPG GATDGPTTLE ASYTGGQVST
TETAPSRPLD TGGRPGPRVV LDHVQVSTFG LDATVEVRLV SAGSPAAGYA TGPAVDGYVL
RLCAAAAAAA VDELLREPGP KPDRGRCFVE HATLVPLGTC EVATVVVLLV CDGWVEQLAG
SALVAGDPRQ AVVRATLAAV NRRLEALLVE TGDLG