Gene Sare_3106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3106 
Symbol 
ID5706580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3529981 
End bp3531336 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content69% 
IMG OID641272539 
Producttransposase IS4 family protein 
Protein accessionYP_001537907 
Protein GI159038654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.99267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0276245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGAGA TTGCCATAGC CCGGACGGTC ACGGTAGCTG CCGGGTCGTT CGCGGCGGGT 
CACCTCGGTG AGCTGACCCG CCTCGTGCCG TTCGAGATGA TCGATGACGT GCTGGCTGCG
ACCAGGCGCA CACAGCGACG TGTCCGCCTG TTGCCGGCCC GGGTCGTGGT GTATCTGCTG
CTGGCCGGCT GCCTGTTCGC CGATTGCGGC TACCGGCAGG TGTGGGCGAA ACTCGTCGCC
GGTCTGCGCG GGCTGCCAGT CGCCGATCCC AGCGACAGCG CGTTGCGGCA GGCCCGGCAA
CGACTCGGCC CGGCCCCACT GCGGGCCTTG TTCGACCTGC TGCGCGGCCC CGCGGCCACC
AGTGCTGTCG CTGCGGTGCG ATGGCGGGGA CTGCTACCGG TCGTCGTCGA CGGCACCATG
ATCGCTGTCG CGGACTCGCC GGCCAACCTG GGTCGCTACG GCAAACACCG GTGCAACAAC
GGCGGCTCCG GCTACCCCAC ACTGCGGCTG AGCGCCCTGT TGACGTGCGG CACCCGCTCG
GTCATCGACG CCGTGTTCGA CCCGAGCACC ACCGGCGAGA TCACCCAAGC CCACCGGCTG
ACCCGCAGCC TGCGCGCCGG AATGCTGCTG CTGGCCGACC GCAACTACGC CGCCGCCGAC
CTAATCGGCG CGTTCACCGC CACCGGAGCG GACCTGCTGA TCCGCTGCAA GAGCGGCCGG
AAACTCCCGA TGACCCGCCG CTGTCGAGAC GGATCCTGGC TGTCGGTCAT CGACGGCCAG
CCGGTGAGGA TCATCGAGGC CCGGATCAGC ATCACCACCA CGGCCGGCAG CCACACCGGC
GACTACAGGC TCATCACCAC CCTGCTCGAC CCACGTCGCT ACCCCGCCGC CGACCTCGTC
CGCCTCTACC ACCAGCGGTG GGAAATCGAG ACCGCCTACC TGGAACTGAA GTCCACCATC
CTCGGCGGCC GGGTGCTGCG CGCCCGCACC CCCGACGGCG TCGACCAGGA GATCCACGCC
CTGCTCATCG TCTACCAGGT GCTGCGCACC GCCATGGTCG ACGCCACCGA CAGCCGGCCC
GGCCTCGACC CGGACCGGGC CAGCTTCACC ACCGCCCTGC ACGCCGCCCG CGACCAGATC
ACCCAGGCCG CCGGCATCAT CGCCGACACC GTCATCGACC TGGTCGGCGC CATCGGTGAA
CGCGTCCTGA CAGACCTGCT GCCCGACCGT CGCATCCGAT TCAAGGCCCG CATGATCAAA
CGCTCGAACT CCAGGTACCA GGCCCGCGGA CCCCGGATCG ACCGCCGAAC CTACAAGGCC
ACCACCAGCA TCGACGTCAT CACCAACGAC CCTTGA
 
Protein sequence
MDEIAIARTV TVAAGSFAAG HLGELTRLVP FEMIDDVLAA TRRTQRRVRL LPARVVVYLL 
LAGCLFADCG YRQVWAKLVA GLRGLPVADP SDSALRQARQ RLGPAPLRAL FDLLRGPAAT
SAVAAVRWRG LLPVVVDGTM IAVADSPANL GRYGKHRCNN GGSGYPTLRL SALLTCGTRS
VIDAVFDPST TGEITQAHRL TRSLRAGMLL LADRNYAAAD LIGAFTATGA DLLIRCKSGR
KLPMTRRCRD GSWLSVIDGQ PVRIIEARIS ITTTAGSHTG DYRLITTLLD PRRYPAADLV
RLYHQRWEIE TAYLELKSTI LGGRVLRART PDGVDQEIHA LLIVYQVLRT AMVDATDSRP
GLDPDRASFT TALHAARDQI TQAAGIIADT VIDLVGAIGE RVLTDLLPDR RIRFKARMIK
RSNSRYQARG PRIDRRTYKA TTSIDVITND P