Gene Sare_4086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4086 
Symbol 
ID5704739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4644959 
End bp4646443 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content75% 
IMG OID641273512 
Producthypothetical protein 
Protein accessionYP_001538867 
Protein GI159039614 
COG category[S] Function unknown 
COG ID[COG5305] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00423288 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0380271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGA TGTCCATGCC ACGGCAGCTG CCGGTGTGGC TGCCGCCAGC CCTGCTGGCG 
CTCGCGGTAA CCCTCACCGG AGTGACCGGG GCACAGCTCT GGCGGGACGA GTTGGCGACC
TGGAGTGCCG CCACCCGCCC GGTCGGCGAC CTGGTCCGGC TCGCCGGCAC GATCGACGCC
GCGACCGGAC CGTACTACCT GTTCATGCAC GCCTGGGTGA CCGGAGTGGG CGATTCGGTG
GTCGCCCTGC GGCTACCGGC CGTGCTCGCC ATGACCGCGA CCGCGGCGCT GACCGCGGTG
CTCGGCGCAC GGCTGTACAG CCGGTCGGCC GGGCTGCTGG CCGGGCTGCT CTTCGCCCTG
CTTCCCAGCA CTTCCCGGTT TGGGCAGGAG GCCCGCCCGT ACGCGCTGGC CACCGCCCTC
GCTGTCCTGT CGACGCTGCT GCTGGTCACC GCGCTGGACC CGCCGCCCGG CAGCCGGCCG
GCACGACGTT GGGCCCGCTG GGCCGGTTAC GCCACCGCGC TGGCCGCGCT GGGCCTGACC
CACCTGGTCG CCCTCACCCT GCTTCCGGCG CACGCGGTGG TGGTCCTCGC CACCGCTCGG
GGGCAAGCTG GGCGACAGCA GGCCCTGGGC GGGAGCGGCG CGCACCGGAG CATCGTCCGA
CCCTGGCTGC TCTCGCTCGT TCCGGTGGTG CTGCTGGTCG GCCCGCTGGT AGTGGTCGCC
CACGGTCAGC GCGCCCGCCA GCTCGACTGG GTTGACGCCG CCCGTCCCGC CGACCTCGCC
GCCCTGGCCG GCGGGGTGAC GCAGAGCGGG GTTGTCGGTG GCCTGTTGGT CGGGCTCGCC
GCGCTCGCTG TCGCGGCGCT GGGACGGGCG GCGCTGCTGC CCGGGACGGC CGTGCTCCTA
CCGGTGCTGC TGGTCTTCAC CGTCGGCGCG CTGGTTCCCC TCTGGGTACC CCGATACCTG
GTCTTCGTCG TGCCGTTCGG GTGTCTGCTG GCCGGTGTCG CGCTGGCCGG GGTGCCGTTC
CTTCCAGCGC TGACCATCGT GGCCCTGGCT GGGGCGCTCG GCCTACCGGC CCAGGCCGCG
TTGCGGCGTA CCCACGAGTG GCCCCGCTCG GCACTGGTTG ACTACGCCGG GGCGGCCCGG
ATCGTGGCGG ACGGGCAGCG ACCCGTCGAC GCGATCGTCT ACTCGCCCCG AGACAGCTGG
CTCTTTCTCG ACCTGGGGAT GGCGTACCAC CTGGACGACC ACCGGCCCCG GGACGTCCTG
CTCACCGCCA GCCCGGCACG CCGGGGCGAC CTGTGGGCCA CCGAATGTGC CCGCCCGGCG
CAGTGCCTGG CCGGCGTGGA CCGAGTCTGG CTGGTGATGG CCGGCAGGCA CGGCGACCCG
CTCGCCGCCG TATCCGGCGC GAAGGGGGAC GCGCTACGGG CCGGGCGCAC GGTCGAGCAG
GTCTGGCACC CGCCCGGACT GACTGTCGCC CTGATCCGCC GGTAG
 
Protein sequence
MGAMSMPRQL PVWLPPALLA LAVTLTGVTG AQLWRDELAT WSAATRPVGD LVRLAGTIDA 
ATGPYYLFMH AWVTGVGDSV VALRLPAVLA MTATAALTAV LGARLYSRSA GLLAGLLFAL
LPSTSRFGQE ARPYALATAL AVLSTLLLVT ALDPPPGSRP ARRWARWAGY ATALAALGLT
HLVALTLLPA HAVVVLATAR GQAGRQQALG GSGAHRSIVR PWLLSLVPVV LLVGPLVVVA
HGQRARQLDW VDAARPADLA ALAGGVTQSG VVGGLLVGLA ALAVAALGRA ALLPGTAVLL
PVLLVFTVGA LVPLWVPRYL VFVVPFGCLL AGVALAGVPF LPALTIVALA GALGLPAQAA
LRRTHEWPRS ALVDYAGAAR IVADGQRPVD AIVYSPRDSW LFLDLGMAYH LDDHRPRDVL
LTASPARRGD LWATECARPA QCLAGVDRVW LVMAGRHGDP LAAVSGAKGD ALRAGRTVEQ
VWHPPGLTVA LIRR