Gene Sare_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1212 
Symbol 
ID5706514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1362311 
End bp1363543 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content67% 
IMG OID641270729 
Productputative phiRv2 prophage protein 
Protein accessionYP_001536110 
Protein GI159036857 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.131322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACC AACCGACCAA CACTGACCAG CCCCACGATG GCGCGGCAAT TCTCGACGTT 
CTGCACGCCT GCCTCACCAA ATACGTCATT CTCCCCAGCC CTGAGGCCGT CGACGCGGTG
GCGCTGTGGA TCGCCGCCAC CCACGCCCAA ATCGCGTGGG CGCACGCTCC CCGCCTAGTG
ATCCGGGCAC CAGAGAAGCG CTGCGGAAAG TCGCGGCTGC TCGACATCGT GGAAGGCACC
TGCCACGACC CGCTCATCAC CGTCAACGCC AGCCCCGCAG CCGTCTACCG GGCCATCGGT
ACCGGCCACC CACCCACGCT GCTGGTCGAT GAAGCCGACA CCCTTTTCGG CGGGAAGAAC
GCCGACGCAA ACGAAGACCT ACGCGGACTA CTCAACGCCG GACACCAACG CAACCGCCCC
GCCATCCGCT GGGACAACAA CACTCAAAGC TTGGAGAAGA TCCCCACCTT CGCCATGGCT
GCCCTCGCCG GAATCGGCGC CATGCCCGAC ACCATCGAAG ATCGCGCCGT GGTCATTCGC
ATGCGCCGCC GCGCACCCGG CGAAACCGTC GCACCATACC GACACAAGCG CGACGGCCCC
GCCCTACGCG CCGTCGCCCA GCAACTGGCC CAATGGCTAC ATACCAACCT CACCACGCTC
GAGGTCGCGG AGCCACCCAT GCCGGTCGAG GATCGGGCCG CCGACACCTG GGAACCCCTG
GTGGCTGTCG CCGACCTCGC CGGGGGCGCC TGGCCTCAAC GCGCCCGACA GGCGGTAGCC
ACGCTGACCG CCGAAGCCGA CGGATCGGGG AATGTCTCCC ATCGGGTACG CCTACTCGCC
GACATCCGCA CCGCCTTTAC CACCCTCGGC GACCCAACCG CCGCGCCCAC ATCGGATCTA
CTCGCCGCAC TCAACGGCGA CCCCGAGGCA CCCTGGGCCG ACAGCGGGCC CAACGGACTT
ACCGGCAAAA AGCTTGGCGA CCTGCTCCGT GAGTTCGACA TCCGCTCCGA GACGGTTCGC
TTCCCCGTCG GGCAGGCCAA GGGGTATACC CGCGACGCCT TTACCGACGC CTGGCAGCGC
TACTGCCCGA CATCCGAAAC CCCTTCCACC GAGGTATCCG TACCATCCGT ACCAACGTCA
TATCCGCAGG TCATCCCCGG TACGGATTAC ACCGCTGGTA CGGATCGATC CGTACCACAC
CAACCCCACG CCAGCGCCCC TGGTACGCAT TAA
 
Protein sequence
MTHQPTNTDQ PHDGAAILDV LHACLTKYVI LPSPEAVDAV ALWIAATHAQ IAWAHAPRLV 
IRAPEKRCGK SRLLDIVEGT CHDPLITVNA SPAAVYRAIG TGHPPTLLVD EADTLFGGKN
ADANEDLRGL LNAGHQRNRP AIRWDNNTQS LEKIPTFAMA ALAGIGAMPD TIEDRAVVIR
MRRRAPGETV APYRHKRDGP ALRAVAQQLA QWLHTNLTTL EVAEPPMPVE DRAADTWEPL
VAVADLAGGA WPQRARQAVA TLTAEADGSG NVSHRVRLLA DIRTAFTTLG DPTAAPTSDL
LAALNGDPEA PWADSGPNGL TGKKLGDLLR EFDIRSETVR FPVGQAKGYT RDAFTDAWQR
YCPTSETPST EVSVPSVPTS YPQVIPGTDY TAGTDRSVPH QPHASAPGTH